Data Integration Engineer (SSIS/ETL/MongoDB) - 3654

MSI is seeking a Data Integration Engineer to support our government client in Bethesda, MD


Our government client has an opportunity for a Senior Data Integration Engineer with a focus on backend data flows design, development, and support. 
In this role, you will help migrate our ETL/ELT processes from the current system, using MS SQL Server Integration Services (SSIS) to load data into MS SQL Server, to a new system using Apache NiFi to load data into MongoDB.
Additional responsibilities will include development of new processes for loading and manipulating data in order to ensure the data sources are efficiently and effectively connected with the NIH Biomedical Translational Research Information System (BTRIS Data Repository).
On this program, our government client helps to develop and manage a large data warehouse with over 19 billion rows of data collected from 50 different sources nightly for NIH clinical research studies.  Their team follows a modified Agile methodology and is implementing continuous integration / continuous development (CI/CD).
This is a temp to perm position with our government client and will be based at the NIH customer site off Democracy Blvd in Bethesda, MD.  You can work a flexible schedule Monday through Friday around core business hours.
Responsibilities:
Design and develop NiFi workflows to replace SSIS packages
Collaborate with the Technical Manager to plan, design and analyze performance of, integration routines to address data requirements from new data sources
Provide guidance on strategies that minimize implementation risk and time and/or improve system reliability and performance
Review integration solutions developed for compliance with best practices, standards, enterprise architecture and documentation requirements, ETL process, programs and scripts
Conduct root cause analysis and resolve performance and production problems and data issues
Validate the data in the database and test the routines developed
Provide ongoing maintenance and support of assigned integration routines
Establish and enforce data warehousing standards at the client site to meet client requirements and business needs
Provide descriptive analyses of customer, product, and market trends and identifies and resolves data issues
Perform quality assurance testing of data integration and report development
Monitor data load operations to ensure accuracy
 

Required Qualifications:

  • Bachelor’s degree and 5+ years of experience in data integration and/or computer programming (or equivalent combination of education and experience)
  • Experience with Extract, Transform, and Load (ETL) processes, including document parsing techniques and managing large complex data sets
  • Experience with SSIS, or equivalent tool (such as Informatica, Talend, Pentaho, etc)
  • Experience with MS SQL Server stored procedures, functions, cursors, and dynamic SQL
  • Experience working with NoSQL Databases (preferably MongoDB)
  • Experience with scripting languages (Python preferred)  
  • Experience working with industry standards, regulations and guidelines in database warehousing and other relevant systems
  • Experience working with domain structures, user authentication, and digital signatures
  • Good understanding of JSON
  • Competencies:
  • Proactive, self-starter, who takes initiative on projects
  • Ability to articulate challenges, creatively solve problems and resolve ETL issues (should be able to give examples of complex projects)
  • Great attention to detail
  • Strong communication skills to interact with team members, clients, and support personnel
  • Ability to work independently and as part of a team

Desired:

  • Degree in information science, data management, computer science or related field preferred
  • 5+ years of experience in ETL/Data warehouse development.
  • Highly skilled in SQL and capable of working with database administrator to enhance query performance. 
  • Experience with Apache NiFi is preferred
  • Experience with multiple OS including UNIX, Linux, and Windows
  • Experience with advanced SQL query writing, data retrieval, and data mining from relational databases, such as Microsoft SQL Server
  • Solid experience in analyzing query performance issues and modifying data structures or application code to remedy performance problems
  • Excellent understanding of relational and dimensional data models
  • Experience with Netezza Data Warehouse appliances
  • Experience with data replication in a distributed environment
  • Experience with MS SQL Server Change Data Capture
  • Experience with Elasticsearch
  • Java programming experience
  • Experience working in an agile environment