Data Engineer II (REMOTE-USA) at Penn Foster #vacancy #remote

COMPENSATION: $97,000-$115,000 per year. You are eligible to a Short-Term Incentive Plan with the target at 7.5% of your annual earnings, terms and conditions apply.

JOB OVERVIEW: The main responsibility of this role is to collaborate with stakeholders across the organization to design methodologies and tools that leverage the vast amount of multimodal available to guide clinical, pharmaceutical, commercial, and business decisions. This position is responsible for creating new clinical data warehousing solutions, data transformations, and data integration assets, as well as supporting changes, enhancements, and maintenance of existing assets in support of clinical data initiatives The role includes building infrastructure to support query functionality for our databases, complex querying of databases, and to perform advanced data interpretation. This role may entail reporting, online analytical processing, analytics, data mining, business performance management, benchmarking, text mining, and predictive analytics. This role collaborates with groups across the organization including Commercial, Clinical, and Lab personnel. RESPONSIBILITIES: Develop and implement new methods, protocols, and algorithms for data queries, management, and governance. Collaborate with statisticians and machine learning specialists to support Advanced Analytics, Application Delivery, Clinical, Commercial, R&D, and Lab teams with data access and tools for research and analysis. Serve as a subject matter expert to support data interrogation, database consistency, and mapping for stakeholders’ needs, including business partnerships and data integration. Work closely with cross-functional teams, including healthcare professionals, data architects, and IT specialists, to develop robust data pipelines, implement data quality controls, and generate insights to support clinical decision-making. Utilize cloud-based solutions, particularly on AWS, for datalake expertise and manage ETL processes to structure clinical data for analysis. Contribute to the growth of the Data Warehouse architecture by creating and implementing custom clinical data models and ETL processes. Coordinate data, such as EHR, integration projects with system architects, DBAs, and vendors Deploy data warehouse content using approved AWS systems and processes. Perform comprehensive analysis of clinical data using statistical techniques and data mining methodologies. Troubleshoot and analyze ETL process failures, data anomalies, and other data warehouse issues, recommending improvements as needed. Create and maintain accurate metadata models for custom warehouse data structures. Provide technical expertise to support end users and offer business logic for data pipeline transformations. Respond to and track issues using Jira Service Desk, gathering additional information from customers and resolving or escalating as needed. Manage projects, create timelines, identify risks and milestones, and provide status reporting to stakeholders. Support data warehouse developers, analysts, and users to validate data and ensure data warehouse validity. Utilize approved development tools to identify data quality and relationships for efficient reporting solutions. EDUCATION: PhD or a Master’s degree in Data Science, Computer Science, or related field or an equivalent combination of education and applicable job experience. Familiarity with data and applied computer science is critical Familiarity with molecular biology, genomics, clinical fields, and/or bioinformatics is preferred TECHNICAL COMPETENCIES: Proficiency in at least one programming language (e.g., Perl, Python,). Expertise in SQL with familiarity in NoSQL query languages for complex data transformations and optimizations. Knowledge of cloud computing platforms, particularly for data engineering and analytics (e.g., AWS, Azure, Google Cloud). Experience in designing and developing cloud-based data pipelines and ETL processes. Familiarity with big data technologies (e.g., Hadoop, Spark) for processing and analyzing large-scale genomics data. Strong understanding of data warehousing architecture and dimensional modeling concepts. Proficiency in using version control systems (e.g., Git) for managing code and configurations. Proven track record of implementing scalable and efficient solutions for genomics data storage and processing in the cloud. Familiarity with containerization and orchestration technologies (e.g., Docker, Kubernetes) for deployment and scalability. Knowledge of genomics data formats and standards (e.g., VCF, BAM, FASTQ). Experience with clinical data ontologies (e.g., SNOMED, ICD-10, HPO) or medical data models (OMOP). Strongly preferred experience with EHR integrations, either HL7 or FHIR based. Understanding of data governance principles and best practices for ensuring data integrity, security, and compliance. Familiarity with genomics research workflows and data analysis pipelines. Knowledge of data privacy and compliance regulations specific to genomics data (e.g., HIPAA, GDPR). Strong analytical and problem-solving abilities for addressing complex data engineering challenges. Excellent oral and written communication skills for effective collaboration and documentation. Ability to accurately document technical artifacts and maintain comprehensive documentation for ongoing production support. Project management skills with a keen attention to detail and ability to handle multiple tasks simultaneously. Ability to collaborate effectively with cross-functional teams, including bioinformaticians, data scientists, and domain experts. EXPERIENCE: At least 3 years research and development experience in fields related to clinical diagnostics Minimum 3-5 years experience in computer programing or related field Minimum of 2 years experience in data analytics and computer programming Experience in a genetics laboratory/basic genetics setting a plus About Us: Ambry Genetics Corporation is a CAP-accredited and CLIA-licensed molecular genetics laboratory based in Aliso Viejo, California. We are a genetics-based healthcare company that is dedicated to open scientific exchange so we can work together to understand and treat all human disease faster. At Ambry, everyone is welcome. A career at Ambry Genetics is a chance to be part of a dynamic company that aims to improve health by understanding the relationships between genetics and human disease. We earned our reputation as industry leaders by responsibly introducing cutting-edge genetic testing solutions and continually sharing what we learn with the global scientific community. At Ambry you will be learning, challenging yourself, and having fun while collaborating with teammates through the open exchange of ideas. Our outstanding benefits program includes medical, dental, vision, 401k with a 4% employer match, FSA, paid sick leave and generous paid time off (PTO) program. You can learn more about the benefits here. Ambry Genetics is an Equal Opportunity Employer (EOE) and we maintain a drug-free work environment. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. All qualified applicants will receive consideration for employment without regard to race (and traits historically associated with race, including, but not limited to hair texture and protective hairstyles such as braids, locks, and twists), color, creed, religion, sex, sexual orientation, gender identity, gender expression (including transgender status), national origin, ancestry, age, marital status or protected veteran status and will not be discriminated against on the basis of disability, protected medical condition as defined by applicable state or local law, genetic information, or any other characteristic protected by applicable federal, state, or local laws and ordinances. If you have a disability or special need that requires accommodation, please contact us at Ambry does not accept unsolicited resumes from individual recruiters, third party recruiting agencies, outside recruiters or firms without an executed contract in place. We are not responsible for any fees related to resumes that are unsolicited or are received by Ambry. Such resumes will be deemed the sole property of Ambry and will be processed accordingly. PRIVACY NOTICES To review Ambry’s Privacy Notice, Click here: To review the California privacy notice, click here: California Privacy Notice | Ambry Genetics To review the UKG privacy notice, click here: California Privacy Notice | UKG #LI-REMOTE #LI-NK1 #J-18808-Ljbffr

Git Apache Spark Amazon Web Services (AWS) Data Engineering hl7 HIPAA fastq hl7-fhir bam Docker SQL vcf-vcard Kubernetes NoSQL Hadoop GDPR

Leave a Reply