Resp & Qualifications
PURPOSE: This is a Big Data/Cloudera Administrator Lead position and not a developer position and need CDP7 experience. The Lead Data Engineer is responsible for orchestrating, deploying, maintaining and scaling cloud OR on-premise infrastructure targeting big data and platform data management (e.g., data warehouses, data lakes) including data access APIs. Prepares and manipulates data using Hadoop or equivalent.) with emphasis on high availability, reliability, automation and performance. This role will focus on leading the migration and set up of the Enterprise Data Platform on Cloud using a combination of Cloudera CDP public cloud and other AWS services. ESSENTIAL FUNCTIONS: Represents team in all architectural and design discussions. Knowledgeable in the end-to-end process and able to act as an SME providing credible feedback and input in all impacted areas. Require project tracking and task monitoring. the lead position ensures an overall successful implementation especially where team members all are working on multiple efforts at the same time. Lead the team to design, configure, implement, monitor, and manage all aspects of Data Integration Framework. Defines and develop the Data Integration best practices for the data management environment of optimal performance and reliability. Plan, develop and lead administrators with project and efforts, achieve milestones and objectives. Oversees the delivery of engineering data initiatives and projects including hands on with install, configure, automation script, and deploy. Design and maintains infrastructure systems (e.g., data warehouses, data lakes) including data access APIs. Prepares and manipulates data using Hadoop or equivalent MapReduce platform. Provides detailed guidance and performs work related to Modeling Data Warehouse solutions in the cloud OR on-premise. Understands Dimensional Modeling, De-normalized Data Structures, OLAP, and Data Warehousing concepts. Oversees the delivery of engineering data initiatives and projects. Manages customer and stakeholder needs, generates and develops requirements, and performs functional analysis. Fulfills business objectives by collaborating with network staff to ensure reliable software and systems. Enforces the implementation of best practices for data auditing, scalability, reliability, high availability and application performance. Develop and apply data extraction, transformation and loading techniques in order to connect large data sets from a variety of sources. Enforces the implementation of best practices for data auditing, scalability, reliability and application performance. Develop and apply data extraction, transformation and loading techniques in order to connect large data sets from a variety of sources. Acts as a mentor for junior and senior team members. Interprets data, analyzes results using statistical techniques, and provides ongoing reports. Executes quantitative analyses that translate data into actionable insights. Provides analytical and data-driven decision-making support for key projects. Designs, manages, and conducts quality control procedures for data sets using data from multiple systems. Installs, tunes, upgrades, troubleshoots, and maintains all computer systems relevant to the supported applications including all necessary tasks to perform operating system administration, user account management, disaster recovery strategy and networking configuration. Improves and expands data delivery engineering job knowledge and leading technologies by attending educational workshops; reviewing professional publications; establishing personal networks; benchmarking state-of-the-art practices; participating in professional societies. Advanced (expert preferred) level experience in administrating and engineering relational databases (ex. MySQL, PostgreSQL), Big Data systems (ex. Cloudera Data Platform Private Cloud and Public Cloud), Apache Solr as SME, ETL (ex. Ab Initio), BI (ex. MicroStrategy), automation tools (ex. Ansible, Terraform, Bit Bucket) and experience working cloud solutions (specifically data products on AWS) are necessary. At least 8 years of Experienced with all the tasks involved in administration of big data and Meta Data Hubsuch as Cloudera. Experience with Ab Initio, EMR, S3, Dynamo DB, Mongo DB, ProgreSQL, RDS, DB2 is a Plus. DevOps (CI/CD Pipeline) is a Plus. Experience with Advance knowledge of UNIX and SQL. Experience with manage metadata hub-MDH, Operational Console and troubleshoot environmental issues which affect these components. Require prior experience with migration from on-premise to AWS Cloud. Represents team in all architectural and design discussions. Knowledgeable in the end-to-end process and able to act as an SME providing credible feedback and input in all impacted areas. Require tracking and monitoring projects and tasks as the lead. SUPERVISORY RESPONSIBILITY: Position does not have direct reports but is expected to assist in guiding and mentoring less experienced staff. May lead a team of matrixed resources. QUALIFICATIONS: Education Level: Bachelor’s Degree in Computer Science, Information Technology or Engineering or related field OR in lieu of a Bachelor’s degree, an additional 4 years of relevant work experience is required in addition to the required work experience. Experience: 8 years Experience in leading data engineering and cross functional team to implement scalable and fine tuned ETL/ELT solutions for optimal performance. Experience developing and updating ETL/ELT scripts. Hands-on experience with application development, relational database layout, development, data modeling. Knowledge, Skills and Abilities (KSAs) Knowledge and understanding of at least one programming language (i.e., SQL, NoSQL, Python). Knowledge and understanding of database design and implementation concepts. Knowledge and understanding of data exchange formats. Knowledge and understanding of data movement concepts. Strong technical and analytical and problem solving skills to troubleshoot to solve a variety of problems. Requires strong organizational and communication skills, written and verbal, with the ability to handle multiple priorities. Able to effectively provide direction to and lead technical teams. Salary Range: $108,936 – $216,359 Salary Range Disclaimer The disclosed range estimate has not been adjusted for the applicable geographic differential associated with the location at which the work is being performed. This compensation range is specific and considers factors such as (but not limited to) the scope and responsibilites of the position, the candidate’s work experience, education/training, internal peer equity, and market and business consideration. It is not typical for an individual to be hired at the top of the range, as compensation decisions depend on each case’s facts and circumstances, including but not limited to experience, internal equity, and location. In addition to your compensation, CareFirst offers a comprehensive benefits package, various incentive programs/plans, and 401k contribution programs/plans (all benefits/incentives are subject to eligibility requirements). Department Informatics Database Administration Equal Employment Opportunity CareFirst BlueCross BlueShield is an Equal Opportunity (EEO) employer. It is the policy of the Company to provide equal employment opportunities to all qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, protected veteran or disabled status, or genetic information. Where To Apply Please visit our website to apply: Federal Disc/Physical Demand Note: The incumbent is required to immediately disclose any debarment, exclusion, or other event that makes him/her ineligible to perform work directly or indirectly on Federal health care programs. PHYSICAL DEMANDS: The associate is primarily seated while performing the duties of the position. Occasional walking or standing is required. The hands are regularly used to write, type, key and handle or feel small controls and objects. The associate must frequently talk and hear. Weights up to 25 pounds are occasionally lifted. Sponsorship in US Must be eligible to work in the U.S. without Sponsorship. #LI-KT1 REQNUMBER: 19664
PostgreSQL Terraform Amazon Web Services (AWS) Amazon EMR amazon-dynamodb microstrategy Hadoop MySQL cloudera-cdp Apache Solr Business Intelligence (BI) Unix CI/CD OLAP amazon-s3 Python data-warehouse MongoDB dimensional-modeling Ab Initio Big data DevOps SQL ETL DB2 NoSQL amazon-rds Ansible