Commercial experience 3+ years in data engineering. Strong programming skills in Python. Solid with distributed computing approaches, patterns, and technologies (PySpark). Experience working with any cloud platform (GCP, AWS, Azure) and its data-oriented components. Proficiency in SQL and query tuning. Understanding of data warehousing principles and modeling concepts (e.g., knowledge […]
Search Results for: parquet
Data Engineer - REMOTE at Essentia Health #vacancy #remote
Building Location: Peerless Building Department: 47670 Analytics & Architecture Job Description: The Data Engineer is responsible for developing data pipelines with a focus on scalability, maintainability, performance, and quality. Collaborates with stakeholders and peers across the organization to develop data driven solutions to improve patient care, outcomes, and optimize business […]
Senior Data Engineer, Finance Data Team (REMOTE) at ISTITUTO MARANGONI #vacancy #remote
Position Description Our Senior Data Engineer is a key member of the engineering staff working across the organization to provide a friction-less experience to our customers and maintain the exit highest standards of protection and availability. Our team thrives and succeeds in delivering high quality technology products and services in […]
Senior Backend Software Engineer, Data Pipelines (REMOTE - Palo Alto, CA) at Skyflow #vacancy #remote
Senior Backend Software Engineer, Data Pipelines (REMOTE – Palo Alto, CA) About Skyflow: Skyflow is a data privacy vault company built to radically simplify how companies isolate, protect, and govern their customers’ most sensitive data. With its global network of data privacy vaults, Skyflow is also a comprehensive solution for […]
Sr. Software Engineer II - Data Platform (Remote) at CrowdStrike Holdings, Inc. #vacancy #remote
#WeAreCrowdStrike and our mission is to stop breaches. As a global leader in cybersecurity, our team changed the game. Since our inception, our market leading cloud-native platform has offered unparalleled protection against the most sophisticated cyberattacks. We work on large scale distributed systems, processing over 1 trillion events a day […]
AWS Cloud Engineer Wisconsin Residents REMOTE at Beacon Hill #vacancy #remote
Top Skills (3) & Years of Experience: 3 years advanced hands-on experience designing AWS data lake solutions, integrating Redshift with other AWS services, such as DMS, Glue, Lambda, S3, Athena, Airflow, experience with Pyspark and Glue ETL scripting including functions like relationalize, performing joins and transforming dataframes with pyspark code […]
AWS Cloud Engineer Wisconsin Residents REMOTE at Beacon Hill #vacancy #remote
Top Skills (3) & Years of Experience: 3 years advanced hands-on experience designing AWS data lake solutions, integrating Redshift with other AWS services, such as DMS, Glue, Lambda, S3, Athena, Airflow, experience with Pyspark and Glue ETL scripting including functions like relationalize, performing joins and transforming dataframes with pyspark code […]
Senior Data Engineer, Finance Data Team (REMOTE) at GEICO #vacancy #remote
Position Description Our Senior Data Engineer is a key member of the engineering staff working across the organization to provide a friction-less experience to our customers and maintain the exit highest standards of protection and availability. Our team thrives and succeeds in delivering high quality technology products and services in […]
Remote Senior Data Engineer at Varwise #vacancy #remote
Minimum 8 years of relevant professional experience including Java/Scala and Spark Proficiency in all aspects of SDLC, from concept to running production systems Proficiency using Spark (PySpark) or Tensorflow Experience participating in ETL and ML pipeline projects based on Airflow, Kubeflow, Mleap, Sagemaker or similar AWS experience including Kafka, Lambda, […]
AWS Cloud Engineer Wisconsin Residents REMOTE at Beacon Hill #vacancy #remote
Top Skills (3) & Years of Experience: 3 years advanced hands-on experience designing AWS data lake solutions, integrating Redshift with other AWS services, such as DMS, Glue, Lambda, S3, Athena, Airflow, experience with Pyspark and Glue ETL scripting including functions like relationalize, performing joins and transforming dataframes with pyspark code […]