Vratislavia Software is looking for: Senior Data Engineer
Our client is a leader in software, automotive, software development, cybersecurity and ALM solutions.
As part of the cooperation, you will have the opportunity to take part in an international project for the medical industry.
Your responsibilities:
The data engineers are responsible for the data on-boarding and hence custom integration work. Therefore, a solid knowledge of AWS infrastructure is a must in order to build different connectors such as FTP, API or JDBC integrations. Making the data available in the data lake through AWS Glue, Amazon AppFlow and AWS Lake Formation are responsibilities as well as writing unit/data tests and monitoring the quality of the overall on-boarding process.
Our expectations:
– Experience building data pipelines in Python (experience with PySpark is a plus)
– Understanding on AWS Cloud fundamentals (AWS certification is advised)
– Solid knowledge in infrastructure as code -> cdk
– Git & CI/CD knowledge is a must
– Experience with common data Python libs (pandas, awswrangler,…)
– Understands REST APIs from the consumer perspective
– ML knowledge is a plus (kubeflow)
– Fluent English
What we offer?
– 100% remote work Employment based on a B2B contract
– Work in a stable company that values long-term cooperation
– Competitive remuneration package
– Flexible working hours
– Possibility of continuous learning and development
– Fast, two-stage recruitment process
PySpark Git SDK REST kubeflow CI/CD API pandas Python Amazon Web Services (AWS) JDBC Data Engineering Machine Learning aws-glue FTP