Expected, Python, SQL, Spark, Google Cloud Platform, GKE, Docker
Optional, Apache Airflow, Prefect, Dragster, Git
About the project, We run a variety of projects in which our sweepmasters can excel. Advanced Analytics, Data Platforms, Streaming Analytics Platforms, Machine Learning Models, Generative AI and more. We like working with top technologies and open-source solutions for Data & AI and ML/AI. In our portfolio, you can find Clients from many industries, e.g., media, e-commerce, retail, fintech, banking, and telcos, such as Truecaller, Spotify, ING, Acast, Volt, Play, and Allegro., , Data & AI projects that we run and the company’s philosophy of sharing knowledge and ideas in this field make GetInData | Part of Xebia not only a great place to work but also a place that provides you with a real opportunity to boost your career., , About role, , A Data Engineer’s role involves crafting, constructing, and upholding the structure, tools, and procedures essential for an organization to gather, store, modify, and scrutinize extensive data amounts. This position involves creating data platforms using typically provided infrastructure and establishing a clear path for Analytics Engineers who utilize the system.
Your responsibilities, Working alongside Platform Engineers to assess and choose suitable technologies and tools for the project, R&D, maintenance, and monitoring of the platform’s components, Implementing intricate data intake procedures, Constructing efficient data models, Implementing and executing policies aligned to the strategic plans of the company concerning used technologies, work organization, etc., Ensuring compliance with industry standards and regulations in terms of security and data privacy applied in the data processing layer, Providing training and fostering knowledge-sharing
Proficiency in a programming language like Python and SQL, Knowledge of the BigQuery DWH platform, Working with Spark messaging systems, Experience as a programmer and knowledge of software engineering, good principles, practices, and solutions, Familiarity with cloud Google Cloud Platform (GCP), Knowledge of at least one orchestration and scheduling tool, for example, Airflow, Prefect, Dragster, etc., Familiarity with DevOps area and tools – GKE, Docker, Experience with Version Control System, preferably GIT, Ability to actively participate/lead discussions with clients to identify and assess concrete and ambitious avenues for improvement
Division of working time, Remote work – 100%
This is how we work, in house, at the client’s site
This is how we work on a project, Continuous Deployment, Continuous Integration, DevOps
Development opportunities we offer, external training, technical knowledge exchange within the company
What we offer, Salary: 160 – 200 PLN net + VAT/h B2B (depending on knowledge and experience), 100% remote work, Flexible working hours, Possibility to work from the office located in the heart of Warsaw, Opportunity to learn and develop with the best Big Data experts, International projects, Possibility of conducting workshops and training, Certifications, Co-financing sport card, Co-financing health care, All equipment needed for work
Recruitment stages, HR Interview, Tech Interview, Manager Meeting
What else do we do besides working on projects?, We conduct many initiatives like Guilds and Labs and other knowledge-sharing initiatives. We build a community around Data & AI, thanks to our conference Big Data Technology Warsaw Summit, meetup Warsaw Data Tech Talks, Radio Data podcast, and DATA Pill newsletter.
GETINDATA POLAND sp. z o.o., GetInData | Part of Xebia is a leading data company working for international Clients, delivering innovative projects related to Data, AI, Cloud, Analytics, ML/LLM, and GenAI. The company was founded in 2014 by data engineers and today brings together 120 Data & AI experts. Our Clients are both fast-growing scaleups and large corporations that are industry leaders. In 2022, we joined forces with Xebia Group to broaden our horizons and bring new international opportunities.
This is how we work,
Google Kubernetes Engine (GKE) Git DevOps Google Cloud Platform (GCP) Docker SQL Python Airflow Apache Spark Data Engineering Prefect