Role Overview
As a Data Engineer, you’ll play a crucial role in designing and building data systems that provide real-time insights. You’ll work with cutting-edge technologies in big data and distributed computing, enabling data-driven decision-making and operational support. Collaborating with senior software engineers, you’ll design robust architectures and evaluate technology choices to solve complex problems.
Responsibilities
- Analyze raw data and organize it effectively.
- Develop and maintain data pipelines and datasets.
- Improve data quality and efficiency.
- Interpret trends and patterns from data.
- Prepare data for predictive and prescriptive modeling.
- Build algorithms and prototypes.
- Explore ways to enhance data reliability.
- Collaborate with data scientists and architects on various projects.
Requirements
- Experience: At least 5 years in production software engineering with a server-side programming language (Scala is a must-have; Java, Python, or Kotlin are nice-to-have).
- Technical Skills:
  - Event-driven architectures (experience with Kafka is a plus).
  - SQL and NoSQL databases (knowledge of HBase is beneficial).
  - Big data/distributed computing engines (e.g., Spark, Akka, ZIO).
  - Familiarity with at least one cloud computing platform (GCP, Azure, AWS).
  - Automated testing (unit, integration, regression, and non-functional testing, NFT).
  - CI/CD workflows and version control systems (Git).
  - Build tools (SBT, Maven, etc.).
  - Understanding of common algorithms, data structures, object-oriented design (OOD), design patterns, and SOLID principles.
- Soft Skills:
  - Strong analytical and problem-solving abilities.
  - Fluency in English.
  - Ownership and initiative.
  - Ability to work independently and as part of an Agile team.
  - Quick learner with effective collaboration skills.
What We Offer
- A challenging role in a unique environment.
- Competitive gross salary.
- Opportunities for personal development through training and education.
- Comprehensive fringe benefits, including discounts on airline tickets (after 6 months of service).