Must have:
- Have relevant industrial experience and Bachelor diploma in Computer Science or related engineering field
- Have experience in a Site or Network Reliability, Software Engineering role working with a large-scale distributed system
- Have extensive experience with continuous integration / continuous deployment tools such as Jenkins, Github Actions, or similar
- Have advanced proficiency with configuration management tools such as Ansible, Salt Stack, Chef, Puppet, or similar
- Have experience with Linux networking at the userspace level and open source networking software
- Have a comprehensive understanding of networking such as TCP/IP, OSI model, network acls, routing protocol and routing policy
- Have experience developing applications, contributing to codebases, and writing scripts using languages such as Python, Bash, Go, Rust, or similar
- Have experience with one or more observability tools such as Prometheus, Grafana, Loki, ELK
Our team designs, develops, and manages applications and infrastructure that support Akamai’s Compute products and services. We specialize in building and maintaining fast, efficient, scalable, and reliable routing software. As well as maintaining infrastructure that is responsible for all network delivery to Akamai Connected Cloud customers.
As a Network Reliability Engineer, you will collaborate across software development, operations teams and network infrastructure teams. You’ll be developing organizational standards and tooling to ensure the growth and stability of our global platform.
,[Partnering across teams to ensure the reliability, scalability and usability of our products and services, Defining requirements as part of the product lifecycle to influence new designs and standards, Developing automation pipelines to support development, testing, and deployment workflows, Defining SLOs for our compute infrastructure to ensure high availability and performance, Working with Dev and Quality Assurance teams to create more robust solutions, code improvement and stability, Collaborating with our support, operations, and engineering teams to investigate and troubleshoot complex network-related problems] Requirements: Ansible, CI/CD, Python, Networking, Linux, Docker Tools: Jira, GIT, Agile. Additionally: Private healthcare, Small teams, International projects, Home Office Budget, Free coffee, Gym, Canteen, Bike parking, Playroom, Shower, Free parking, In-house trainings, In-house hack days, Modern office, Startup atmosphere, No dress code.
Agile Troubleshooting configuration-management puppet loki Computer Science Linux Networking TCP/IP Prometheus Docker Routing protocols Grafana Elastic Stack Rust salt-stack Git Go CI/CD Python Network Reliability Engineer Quality Assurance (QA) GitHub Actions Software Development Engineer Chef Infra Support Specialist high-availability DevOps Jira Bash Jenkins Ansible