Senior Site Reliability Engineer (Remote) at Teaching Strategies #vacancy #remote

Be a Part of our Team! Join a working family that is dedicated to the mission of the work we do! Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early childhood education market, we build dynamic, top-quality digital products that integrate all of the essential elements of a high-quality solution: curriculum, assessment, professional development, and family engagement. We are building a team of results-oriented individuals who will thrive in a collaborative, work-hard/play-hard culture. We pride ourselves on the impact we have on the early childhood field through supporting teachers who are doing the most important work there is, teaching children to become creative, confident thinkers. Position Overview Teaching Strategies is looking for a highly talented, innovative, creative, and experienced SRE to join our Infrastructure Engineering group. The Senior SRE Engineer will be an integral member of our Technology department and will work closely with the development, QA and Core Platform Engineering (CPE) teams. The role of Senior SRE Engineer is a hands-on technical role and requires a thorough understanding of all components of a modern web application stack, including front-end, database, networking, and systems engineering. The ideal candidate will bring forward innovative and creative ideas around performance, security, systems design, Infrastructure-as-Code and automation, and CI/CD in order to help us maintain scalable and performant technology solutions for our products. And help advance us towards our vision of a platform provisioned and managed by Infrastructure-as-Code with automations and tooling that empowers and accelerates our engineering teams. Specific Roles & Responsibilities: Passion for reliability and performance, you will own uptime and support all customer-facing services and products Own and drive improvements to observability of service performance metrics, monitors, and alerting Provision, manage and automate our SaaS platform across multiple production and test environments Support and enhance build and release pipelines using process and tooling to provide self-service automations Collaborate with development teams on software and platform, helping to identify and remove potential performance bottlenecks Help our engineering partners establish SLIs and SLOs for their services Participate in the on-call rotation with the team Resolve incidents, perform root cause analysis, and grow our library of runbooks Implement and automate security controls, governance processes, and compliance validation Actively participate in and drive infrastructure architecture decisions Mentor junior members of the team Occasional domestic travel required for in-person team, department, and company meetings Qualifications: Minimum of 8 years experience in a 24x7x365 SaaS production environment Build automation and release management experience Hands-on experience with Linux and system administration and engineering Comfortable in a containerized world of Kubernetes (EKS), helm, and ArgoCD Proficiency with configuration management tools such as Ansible, Chef, Salt Production experience in operations for an always-up, always-available mission-critical service Strong knowledge of ephemeral infrastructure, horizontal scaling, self-healing architectures, service discovery, logging, monitoring and alerting Expert level experience with AWS and hybrid cloud systems/designs Proficiency with IaC tools such as Terraform and AWS CloudFormation Expert understanding and ability to troubleshoot systems at the protocol layer – TCP/IP, UDP, SSL/TLS, and DNS Proficient with multiple scripting languages such as Bash, Python, or Go Experience developing CI/CD pipelines using Jenkins or BitBucket Pipelines Knowledge of best-practice security, performance, and networking techniques for high-traffic customer-facing systems Experience with monitoring and logging tools such as New Relic or AWS CloudWatch Experience with relational and NoSQL databases, including Microsoft SQL, Postgres, and MongoDB Strong bias for security posture; experience with PCI compliance is a plus Excellent troubleshooting and testing skills A passion for learning new technologies Experience with Agile methodology and passion for software development best practices Strong sense of collaboration, teamwork, and accountability Bonus: Experience working for a B2C SaaS company Why Teaching Strategies At Teaching Strategies, our solutions and services are only as strong as the teams that create them. By bringing passion, dedication, and creativity to your job every day, there’s no telling what you can do and where you can go! We provide a competitive compensation and benefits package, flexible work schedules, opportunities to engage with co-workers, access to career advancement and professional development opportunities, and the chance to make a difference in the communities we serve. Let’s open the door to your career at Teaching Strategies! Some additional benefits & perks while working with Teaching Strategies Teaching Strategies offers our employees a robust suite of benefits and other perks which include: Competitive compensation package, including Employee Equity Appreciation Program Health insurance benefits 401k with employer match 100% remote work environment Unlimited paid time off (which includes paid holidays and Winter Break) Paid parental leave Tuition assistance and Professional development and growth opportunities 100% paid life, short and long term disability insurance Pre-tax medical and dependent care flexible spending accounts (FSA) Voluntary life and critical illness insurance Teaching Strategies, LLC is committed to creating a diverse workplace and is proud to be an equal opportunity employer of Minorities, all Genders, Protected Veterans, and Individuals with Disabilities. #J-18808-Ljbffr

PostgreSQL Agile Terraform Amazon Web Services (AWS) Linux argocd TCP/IP DNS pci salt Go CI/CD newrelic Python Quality Assurance (QA) MongoDB Microsoft SQL Server Chef Infra UDP amazon-cloudwatch amazon-cloudformation amazon-eks Kubernetes Bash Jenkins Site Reliability Engineering (SRE) Ansible Bitbucket

Залишити відповідь