Staff Software Engineer - SRE

Insulet

Guadalajara

On-site

USD 80,000 - 120,000

Full-time

Posted 30+ days ago

Job summary

An established industry player is seeking a Staff SRE to architect and maintain scalable systems in a rapidly growing environment. This role involves leading a team, driving best practices in reliability and automation, and collaborating with cross-functional teams to ensure high availability. The ideal candidate will have extensive experience in cloud computing, programming, and infrastructure management. Join a dynamic organization focused on improving lives through innovative technology, where your contributions will have a significant impact on global health solutions.

Qualifications

  • 8+ years of experience in Site Reliability Engineering or DevOps.
  • Strong understanding of cloud computing platforms and container orchestration.

Responsibilities

  • Lead a team of SRE engineers, driving best practices and automation.
  • Collaborate with teams to design scalable and resilient systems.
  • Implement monitoring solutions to proactively identify issues.

Skills

Python
Java
Go
AWS
Azure
GCP
Kubernetes
Terraform
Ansible
Problem-Solving

Education

Bachelor’s in Computer Science, Engineering, or a related field

Tools

AWS Services
CloudWatch
VPC
EC2
ECS
Lambda

Job description

Insulet started in 2000 with an idea and a mission to enable our customers to enjoy simplicity, freedom and healthier lives through the use of our Omnipod product platform. In the last two decades we have improved the lives of hundreds of thousands of patients by using innovative technology that is wearable, waterproof, and lifestyle accommodating.

We are looking for highly motivated, performance-driven individuals to be a part of our expanding team. We hire amazing people who are guided by shared values and who exceed customer expectations. Our continued success depends on it!

Company Overview

Insulet started in 2000 driven to achieve our mission of enabling our customers to enjoy simplicity, freedom and healthier lives through the use of our Omnipod product platform. In the last two decades we have improved the lives of hundreds of thousands of patients who have insulin-requiring diabetes, by using innovative technology that is wearable, waterproof, and lifestyle accommodating. We are on an exciting trajectory of significant growth and global expansion enabling us to reach more patients around the globe.

We are looking for highly motivated, performance-driven individuals who want to be part of building our Center of Excellence and be at the forefront of our rapidly growing global footprint. We are looking to hire amazing people who are guided by shared values and a desire to exceed customer expectations. Our continued success depends on it.

Position Overview

As a Staff Site Reliability Engineer (SRE) at Insulet, you will play a critical role in architecting, implementing, and maintaining highly available and scalable infrastructure and systems. You will lead a team of SREs, driving best practices, developing a culture of automation, and ensuring the reliability of our services. This role requires a hands-on approach to solving complex technical challenges while providing technical leadership to the team.

Responsibilities
  1. Provide technical guidance and mentorship to the SRE team.
  2. Drive the implementation of best practices in reliability, scalability, and performance.
  3. Lead by example, demonstrating excellence in technical skills and problem-solving.
  4. Collaborate with cross-functional teams to design scalable, resilient, and efficient systems.
  5. Architect and implement infrastructure solutions that meet the requirements of high availability and performance.
  6. Drive the adoption of modern technologies and tools to improve system reliability and efficiency.
  7. Develop and maintain automation tools for provisioning, deployment, and monitoring.
  8. Automate routine tasks to improve operational efficiency and reduce manual intervention.
  9. Design and implement monitoring solutions to proactively identify issues and prevent service disruptions.
  10. Lead incident response efforts, conduct post-mortem analysis, and implement measures to prevent recurrence.
  11. Develop and automate runbooks and playbooks to streamline incident resolution processes.
  12. Conduct capacity planning exercises to ensure systems can handle current and future loads.
  13. Identify performance bottlenecks and optimize system performance through tuning.
  14. Collaborate with development teams to design and implement scalable architectures.
  15. Document system architectures, configurations, and procedures.
  16. Promote knowledge sharing within the team through technical presentations, workshops, and documentation.

Required Skills and Competencies
  1. Proven experience architecting and managing highly available, scalable, and fault-tolerant systems.
  2. Proven experience with programming languages such as Python, Java, Go, or similar.
  3. Strong understanding of cloud computing platforms (e.g., AWS, Azure, GCP) and container orchestration technologies (e.g., Kubernetes).
  4. In-depth knowledge of AWS services, including VPC, Lambda, IAM, ELB, EC2, ECS, CloudWatch, API Gateway, S3, SQS, SNS, WAF, X-Ray, and Route 53, or of GCP services, including VPC, Cloud Functions, IAM, Cloud Load Balancing, Compute Engine, Google Kubernetes Engine (GKE), Stackdriver, API Gateway, Cloud Storage, Pub/Sub, Firebase Cloud Messaging, Cloud Armor, Cloud Trace, and Cloud DNS.
  5. Experience with infrastructure as code tools such as Terraform, Ansible, or similar.
  6. Excellent troubleshooting and problem-solving skills.
  7. Strong communication and leadership skills, with the ability to collaborate effectively with cross-functional teams.

Preferred Skills and Competencies
  1. Cloud Computing Platforms: Strong understanding of platforms like AWS, Azure, and GCP.
  2. Infrastructure as Code: Experience with tools such as Terraform, Ansible, or similar.
  3. Troubleshooting and Problem-Solving: Excellent skills in these areas.
  4. Mentoring: Experience leading and mentoring engineering teams is highly desirable.

Education and Experience
  1. Bachelor’s degree in Computer Science, Engineering, or a related field.
  2. 8+ years of experience in the field, including 4+ years in Site Reliability Engineering, DevOps, or a similar role.