Site Reliability Engineer (Senior/Lead) ID35136
Join to apply for the Site Reliability Engineer (Senior/Lead) ID35136 role at AgileEngine.
AgileEngine is one of the Inc. 5000 fastest-growing companies in the US and a top-3 ranked dev shop according to Clutch. We create award-winning custom software solutions that help companies across 15+ industries change the lives of millions.
If you enjoy a challenging environment working with top talent and are encouraged to learn and experiment daily, this is the place for you!
Responsibilities
- Lead and mentor a team of SREs across India, LATAM, and Europe.
- Drive production support practices and lead incident resolution efforts.
- Act as the technical point of contact for cross-regional alignment and team cohesion.
- Oversee infrastructure as code and automation efforts using Terraform.
- Design and improve CI/CD pipelines in GitLab to support efficient delivery.
- Manage containerized environments with Docker and Kubernetes.
- Own observability strategy using tools like New Relic, Wavefront, or similar.
- Coordinate VPN tunneling and SFTP setup to support EHR integrations.
- Provide technical guidance on customer interactions for issue resolution (2–3/month).
- Support training and documentation on internal tools and EHR-related tasks.
- Ensure infrastructure compliance with HIPAA, GDPR, and iRhythm security protocols.
- Collaborate with DevOps, IT Security, and Engineering stakeholders.
- Contribute to migrating from legacy tools (e.g., Bamboo, Codefresh, Bitbucket) to GitLab.
Must Haves
- 6+ years of experience in SRE, DevOps, or infrastructure roles.
- Experience leading distributed engineering or support teams.
- Deep knowledge of AWS, Terraform, GitLab, Kubernetes, Docker.
- Experience with VPN tunnels, SFTP setup, and cloud networking.
- Strong incident management and observability skills.
- Excellent communication skills and ability to coordinate across time zones.
- Familiarity with compliance-heavy environments (HIPAA, GDPR).
- Ability to mentor junior team members.
- Experience driving process improvement and automation.
- Work a Panama schedule with 8-hour shifts and participate in on-call duties 2–3 days/week, including every other weekend.
- Upper-Intermediate English level.
Nice to Haves
- Experience with Mirth and IRIS EHR systems.
- Familiarity with Bitbucket, Codefresh, and Ansible.
- Experience in healthcare or medical device environments.
- Understanding of high-availability infrastructure patterns.
Benefits
- Professional growth through mentorship, TechTalks, and growth roadmaps.
- Competitive USD-based compensation and budgets for education, fitness, and team activities.
- Exciting projects with modern solutions for top-tier clients including Fortune 500 companies.
- Flexible schedule with options for remote work and office presence.
Your application process continues via email and registration on our Applicant Site. Incomplete registration will terminate your application.