Site Reliability Engineer

DEUNA

Ciudad Madero

Remote

USD 60,000 - 80,000

Full-time

Posted 5 days ago

Job summary

A fast-growing startup in Ciudad Madero is seeking a mid-level Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of its AWS-based platform. The role involves designing observability and monitoring systems, defining SLIs, and collaborating with cross-functional teams to improve uptime and system resilience. The ideal candidate has expertise in observability tools and strong programming skills in Go. The position is remote and includes the benefits listed below.

Benefits

Vacations and additional PTO
Economic support for health insurance
Learning and development platform

Qualifications

  • Expertise with observability tools like Prometheus and Grafana.
  • Strong understanding of AWS services: ECS, Lambda, RDS.
  • Experience conducting failure drills to ensure resilience.

Responsibilities

  • Design and maintain observability and monitoring for AWS infrastructure.
  • Define and track SLIs, SLOs, and SLAs for critical systems.
  • Collaborate with technical teams on capacity planning and scaling.

Skills

Expertise with Prometheus
Experience designing dashboards
Strong proficiency in Go programming language
Excellent communication and collaboration skills

Tools

AWS CloudWatch
Grafana
OpenTelemetry

Job description

About DEUNA

DEUNA is a rapidly growing startup revolutionizing global commerce with ATHIA, our AI-powered orchestration and payments platform that helps large enterprises boost approval rates, reduce costs, and unlock new revenue. Built by the team behind DEUNA—the fastest-growing Commerce OS in Latin America—ATHIA combines payment intelligence, checkout optimization, and data orchestration in one powerful solution.

With deep integrations across 300+ PSPs and alternative payment methods, and over 20% of Mexico’s digital economy running through our platform, we simplify global payments through a single integration and centralized reconciliation.

We are expanding into the U.S. to meet the urgent needs of large retailers, marketplaces, airlines, and quick-service restaurants (QSRs). Join us to shape the future of payments!

Visit https://www.deuna.com/ to learn more about us!

Role Overview

As a mid-level SRE at DEUNA, you’ll ensure the reliability, scalability, and performance of our AWS-based platform by integrating observability, automation, and SRE best practices across the software lifecycle. You will work closely with development teams to improve uptime, provide observability tooling, and ensure we scale efficiently and securely.

Key Responsibilities

  • Design, define, and maintain observability and monitoring for our AWS infrastructure
  • Define and track SLIs, SLOs, and SLAs for critical systems
  • Improve system uptime, latency, and fault tolerance across the platform
  • Provide internal libraries and toolsets to developers for diagnostics and debugging
  • Manage scaling, performance, and resilience efforts related to system reliability
  • Collaborate with technical teams on capacity planning, load testing, and scaling policies
  • Improve production operations by defining and evolving deployment strategies and conducting disaster recovery (DR) testing


Technical Skills:

  • Expertise with Prometheus, Grafana, OpenTelemetry, AWS CloudWatch, or other observability tools
  • Experience designing dashboards, alerts, and log aggregation pipelines
  • Deep understanding of AWS services: ECS, Lambda, RDS, CodePipeline
  • Strong proficiency in Go programming language
  • Skilled at defining SLIs, SLOs, error budgets, and improving Mean Time to Recovery (MTTR)
  • Experience conducting failure drills (e.g., Chaos Monkey, Gremlin) to ensure system resilience


Soft Skills:

  • Excellent communication and collaboration skills
  • Adaptability to thrive in dynamic, fast-paced environments
  • Strong time management and task prioritization
  • Proficiency in English


What will you find when you join DEUNA?

  • A multicultural team distributed throughout LATAM
  • Dynamism, agility and constant innovation
  • Being part of a high-impact solution for an entire region
  • The best tools and technology to operate
  • Being part of the startup culture
  • A company that is rapidly expanding


Benefits:

  • Vacations and additional PTO
  • Remote work from anywhere
  • Economic support for health insurance, internet, and cell phone line
  • We all own DEUNA: we offer stock options
  • Learning and development platform
  • Multidisciplinary, diverse, and dynamic team
  • Growth and career path

Be part of a dynamic team that's creating the next-generation payments platform.

Join us at DEUNA!