Monitoring and Observability Analyst Sr. (Sat, Sun, Holidays)

Coderio | Software Company

Remote

MXN 50,000 - 70,000

Full-time

Posted today

Vacancy summary

A leading software firm is seeking a Monitoring and Observability Analyst to design and implement end-to-end monitoring solutions. The ideal candidate should have at least 3 years of experience in IT operations or SRE roles, with expertise in monitoring platforms like Prometheus, Grafana, and Datadog, and strong skills in scripting and analysis. This remote position requires a proactive approach and excellent communication skills, promoting an inclusive and collaborative work environment.

Benefits

100% remote work
Clear path to growth
Collaborative international team

Qualifications

  • Minimum 3 years of experience in Monitoring, IT Operations, or SRE roles.
  • Advanced experience with monitoring platforms.
  • Solid understanding of logs and distributed tracing.
  • Deep knowledge of Linux operating systems.

Responsibilities

  • Design and implement end-to-end monitoring solutions.
  • Contribute to the company's observability strategy.
  • Develop and maintain dashboards for real-time visibility.
  • Create and maintain technical documentation.

Skills

Monitoring IT Infrastructure
Scripting (Python, Bash)
Cloud Environments (AWS/Azure/GCP)
Proactive Mindset
Analytical Skills
DevOps Mindset
Strong Communication Skills

Education

Bachelor's degree in Systems Engineering

Tools

Prometheus/Grafana
ELK Stack
New Relic
Datadog
Docker
Kubernetes

Job description

About Us

Coderio designs and delivers scalable digital solutions for global businesses. With a strong technical foundation and a product mindset, our teams lead complex software projects from architecture to execution. We value autonomy, clear communication, and technical excellence. We work closely with international teams and partners, building technology that makes a difference.

🌍 Learn more: http://coderio.com

In this role, as a Monitoring and Observability Analyst

You will design, implement, and maintain proactive monitoring and alerting systems to ensure the availability, performance, and health of IT infrastructure, applications, and services. Your main focus will be on designing end-to-end monitoring solutions using metrics, logs, and traces, configuring business-impact-based alert thresholds (SLIs/SLOs), and supporting incident resolution by providing detailed monitoring data for Root Cause Analysis (RCA). You will work closely with Operations and Development (DevOps) teams to minimize MTTR (Mean Time to Recovery) and support the continuous improvement of the ecosystem.

The role of Monitoring and Observability Engineer/Analyst is critical to our operation and requires continuous coverage (24/7).

Because we support infrastructure in the United States, all shifts and holidays follow the U.S. time zone and holiday calendar.

Saturdays, Sundays, and any U.S. Holidays require 24‑hour coverage, which is divided into full 12‑hour shifts.

It is essential that you have the availability and willingness to work this shift pattern (evening/night and weekends/holidays) to ensure service continuity and compliance with our SLAs.

What To Expect In This Role (Responsibilities)
  • Contribute to the definition of the company's observability strategy, aligned with industry best practices (SRE/DevOps).
  • Design and implement end-to-end monitoring solutions.
  • Configure alert thresholds (SLIs/SLOs) based on business impact and minimize notification noise.
  • Develop and maintain informative and visually clear dashboards (e.g., Grafana, Kibana) for real‑time visibility.
  • Implement and optimize monitoring automation, from agent deployment to automatic alert response (basic to intermediate AIOps).
  • Administer and maintain monitoring platforms (updates, patches, cost optimization).
  • Create and maintain technical documentation (runbooks, monitoring procedures, service maps).

Requirements
  • Minimum 3 years of experience in Monitoring, IT Operations, or SRE roles.
  • Advanced experience with one or more monitoring platforms: Prometheus/Grafana, ELK Stack, New Relic, Datadog or similar.
  • Proficiency in monitoring cloud environments (AWS/Azure/GCP) and containers (Docker, Kubernetes).
  • Solid understanding of Logs (fluentd, Logstash, Loki) and Distributed Tracing (Jaeger, Zipkin, OpenTelemetry).
  • Practical experience in scripting languages (e.g., Python, Bash) for task automation and custom checker development.
  • Deep knowledge of Linux operating systems.
  • Strong ability to correlate events and data from multiple sources to identify the root cause of complex problems (Analysis Skill).
  • Ability to anticipate problems instead of just reacting to alerts (Proactivity Orientation).
  • Excellent oral and written communication skills.
  • Experience in a collaborative work environment with a DevOps mindset.
  • Bachelor's degree in Systems Engineering, Computer Science, or a related field.

Nice to Have
  • Certifications related to Cloud (AWS, Azure).
  • Certifications related to Observability Platforms (Datadog, Dynatrace).
  • Certifications related to DevOps/SRE practices.
  • Understanding of basic networking concepts (TCP/IP, DNS, Load Balancers).

Benefits

100% remote

Long‑term commitment, with autonomy and impact

Strategic and high‑visibility role in a modern engineering culture

Collaborative international team and strong technical leadership

Clear path to growth and leadership within Coderio

Why join Coderio?

At Coderio, we value talent regardless of location. We are a remote‑first company, passionate about technology, collaborative work, and fair compensation.

We offer an inclusive, challenging environment with real opportunities for growth. If you are motivated to build solutions with impact, we are waiting for you.

Apply now.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
