¡Activa las notificaciones laborales por email!

SRE/Observability

Plan A Technologies

Mérida

A distancia

USD 50,000 - 80,000

Jornada completa

Hace 7 días
Sé de los primeros/as/es en solicitar esta vacante

Mejora tus posibilidades de llegar a la entrevista

Elabora un currículum adaptado a la vacante para tener más posibilidades de triunfar.

Descripción de la vacante

Plan A Technologies is seeking an experienced SRE/Observability Engineer to enhance their observability platform. The role involves designing monitoring solutions and collaborating with teams to ensure system health. Candidates should have 5+ years of relevant experience and expertise in Grafana Cloud, Loki, and Prometheus. This position offers significant career growth opportunities and the flexibility to work remotely.

Servicios

Generous vacation
New laptop
Collaborative work environment

Formación

  • 5+ years in SRE, DevOps, or Observability roles.
  • Hands-on experience with Grafana Cloud, Loki, and Prometheus at scale.
  • Strong skills in LogQL and PromQL.

Responsabilidades

  • Design and manage monitoring, logging, and alerting solutions.
  • Develop LogQL queries for log aggregation and alerting.
  • Automate incident response processes and define SLIs, SLOs, and SLAs.

Conocimientos

Grafana Cloud
Loki
Prometheus
LogQL
PromQL
Kubernetes
Docker
Cloud platforms
Terraform
Ansible

Descripción del empleo

Job Overview

Plan A Technologies is looking for an experienced SRE/Observability Engineer with expertise in Grafana Cloud, Loki, and Prometheus to enhance the reliability, scalability, and performance of our observability platform. You will collaborate with DevOps, software engineers, and infrastructure teams to build and maintain monitoring, logging, and alerting solutions that ensure system health. This role offers significant career growth opportunities.

Please note: Candidates must have at least 5+ years of experience as an SRE and solid expertise with Grafana Cloud, Loki, and Prometheus.

Job Responsibilities
  1. Design, implement, and manage monitoring, logging, and alerting solutions using Grafana Cloud, Loki, and Prometheus.
  2. Develop and maintain LogQL queries for log aggregation, parsing, and alerting.
  3. Optimize Prometheus metrics collection, storage, and query performance.
  4. Automate incident response processes, including defining SLIs, SLOs, and SLAs.
  5. Collaborate with development teams to follow observability best practices in application and infrastructure design.
  6. Troubleshoot and resolve performance issues, log ingestion problems, and metric anomalies.
  7. Create dashboards in Grafana to visualize system health indicators.
  8. Implement scalable and resilient monitoring architectures in cloud or hybrid environments.
  9. Write and maintain Infrastructure-as-Code (IaC) for the observability stack.
Experience
  1. 5+ years in SRE, DevOps, or Observability roles.
  2. Hands-on experience with Grafana Cloud, Loki, and Prometheus at scale.
  3. Strong skills in LogQL for Loki log analysis.
  4. Deep knowledge of PromQL for metrics querying.
  5. Experience with Grafana dashboards, alerting, and integrations.
  6. Proficiency with Kubernetes, Docker, and cloud platforms (AWS, GCP, Azure).
  7. Experience with Terraform, Helm, or Ansible for automation.
  8. Understanding of SRE principles, SLIs, SLOs, and incident management.
  9. Knowledge of distributed systems, microservices, and networking.
  10. Excellent communication skills in English.
  11. Proactive attitude and drive for excellence.
Preferred Qualifications
  1. Experience with other monitoring tools like Elastic Stack, Datadog, or OpenTelemetry.
  2. Knowledge of CI/CD pipelines and GitOps practices.
  3. Kubernetes or Observability certifications.
About The Company & Benefits

Plan A Technologies is a US-based software development and technology advisory firm providing top-tier engineering talent globally. We handle custom development, staff augmentation, integrations, and upgrades. Our team is hands-on yet capable of managing major enterprise projects.

Learn more: www.PlanAtechnologies.com

Location: Remote (Work From Home) or visit our global offices.

Work Environment: Supportive engineers and project managers in a collaborative atmosphere.

Benefits: Generous vacation, new laptop, and other perks.

If this sounds like you, we'd love to hear from you!

Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.