Job Search and Career Advice Platform

¡Activa las notificaciones laborales por email!

Site Reliability Engineering Level 2 GCP and Azure

IBM Computing

Ciudad de México

Presencial

MXN 721,000 - 1,082,000

Jornada completa

Hace 21 días

Genera un currículum adaptado en cuestión de minutos

Consigue la entrevista y gana más. Más información

Descripción de la vacante

A leading tech firm is seeking a Site Reliability Engineer (SRE) to ensure the availability of applications on GCP and Azure platforms. The ideal candidate will support cloud solutions by ensuring reliability, monitoring live services, and optimizing existing infrastructure. Additionally, experience with observability tools and strong scripting abilities are essential. This role emphasizes teamwork with network and security teams, and requires excellent English communication skills. Join a diverse and equal-opportunity environment where your skills will have a significant impact.

Formación

  • Proficiency in GCP and Microsoft Azure required.
  • Strong scripting and automation capabilities essential.
  • Very good English communication level (minimum B2).

Responsabilidades

  • Ensure the reliability and uptime of cloud solutions.
  • Collaborate with network and security teams for secure operations.
  • Participate in incident response and postmortems.

Conocimientos

GCP proficiency
Microsoft Azure expertise
Observability tools (Grafana, Prometheus)
Scripting and automation
Microservice architecture familiarity
Azure/GCP PostgreSQL experience
Cloud storage solutions knowledge
Container registries experience
English communication (B2)
Descripción del empleo
Introduction

The SRE position is very important for our Client to ensure the availability of its applications in GCP and Azure cloud platforms.

The clients requires also disposition to provide planned on call support.

Your role and responsibilities

SRE Responsibilities include:

  • Ensure the reliability and uptime of cloud solutions and services, aligned with user needs
  • Support pre-launch activities including system design consulting, platform development, capacity planning, and launch reviews
  • Monitor and enhance live services by tracking availability, latency, and overall system health
  • Scale systems sustainably through automation and drive improvements in reliability and delivery velocity
  • Assess and optimize existing infrastructure within geoscience workflows
  • Collaborate with network and security teams to ensure secure and reliable application operations
  • Develop and document best practices for new projects and services
  • Leverage service management systems to share lessons learned and best practices across the technical community
  • Participate in incident response and conduct blameless postmortems
Required technical and professional expertise

Below are the Key technical skills required:

  • Proficiency in GCP and Microsoft Azure
  • Experience with observability tools such as Grafana, Prometheus, Thanos, Loki,
  • Knowledge of Google Stack driver/Azure monitoring
  • Azure CI/CD pipeline expertise
  • Strong scripting and automation capabilities
  • Familiarity with microservice architecture
  • Experience with Azure/GCP PostgreSQL
  • Experience with cloud storage such as Azure and Google storage solutions
  • Experience of container registries
  • Very good English communication level at least B2

IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.

Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.