¡Activa las notificaciones laborales por email!

Site Reliability Engineering Level 2 GCP and Azure

IBM Computing

Ciudad de México

Presencial

MXN 721,000 - 1,082,000

Jornada completa

Hace 21 días

Genera un currículum adaptado en cuestión de minutos

Consigue la entrevista y gana más. Más información

Descripción de la vacante

A leading tech firm is seeking a Site Reliability Engineer (SRE) to ensure the availability of applications on GCP and Azure platforms. The ideal candidate will support cloud solutions by ensuring reliability, monitoring live services, and optimizing existing infrastructure. Additionally, experience with observability tools and strong scripting abilities are essential. This role emphasizes teamwork with network and security teams, and requires excellent English communication skills. Join a diverse and equal-opportunity environment where your skills will have a significant impact.

Formación

Proficiency in GCP and Microsoft Azure required.
Strong scripting and automation capabilities essential.
Very good English communication level (minimum B2).

Responsabilidades

Ensure the reliability and uptime of cloud solutions.
Collaborate with network and security teams for secure operations.
Participate in incident response and postmortems.

Conocimientos

GCP proficiency

Microsoft Azure expertise

Observability tools (Grafana, Prometheus)

Scripting and automation

Microservice architecture familiarity

Azure/GCP PostgreSQL experience

Cloud storage solutions knowledge

Container registries experience

English communication (B2)

Introduction

The SRE position is very important for our Client to ensure the availability of its applications in GCP and Azure cloud platforms.

The clients requires also disposition to provide planned on call support.

Your role and responsibilities

SRE Responsibilities include:

Ensure the reliability and uptime of cloud solutions and services, aligned with user needs
Support pre-launch activities including system design consulting, platform development, capacity planning, and launch reviews
Monitor and enhance live services by tracking availability, latency, and overall system health
Scale systems sustainably through automation and drive improvements in reliability and delivery velocity
Assess and optimize existing infrastructure within geoscience workflows
Collaborate with network and security teams to ensure secure and reliable application operations
Develop and document best practices for new projects and services
Leverage service management systems to share lessons learned and best practices across the technical community
Participate in incident response and conduct blameless postmortems

Required technical and professional expertise

Below are the Key technical skills required:

Proficiency in GCP and Microsoft Azure
Experience with observability tools such as Grafana, Prometheus, Thanos, Loki,
Knowledge of Google Stack driver/Azure monitoring
Azure CI/CD pipeline expertise
Strong scripting and automation capabilities
Familiarity with microservice architecture
Experience with Azure/GCP PostgreSQL
Experience with cloud storage such as Azure and Google storage solutions
Experience of container registries
Very good English communication level at least B2

IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.

Consigue la evaluación confidencial y gratuita de tu currículum.

o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.

Ubicaciones

Empresas destacadas

Principales puestos