¡Activa las notificaciones laborales por email!

App Sustain & Ops Engineer

PepsiCo

Ciudad de México

Presencial

MXN 600,000 - 800,000

Jornada completa

Hace 30+ días

Genera un currículum adaptado en cuestión de minutos

Consigue la entrevista y gana más. Más información

Descripción de la vacante

A leading global food and beverage company is seeking an Operations Engineer in Mexico City to ensure the reliability and performance of production systems. Qualified candidates will have a Bachelor's degree in a relevant field, at least 5 years of experience, and be fluent in English and Spanish. This role emphasizes collaboration with engineering teams and requires strong knowledge of cloud platforms and monitoring tools. Attractive benefits and a focus on professional development are offered.

Servicios

Opportunities to learn and develop

Financial wellness programs

Flexibility program

Wellness Line benefits

Formación

5+ years of experience in operations engineering, site reliability engineering, or systems administration.
Strong knowledge of cloud platforms (AWS, Azure, GCP).
Hands-on experience with monitoring and alerting tools.

Responsabilidades

Ensure production systems, applications are reliable and performant.
Lead troubleshooting of critical incidents and drive timely resolution.
Collaborate with Engineering & Product Teams to design scalable solutions.

Conocimientos

Linux/Unix

Windows server environments

Monitoring and alerting tools

Scripting/programming languages

CI/CD pipelines

Database management

Cloud platforms

Networking fundamentals

Service Now

Fluent in English and Spanish

Educación

Bachelor’s degree in computer science or related field

Herramientas

Prometheus

Grafana

Datadog

Splunk

Nagios

AppDynamics

Python

Bash

PowerShell

Jenkins

Ansible

Puppet

Chef

MySQL

MongoDB

Cassandra

Couchbase

AWS

Azure

GCP

Overview

We Are PepsiCo

Join PepsiCo and Dare for Better! We are the perfect place for curious people, thinkers and change agents. From leadership to front lines, we're excited about the future and working together to make the world a better place.

Being part of PepsiCo means being part of one of the largest food and beverage companies in the world, with our iconic brands consumed more than a billion times a day in more than 200 countries.

Our product portfolio, which includes 22 of the world's most iconic brands, such as Sabritas, Gamesa, Quaker, Pepsi, Gatorade and Sonrics, has been a part of Mexican homes for more than 116 years.

A career at PepsiCo means working in a culture where all people are welcome. Here, you can dare to be you. No matter who you are, where you're from, or who you love, you can always influence the people around you and make a positive impact in the world.

Know more: PepsiCoJobs

Join PepsiCo, dare for better.

Responsibilities

Role is responsible for ensuring the overall stability of production application. Reliability, availability, scalability, and efficiency of our production systems and platforms.

The Operations Engineer will collaborate with cross‑functional teams—including Software Engineering, Service Reliability, Infrastructure, and Business Operations—to streamline processes, manage day‑to‑day operations, monitor system health, and quickly resolve incidents.

As App Sustain & Ops Engineer your scope would consist of:

System Reliability & Availability – Ensure production systems, applications, and infrastructure are reliable, performant, and available within agreed SLAs/OLAs.
Incident & Problem Management – Lead troubleshooting of critical incidents and drive timely resolution as part of Incident Management. Ensure the Root Cause Analysis is performed and help coordinate the implementation of permanent fixes. Analyze priority incidents to generate insights and identify gaps in the alerting mechanisms. Analyze market‑specific issues and conduct comparative studies to determine why certain problems occur only in specific markets.
Monitoring & Alerting – Partner with the Service Reliability Engineering team to identify, develop and maintain proactive monitoring, alerting, and health checks to detect and prevent issues before business impact. Assist the SRE team in identifying critical health checks for order flow, Order journey and user journeys to enable dedicated notifications for key steps.
Deployment & Change Operations – Partner with the Software Engineering team to support safe, efficient deployments and configuration changes, ensuring minimal disruption. Provide insights on system performance and capacity trends; recommend improvements for scalability and efficiency.
Automation & Continuous Improvement – Identify manual operational tasks and automate processes to increase efficiency, reduce errors and improve response times. Identify recurring data anomalies through analysis and assist in determining effective technical and process‑related solutions. Review L2 team’s manual processes to uncover automation opportunities and implement technology‑specific solutions aimed at improving productivity.
Collaboration with Engineering & Product Teams – Partner with development, infrastructure, and reliability engineering teams to design and deliver operable, scalable, and resilient solutions.
Operational Excellence & Documentation – Maintain runbooks, SOPs, and technical documentation; uphold IT controls, compliance, and audit readiness.
Risk & Security Management – Enforce operational security best practices, support vulnerability remediation, and contribute to disaster recovery and business continuity planning.

Qualifications

Bachelor’s degree in computer science, Information Technology, Engineering, or a related field (or equivalent experience).
5+ years of experience in operations engineering, site reliability engineering, or systems administration.
Fluent in English and Spanish.
Strong knowledge of Linux/Unix and/or Windows server environments.
Experience with monitoring and alerting tools (e.g., Prometheus, Grafana, Datadog, Splunk, Nagios, AppDynamics, Full Story, Ignio).
Proficiency in at least one scripting/programming language (e.g., Python, Bash, PowerShell).
Familiarity with CI/CD pipelines, deployment automation, and configuration management (e.g., Jenkins, Ansible, Puppet, Chef).
Database – MySQL, MongoDB, Cassandra, Couchbase.
Understanding of networking fundamentals (DNS, TCP/IP, load balancing, firewalls).
Hands‑on experience with cloud platforms (AWS, Azure, GCP).
Experience working with Service Now.

Benefits

Opportunities to learn and develop every day through a wide range of programs.

Internal digital platforms that promote self‑learning.

Development programs according to Leadership skills.

Specialized training according to the role.

Learning experiences with internal and external providers.

We love to celebrate success, which is why we have recognition programs for seniority, behavior, leadership, moments of life, among others.

Financial wellness programs that will help you reach your goals in all stages of life.

A flexibility program that will allow you to balance your personal and work life, adapting your working day to your lifestyle.

And because your family is also important to us, they can also enjoy benefits such as our Wellness Line, thousands of Agreements and Discounts, Scholarship programs for your children, Aid Plans for different moments of life, among others.

We are an equal opportunity employer and value diversity at our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We respect and value diversity as a work‑force and innovation for the organization.

Consigue la evaluación confidencial y gratuita de tu currículum.

o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.

Ubicaciones

Empresas destacadas

Principales puestos