¡Activa las notificaciones laborales por email!

App Sustain & Ops Engineer

PepsiCo Deutschland GmbH

Ciudad de México

Presencial

MXN 1,110,000 - 1,481,000

Jornada completa

Hace 30+ días

Genera un currículum adaptado en cuestión de minutos

Consigue la entrevista y gana más. Más información

Descripción de la vacante

A leading food and beverage company is seeking an App Sustain & Ops Engineer in Mexico City. This role focuses on ensuring system reliability, managing incidents, and collaborating with engineering teams to optimize operations. Candidates should have 5+ years in operations engineering, strong knowledge of server environments, and be fluent in English and Spanish. The position offers opportunities for continuous learning and a flexible work environment.

Servicios

Financial wellness programs
Recognition programs
Learning and development opportunities
Flexibility program for work-life balance

Formación

  • 5+ years of experience in operations engineering or related field.
  • Fluent in English and Spanish.
  • Experience with cloud platforms and CI/CD pipelines.

Responsabilidades

  • Ensure production systems are reliable and performant.
  • Lead incident management and root cause analysis.
  • Collaborate with engineering teams to improve system capabilities.

Conocimientos

Linux/Unix
Windows server environments
Monitoring and alerting tools
Scripting/programming languages
Cloud platforms
Networking fundamentals
Service Now

Educación

Bachelor’s degree in Computer Science or related field

Herramientas

Prometheus
Grafana
Datadog
MySQL
MongoDB
AWS
Azure
Descripción del empleo
Overview

We Are PepsiCo

Join PepsiCo and Dare for Better! We are the perfect place for curious people, thinkers and change agents. From leadership to front lines, we're excited about the future and working together to make the world a better place.

Being part of PepsiCo means being part of one of the largest food and beverage companies in the world, with our iconic brands consumed more than a billion times a day in more than 200 countries.

Our product portfolio, which includes 22 of the world's most iconic brands, such as Sabritas, Gamesa, Quaker, Pepsi, Gatorade and Sonrics, has been a part of Mexican homes for more than 116 years.

A career at PepsiCo means working in a culture where all people are welcome. Here, you can dare to be you. No matter who you are, where you're from, or who you love, you can always influence the people around you and make a positive impact in the world.

Know more: PepsiCoJobs

Join PepsiCo, dare for better.

Responsibilities

The Opportunity

Role is responsible for ensuring the overall stability of production application. Reliability, availability, scalability, and efficiency of our production systems and platforms. The Operations Engineer will collaborate with cross-functional teams—including Software Engineering, Service Reliability, Infrastructure, and Business Operations—to streamline processes, manage day-to-day operations, monitor system health, and quickly resolve incidents.

Your Impact

As App Sustain & Ops Engineer your scope would consist of:

  1. System Reliability & Availability: Ensure production systems, applications, and infrastructure are reliable, performant, and available within agreed SLAs/OLAs.
  2. Incident & Problem Management: Lead troubleshooting of critical incidents and drive timely resolution as part of Incident Management. Ensure the Root Cause Analysis is performed and help coordinate the implementation of permanent fixes on a timely basis. Analyze priority incidents to generate insights and identify gaps in the alerting mechanisms. Analyze market-specific issues and conduct comparative studies to determine why certain problems occur only in specific markets.
  3. Monitoring & Alerting: Partner with the Service Reliability Engineering team to identify, develop and maintain proactive monitoring, alerting, and health checks to detect and prevent issues before business impact. Assist the SRE team in identifying critical health checks for order flow, Order journey and user journeys to enable dedicated notifications for key steps.
  4. Deployment & Change Operations: Partner with the Software Engineering team to support safe, efficient deployments and configuration changes, ensuring minimal disruption to business operations. Provide insights on system performance and capacity trends; provide recommendations to the Software Engineering to implement improvements for scalability and efficiency.
  5. Automation & Continuous Improvement: Identify manual operational tasks and automate processes to increase efficiency, reduce errors, and improve response times. Identify recurring data anomalies through analysis and assist in determining effective technical and process-related solutions. Review L2 team’s manual processes to uncover automation opportunities and implement technology-specific solutions aimed at improving productivity.
  6. Collaboration with Engineering & Product Teams: Partner with development, infrastructure, and reliability engineering teams to design and deliver operable, scalable, and resilient solutions.
  7. Operational Excellence & Documentation: Maintain runbooks, SOPs, and technical documentation; uphold IT controls, compliance, and audit readiness.
  8. Risk & Security Management: Enforce operational security best practices, support vulnerability remediation, and contribute to disaster recovery and business continuity planning.

Qualifications

  • Bachelor’s degree in computer science, Information Technology, Engineering, or a related field (or equivalent experience).
  • 5+ years of experience in operations engineering, site reliability engineering, or systems administration.
  • Fluent in English and Spanish
  • Strong knowledge of Linux/Unix and/or Windows server environments.
  • Experience with monitoring and alerting tools (e.g., Prometheus, Grafana, Datadog, Splunk, Nagios, AppDynamics, Full Story, Ignio).
  • Proficiency in at least one scripting/programming language (e.g., Python, Bash, PowerShell).
  • Familiarity with CI/CD pipelines, deployment automation, and configuration management (e.g., Jenkins, Ansible, Puppet, Chef).
  • Database - MySQL, MongoDB, Cassandra, Couchbase
  • Understanding of networking fundamentals (DNS, TCP/IP, load balancing, firewalls).
  • Hands-on experience with cloud platforms (AWS, Azure, GCP).
  • Experience working with Service Now.
Who Are We Looking For?

If this is an opportunity that interests you, we encourage you to apply even if you do not meet 100% of the requirements.

What can you expect from us
  • Opportunities to learn and develop every day through a wide range of programs.
  • Internal digital platforms that promote self-learning.
  • Development programs according to Leadership skills.
  • Specialized training according to the role.
  • Learning experiences with internal and external providers.
  • We love to celebrate success, which is why we have recognition programs for seniority, behavior, leadership, moments of life, among others.
  • Financial wellness programs that will help you reach your goals in all stages of life.
  • A flexibility program that will allow you to balance your personal and work life, adapting your working day to your lifestyle.
  • And because your family is also important to us, they can also enjoy benefits such as our Wellness Line, thousands of Agreements and Discounts, Scholarship programs for your children, Aid Plans for different moments of life, among others.

We are an equal opportunity employer and value diversity at our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We respect and value diversity as a work force and innovation for the organization.

Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.