Lead, Site Reliability Engineer

Royal Caribbean International

Mexico City

On-site

USD 80,000 - 120,000

Full-time

Posted 30+ days ago

Job summary

Join a forward-thinking company as a Lead Site Reliability Engineer, where your expertise will help optimize and support a high-traffic website generating significant revenue. This role combines critical incident management, system monitoring, and cross-team collaboration to enhance user experience and operational efficiency. With over a decade of experience in Site Reliability Engineering and a focus on cloud platforms, you will play a pivotal role in ensuring system performance and reliability. Embrace this opportunity to work in a dynamic environment that values innovation and teamwork, while enjoying a competitive compensation package and career development opportunities.

Experience

  • 10+ years in Site Reliability Engineering or related roles.
  • Management experience with teams and service providers.

Responsibilities

  • Support critical incidents and ensure system reliability.
  • Monitor and optimize systems, creating performance reports.
  • Collaborate with cross-functional teams for effective communication.

Skills

Site Reliability Engineering
DevOps
AWS
API Design Principles
Monitoring Tools (AppDynamics, DataDog)
Analytical Skills
Incident Response Planning
Communication Skills

Education

Bachelor’s Degree in Computer Science

Tools

AWS Elastic Beanstalk
AppDynamics
DataDog

Job description

Journey with us!

Combine your career goals and sense of adventure by joining our incredible team at Royal Caribbean Group. We offer a competitive compensation and benefits package, along with excellent career development opportunities, each providing unique ways to explore the world.

We are proud to be a leader in the vacation industry, with global brands including Royal Caribbean International, Celebrity Cruises, and Silversea Cruises. We boast the most innovative fleet and private destinations, and are dedicated to turning the vacation of a lifetime into a lifetime of vacations for our guests.

Royal Caribbean Group’s Global eCommerce division has an exciting career opportunity for a full-time Lead Site Reliability Engineer, reporting to the Sr. Manager, Site Reliability Engineer.

This position will be based on-site in Mexico City.

Position Summary:

The Lead Site Reliability Engineer (Lead SRE) will assist the SRE Manager in supporting the Royal Caribbean website, which generated $183M in gross revenue in 2021. The role involves using application and user performance data to guide informed decision-making. The Lead SRE will utilize site performance metrics from various sources and tools to support tasks such as triaging critical production incidents, analyzing bugs, implementing best practices in site reliability engineering, optimizing infrastructure, and ensuring seamless collaboration between internal teams and external service providers.

Essential Duties and Responsibilities
  • Critical Incident Support: Review ticket analysis, approve incident closures, understand website architecture, escalate incidents appropriately, communicate incident details to stakeholders, and review postmortem/RCA documents.
  • Monitor and Optimize Systems: Prioritize bug and enhancement tickets, create performance reports for deployments.
  • Ensure System Reliability and Performance: Adjust health thresholds, create and maintain performance dashboards, keep alerting and documentation tools up to date.
  • Collaboration with Cross-Functional Teams: Maintain clear communication channels with scrum and marketing teams, ensuring all team members are informed of relevant updates.

Qualifications, Knowledge, and Skills
Experience
  • Minimum of 10 years in Site Reliability Engineering (SRE), DevOps, or related IT roles.
  • At least 3 years of management experience working with teams and external service providers.
Skills and Abilities
  • Proficiency with cloud platforms such as AWS, including AWS Elastic Beanstalk.
  • Understanding of API design principles (REST, SOAP, GraphQL).
  • Advanced knowledge of monitoring and logging tools such as AppDynamics and DataDog.
  • Strong analytical and troubleshooting skills to resolve complex production issues swiftly.
  • Effective incident response planning skills.
  • Excellent communication skills for interacting with cross-functional teams and documentation.
Education
  • Bachelor’s Degree in Computer Science, Information Technology, Engineering, or a related field.
Certifications
  • Preferred certifications in monitoring, alerting tools, or IT service management.

We understand there’s a lot to consider. Our recruiters are available to provide guidance and answer any questions during the application process. Thank you for your interest in Royal Caribbean Group. We hope to see you onboard soon!

Royal Caribbean Group is committed to equal employment opportunities and prohibits discrimination or harassment based on race, color, religion, sex, age, national origin, disability, sexual orientation, gender identity or expression, marital status, or any other characteristic protected by law.
