¡Activa las notificaciones laborales por email!

Sr Principal Site Reliability Developer

Oracle

Zapopan

Presencial

USD 60,000 - 100,000

Jornada completa

Hace 7 días
Sé de los primeros/as/es en solicitar esta vacante

Mejora tus posibilidades de llegar a la entrevista

Elabora un currículum adaptado a la vacante para tener más posibilidades de triunfar.

Descripción de la vacante

An innovative company is seeking a Service Reliability Engineer to tackle complex infrastructure challenges in cloud services. This role involves designing and deploying software solutions to enhance the performance and reliability of cloud products. You will collaborate with cross-functional teams to ensure high-quality service delivery while managing capacity planning and performance management. If you are passionate about automation, cloud technologies, and continuous improvement, this position offers a unique opportunity to make a significant impact on global services. Join a dynamic team and contribute to pioneering cloud systems that redefine industry standards.

Formación

  • 5+ years of experience in enterprise applications/cloud.
  • Strong communication and analytical skills.
  • Knowledge of CI/CD practices and security in application delivery.

Responsabilidades

  • Deploy and support cloud services in new regions.
  • Implement alerting and diagnostics, including security alerts.
  • Develop automation and scripts for administration.

Conocimientos

Python
SQL/PLSQL
Java
JavaScript
Automation
Security Fundamentals
Analytical Skills
Agile Methodologies

Educación

BE/BTECH/MS/MSc/MCA/MTech in Computer Science

Herramientas

Oracle Database technologies
CI/CD practices
Configuration Management Tools
SDKs
APIs

Descripción del empleo

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning, demand forecasting, software performance analysis, and system tuning.

Work with the Site Reliability Engineering (SRE) team on shared full-stack ownership of services and technology areas. Understand the configuration, dependencies, and behavioral characteristics of production services. Responsible for designing and delivering mission-critical stacks focusing on security, resiliency, scale, and performance. Have authority over end-to-end performance and operability. Partner with development teams to improve service architecture and guide the engineering of capabilities for the Oracle Cloud service portfolio. Communicate the scale, capacity, security, and performance attributes of the service and technology stack. Demonstrate understanding of automation and orchestration principles. Act as an escalation point for complex issues not covered by SOPs. Use deep knowledge of service topology and dependencies to troubleshoot and define mitigations. Explain how product architecture affects distributed systems. Maintain professional curiosity and a desire to deepen understanding of services and technologies.

The role combines production platform operations and engineering, solving technical problems, proposing improvements, and implementing recommendations. Collaborate directly with high-level developers on projects, bridging system operations and development support.

As a Service Reliability Engineer, you will define and deploy key services, focusing on architecture, operations, capacity planning, performance management, and deployment. Work with cross-functional teams to deliver high-quality experiences while ensuring reliability and performance.

The ideal candidate will have:

  • BE/BTECH/MS/MSc/MCA/MTech in Computer Science or equivalent from a reputed institute with 5+ years of experience in enterprise applications/cloud.
  • Experience managing infrastructure/applications on two of the following: Oracle Database technologies (RAC, Data Guard, Exadata, ASM/RMAN), automation technologies, security fundamentals, development languages (Python, SQL/PLSQL, Java/JavaScript, Oracle APEX).
  • Strong communication and analytical skills.
  • Ability to estimate efforts accurately and deliver on time.
  • Experience with agile methodologies and understanding of product development.
  • Knowledge of CI/CD practices and security in application delivery.
  • Deep understanding of virtualization, cloud services, Linux internals.
  • Development expertise in Python and Bash is desirable.
  • Experience with SDKs, APIs, Java/JavaScript, configuration management tools, internet protocols, and network services.
  • Proven ability to support high-performance, large-scale systems.

What You’ll Do

This role offers the opportunity to work on pioneering cloud systems, impacting global services. Responsibilities include:

  • Deploy and support cloud services in new regions.
  • Implement alerting and diagnostics, including security alerts.
  • Collaborate with product teams to enhance quality.
  • Develop automation and scripts for administration.
  • Create and maintain SOPs and documentation.
  • Support customers with database and cloud infrastructure issues.
  • Participate in team meetings and activities.
  • Perform other duties as assigned.
Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.