Site Reliability Engineer

HCLTech

Montanha

On-site

BRL 120,000 - 160,000

Full-time

Posted 4 days ago

Job summary

A leading technology firm in Brazil is seeking a skilled professional for handling major incidents and providing production support. You will manage requests related to critical incidents while improving monitoring capabilities using tools like Dynatrace and Splunk. Ideal candidates will have deep DevOps experience, strong troubleshooting skills, and familiarity with cloud platforms. The role offers the opportunity to work in a dynamic environment with a focus on automation and incident resolution.

Qualifications

  • Deep experience in DevOps and Production Support.
  • Experience in automation and CI/CD practices.
  • Familiarity with cloud platforms like GCP, AWS, or Azure.

Responsibilities

  • Handle major incidents via the Critical Issue Response System.
  • Provide updates until resolution of incidents.
  • Manage CIRS-related requests including deployments and data fixes.
  • Enhance monitoring capabilities using various tools.

Skills

DevOps expertise
Production Support experience
Automation skills
CI/CD practices
Monitoring tools experience
Strong troubleshooting skills
Communication skills

Tools

Dynatrace
Kibana
Splunk
GCP
AWS
Azure

Job description

Your role and responsibilities
  • Handling major incidents via CIRS (Critical Issue Response System) and providing frequent updates until resolution.
  • Performing deep-dive application troubleshooting and identifying preventive actions.
  • Managing CIRS-related requests including deployments, feature toggles, and data fixes.
  • Following up on major production incidents and coordinating with cross-functional teams.
  • Enhancing monitoring capabilities using tools like Dynatrace, Kibana, and Splunk.
  • Writing and improving monitoring scripts and alerts based on incident learnings.
  • Handling customer escalations and coordinating with Support & Engineering teams.
  • Supporting planned activities and responding to ad-hoc requests from CES teams.

Requirements and Qualifications
  • Deep experience in DevOps and Production Support.
  • Experience in automation and CI/CD practices.
  • Familiarity with cloud platforms (GCP, AWS, or Azure preferred).
  • Hands-on experience with monitoring tools such as Dynatrace, Kibana, Splunk.
  • Strong troubleshooting skills and ability to deep dive into application issues.
  • Excellent communication and coordination skills across teams.

Please submit your résumé in English.
