Job Search and Career Advice Platform

Ativa os alertas de emprego por e-mail!

Application Stability Specialist

Bebeereliability

Manaus

Presencial

BRL 80.000 - 120.000

Tempo integral

Há 2 dias
Torna-te num dos primeiros candidatos

Cria um currículo personalizado em poucos minutos

Consegue uma entrevista e ganha mais. Sabe mais

Resumo da oferta

A tech company in Brazil is seeking a Site Reliability Engineer with substantial experience in DevOps and Production Support. You will handle major incidents, perform troubleshooting, and manage requests related to deployments and feature toggles. The ideal candidate has strong analytical skills, excellent communication abilities, and hands-on experience with monitoring tools like Dynatrace and Splunk. Join us to enhance our monitoring capabilities and support critical incidents in a diverse workplace.

Qualificações

  • Substantial experience in DevOps and Production Support.
  • Experience in automation and CI / CD practices.
  • Familiarity with cloud platforms (GCP, AWS, or Azure preferred).
  • Hands‑on experience with monitoring tools such as Dynatrace, Kibana, Splunk.
  • Strong analytical and problem‑solving skills.
  • Excellent communication and coordination skills across teams.

Responsabilidades

  • Handling major incidents via CIRS and providing updates until resolution.
  • Performing in-depth application troubleshooting and identifying preventive actions.
  • Managing requests including deployments and feature toggles.
  • Following up on significant production incidents.
  • Enhancing monitoring capabilities using various tools.
  • Writing and improving monitoring scripts based on incidents.
  • Handling customer escalations and coordinating across teams.
  • Supporting planned activities and responding to ad‑hoc requests.

Conhecimentos

DevOps experience
Production Support experience
Automation skills
CI/CD practices
Monitoring tools knowledge
Analytical skills
Communication skills
Coordination skills

Ferramentas

Dynatrace
Kibana
Splunk
GCP
AWS
Azure
Descrição da oferta de emprego
Site Reliability Engineer
Responsibilities
  • Handling major incidents via CIRS (Critical Issue Response System) and providing regular updates until resolution.
  • Performing in-depth application troubleshooting and identifying preventive actions.
  • Managing CIRS-related requests including deployments, feature toggles, and data fixes.
  • Following up on significant production incidents and coordinating with cross‑functional teams.
  • Enhancing monitoring capabilities using tools like Dynatrace, Kibana, and Splunk.
  • Writing and improving monitoring scripts and alerts based on incident learnings.
  • Handling customer escalations and coordinating with Support & Engineering teams.
  • Supporting planned activities and responding to ad‑hoc requests from teams.
Qualifications
  • Substantial experience in DevOps and Production Support.
  • Experience in automation and CI / CD practices.
  • Familiarity with cloud platforms (GCP, AWS, or Azure preferred).
  • Hands‑on experience with monitoring tools such as Dynatrace, Kibana, Splunk.
  • Strong analytical and problem‑solving skills.
  • Excellent communication and coordination skills across teams.

We value a diverse and inclusive work environment.

Please submit your resume in English.

Obtém a tua avaliação gratuita e confidencial do currículo.
ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.