Ativa os alertas de emprego por e-mail!

Application Stability Specialist

Bebeereliability

Manaus

Presencial

BRL 80.000 - 120.000

Tempo integral

Há 2 dias

Torna-te num dos primeiros candidatos

Cria um currículo personalizado em poucos minutos

Consegue uma entrevista e ganha mais. Sabe mais

Resumo da oferta

A tech company in Brazil is seeking a Site Reliability Engineer with substantial experience in DevOps and Production Support. You will handle major incidents, perform troubleshooting, and manage requests related to deployments and feature toggles. The ideal candidate has strong analytical skills, excellent communication abilities, and hands-on experience with monitoring tools like Dynatrace and Splunk. Join us to enhance our monitoring capabilities and support critical incidents in a diverse workplace.

Qualificações

Substantial experience in DevOps and Production Support.
Experience in automation and CI / CD practices.
Familiarity with cloud platforms (GCP, AWS, or Azure preferred).
Hands‑on experience with monitoring tools such as Dynatrace, Kibana, Splunk.
Strong analytical and problem‑solving skills.
Excellent communication and coordination skills across teams.

Responsabilidades

Handling major incidents via CIRS and providing updates until resolution.
Performing in-depth application troubleshooting and identifying preventive actions.
Managing requests including deployments and feature toggles.
Following up on significant production incidents.
Enhancing monitoring capabilities using various tools.
Writing and improving monitoring scripts based on incidents.
Handling customer escalations and coordinating across teams.
Supporting planned activities and responding to ad‑hoc requests.

Conhecimentos

DevOps experience

Production Support experience

Automation skills

CI/CD practices

Monitoring tools knowledge

Analytical skills

Communication skills

Coordination skills

Ferramentas

Dynatrace

Kibana

Splunk

GCP

AWS

Azure

Site Reliability Engineer

Responsibilities

Handling major incidents via CIRS (Critical Issue Response System) and providing regular updates until resolution.
Performing in-depth application troubleshooting and identifying preventive actions.
Managing CIRS-related requests including deployments, feature toggles, and data fixes.
Following up on significant production incidents and coordinating with cross‑functional teams.
Enhancing monitoring capabilities using tools like Dynatrace, Kibana, and Splunk.
Writing and improving monitoring scripts and alerts based on incident learnings.
Handling customer escalations and coordinating with Support & Engineering teams.
Supporting planned activities and responding to ad‑hoc requests from teams.

Qualifications

Substantial experience in DevOps and Production Support.
Experience in automation and CI / CD practices.
Familiarity with cloud platforms (GCP, AWS, or Azure preferred).
Hands‑on experience with monitoring tools such as Dynatrace, Kibana, Splunk.
Strong analytical and problem‑solving skills.
Excellent communication and coordination skills across teams.

We value a diverse and inclusive work environment.

Please submit your resume in English.

Obtém a tua avaliação gratuita e confidencial do currículo.

ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.

Melhores cidades

Melhores empresas

Ofertas populares