Ativa os alertas de emprego por e-mail!

Devops - Sre (Site Reliability Engineering)

buscojobs Brasil

Espírito Santo

Presencial

BRL 60.000 - 100.000

Tempo integral

Ontem
Torna-te num dos primeiros candidatos

Melhora as tuas possibilidades de ir a entrevistas

Cria um currículo adaptado à oferta de emprego para teres uma taxa de sucesso superior.

Resumo da oferta

An innovative firm is seeking a Site Reliability Engineer to enhance the reliability and scalability of their systems. This role involves using advanced software tools for monitoring and automating tasks to improve productivity. You will collaborate closely with development teams to resolve system-related issues and ensure high performance standards. Join a dynamic team that values expertise and fosters an inclusive environment, where your contributions will directly impact the efficiency of critical systems. If you are passionate about reliability engineering and eager to make a difference, this opportunity is perfect for you.

Qualificações

  • Experience in software engineering with a focus on reliability and scalability.
  • Proficiency in automated monitoring and incident management tools.

Responsabilidades

  • Utilize software tools for continuous monitoring and reliability.
  • Respond to emergencies impacting system reliability and perform root cause analysis.
  • Streamline change management processes to enhance system performance.

Conhecimentos

Automated Monitoring
Root Cause Analysis
Change Management
Infrastructure as Code
Container Orchestration

Ferramentas

Azure Monitoring
Prometheus
Grafana
JIRA
GitHub
Terraform
Kubernetes
PagerDuty

Descrição da oferta de emprego

Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems.

Responsibilities and Duties
  1. Utilize software tools and automated tasks for continuous monitoring and reliability of applications;
  2. Act swiftly in response to emergency situations impacting system reliability in production environments, performing root cause analysis for ongoing incidents;
  3. Oversee and streamline change management processes to enhance system performance and reliability. Own releases to production environments;
  4. Work closely with development teams throughout the software lifecycle, focusing on resolving system-related issues and eliminating toil by automating routine tasks for increased productivity;
  5. Ensure the reliability and scalability of systems, maintaining high performance and efficiency standards;
  6. Proficiency in monitoring tools such as Azure Monitoring, App Insights, Prometheus, Grafana; project tracking and version control with tools like JIRA, SVN, GitHub;
  7. Expertise with Infrastructure as Code tools (Terraform, ARM/Bicep, Pulumi, etc.) and release management tools (ArgoCD, Harness, Octopus, etc.);
  8. Experience with incident alert tools (PagerDuty, Opsgenie), and container orchestration platforms like Kubernetes, AKS, etc.
About Encora

Encora is the preferred digital engineering and modernization partner for some of the world's leading enterprises and digital native companies. With over 9,000 experts across 47+ offices and innovation labs worldwide, Encora's services include Product Engineering & Development, Cloud Services, Quality Engineering, DevSecOps, Data & Analytics, Digital Experience, Cybersecurity, and AI & LLM Engineering.

At Encora, we hire professionals based solely on their skills and qualifications, and do not discriminate based on age, disability, religion, gender, sexual orientation, socioeconomic status, or nationality.

Obtém a tua avaliação gratuita e confidencial do currículo.
ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.