DevOps - SRE (Site Reliability Engineering)

buscojobs Brasil

Intermediate Geographic Region of São Luís

On-site

BRL 50,000 - 90,000

Full-time

Posted yesterday

Job summary

An innovative firm is seeking a Site Reliability Engineer to enhance the reliability and scalability of software systems. In this role, you'll leverage your expertise in automation and monitoring tools to ensure high performance and efficiency. You'll collaborate closely with development teams, focusing on solving system-related issues and streamlining change management processes. Join a dynamic team dedicated to creating scalable, reliable software solutions that power leading enterprises worldwide. If you're passionate about technology and eager to make a significant impact, this opportunity is perfect for you.

Qualifications

  • Experience in software engineering and infrastructure operations.
  • Strong focus on system reliability and automation of routine tasks.

Responsibilities

  • Utilize tools for continuous monitoring and reliability of applications.
  • Act swiftly in response to incidents impacting system reliability.

Skills

Continuous Monitoring
Root Cause Analysis
Change Management
Automation
System Reliability
Proficiency in Monitoring Tools
Infrastructure as Code
Incident Management

Tools

Azure Monitor
Prometheus
Grafana
JIRA
GitHub
Terraform
Kubernetes

Job description

Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems.

Responsibilities and Duties

  1. Utilize software tools and automated tasks for continuous monitoring and reliability of applications;
  2. Act swiftly in response to emergency situations impacting system reliability in production environments, performing root cause analysis for ongoing incidents;
  3. Oversee and streamline change management processes to enhance system performance and reliability, and own releases to production environments;
  4. Work closely with development teams throughout the software lifecycle, focusing on solving system-related issues and eliminating toil by automating routine tasks;
  5. Focus on the reliability and scalability of systems, ensuring high performance and efficiency standards;
  6. Proficiency in monitoring tools such as Azure Monitor, Application Insights, Prometheus, and Grafana. Experience with project tracking and version control tools such as JIRA, SVN, and GitHub;
  7. Expertise with Infrastructure as Code tools (Terraform, ARM / Bicep, Pulumi, etc.) and release management tools (ArgoCD, Harness, Octopus, etc.);
  8. Experience with incident alerting tools (PagerDuty, Opsgenie) and container orchestration platforms such as Kubernetes and AKS.

About Encora

Encora is the preferred digital engineering and modernization partner of some of the world’s leading enterprises and digital native companies. With over 9,000 experts in 47+ offices and innovation labs worldwide, Encora’s technology practices include Product Engineering & Development, Cloud Services, Quality Engineering, DevSecOps, Data & Analytics, Digital Experience, Cybersecurity, and AI & LLM Engineering.

At Encora, we hire professionals based solely on their skills and qualifications, and do not discriminate based on age, disability, religion, gender, sexual orientation, socioeconomic status, or nationality.