Job Search and Career Advice Platform

Ativa os alertas de emprego por e-mail!

Senior Cloud SRE

HCLTech

Barueri

Presencial

BRL 80.000 - 120.000

Tempo integral

Hoje
Torna-te num dos primeiros candidatos

Cria um currículo personalizado em poucos minutos

Consegue uma entrevista e ganha mais. Sabe mais

Resumo da oferta

A leading technology company in São Paulo is seeking a Cloud Infrastructure SRE / Tier 4 Engineer responsible for maintaining Red Hat private cloud and enhancing troubleshooting protocols. Candidates should have strong Linux expertise, a problem-solving mindset, and proficiency in automation tools like Bash and Python. The role involves following a shift schedule or on-call rotations to ensure business continuity. This is a unique opportunity to work within a collaborative, hands-on environment that values continuous learning.

Qualificações

  • Strong familiarity with Linux distributions; primarily Red Hat and CentOS.
  • Solid understanding of networking fundamentals (VLANs, IP routing).
  • Excellent troubleshooting skills and analytical thinking.
  • Proficiency in Bash and Python, or willingness to learn.
  • Knowledge of Podman, Kubernetes, Helm, and/or OpenStack.
  • Experience with storage systems like Ceph and Rook.

Responsabilidades

  • Maintain and enhance Red Hat private cloud.
  • Conduct deep dive troubleshooting from Kubernetes error messages.
  • Automate processes to prevent recurring issues.
  • Collaborate closely with developers and product engineers.
  • Participate in a follow-the-sun scheme with 12-hour shifts.

Conhecimentos

Linux Expertise
Problem-Solving Mindset
Scripting and Automation
Networking Foundations
Containerization & Virtualization
Storage Systems
Database Expertise
Monitoring and Logging
Advanced written and spoken English

Ferramentas

Bash
Python
Ansible
Kubernetes
Podman
Helm
Ceph
MySQL
ELK
Prometheus
Descrição da oferta de emprego

Cloud Infrastructure SRE / Tier 4 Engineer As a Cloud Infrastructure SRE / Tier 4 Engineer, you will join a specialized engineering task force dedicated to preventing and resolving the most critical and strategic customer issues encountered in the field.

Our global team of experienced engineers has a deep understanding of the lower layers of private cloud infrastructure. Together, we work on practical solutions, continuously learning and adapting. We collaborate closely, share knowledge, and tackle challenges head‑on.

If you're an engineer eager to understand the core of cloud technologies and looking for a team that values hands‑on expertise, you'll fit right in with us.

What you will learn and contribute to

Maintain and enhance Red Hat private cloud: You’ll work extensively with Nokia Container Services (NCS) and CloudBand Infrastructure Software (CBIS), private cloud solutions based on Kubernetes andStack.

Deep Dive Troubleshooting: Starting from high‑level Kubernetes error messages, you’ll navigate through multiple layers until pinpointing issues—even down to kernel‑level bugs.

Automate and Develop: Spend 30–50% of your time developing automation to prevent recurring issues. Solve it once, automate for the future. Use your Python skills to create health checks and improve reliability.

Continuous Learning: In our rapidly evolving tech landscape, ongoing learning is a cornerstone.

Collaborate with Developers and Engineers: Work closely with developers and product engineers to bridge infrastructure and software, ensuring seamless product delivery.

Mode of Operation: This role requires participation in a follow‑the‑sun scheme with 12‑hour shifts (within regulatory limits) or participation in on‑call rotations to ensure business continuity during emergencies.

Your skills and experience

Linux Expertise: Strong familiarity with Linux distributions; we primarily use Red Hat and CentOS.

Networking Foundations: Solid understanding of networking fundamentals (VLANs, IP routing). Experience with Calico, Multus, and Open vSwitch is a plus.

Problem‑Solving Mindset: Excellent troubleshooting skills and analytical thinking to address complex challenges.

Scripting and Automation: Proficiency in Bash and Python, or willingness to learn, plus familiarity with Ansible.

Containerization & Virtualization: Knowledge of Podman, Kubernetes, Helm, and / or OpenStack; experience with KVM / QEMU is advantageous.

Storage Systems: Experience with Ceph and Rook is highly valued.

Database Expertise: Understanding of relational databases (MySQL, MariaDB) and experience with etcd.

Monitoring and Logging: Familiarity with tools like ELK (Elasticsearch, Logstash, Kibana), Prometheus, and Grafana.

Advanced written and spoken English.

It would be nice if you also had

Proactive thinking and ownership mindset.

Strong focus on quality and reliability.

Passion for delivering training or knowledge‑sharing sessions to operations teams.

Obtém a tua avaliação gratuita e confidencial do currículo.
ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.