Job Search and Career Advice Platform

Ativa os alertas de emprego por e-mail!

Senior Cloud SRE

HCLTech

Canoas

Presencial

BRL 80.000 - 120.000

Tempo integral

Hoje
Torna-te num dos primeiros candidatos

Cria um currículo personalizado em poucos minutos

Consegue uma entrevista e ganha mais. Sabe mais

Resumo da oferta

A technology solutions company is seeking a Cloud Infrastructure SRE / Tier 4 Engineer in Brazil to maintain and enhance Red Hat private cloud solutions. The role involves troubleshooting, automation development, and collaborating with developers for effective product delivery. Candidates should have strong Linux expertise, networking knowledge, and proficiency in Bash and Python. This position requires participation in a follow-the-sun scheme with 12-hour shifts or on-call rotations to ensure business continuity during emergencies.

Qualificações

  • Strong familiarity with Linux distributions, primarily Red Hat and CentOS.
  • Solid understanding of networking fundamentals including VLANs and IP routing.
  • Excellent troubleshooting skills with a problem-solving mindset.
  • Proficiency in Bash and Python, with familiarity in Ansible.
  • Knowledge of containerization with Kubernetes, Helm, and OpenStack.
  • Experience with Ceph and Rook valued for storage systems.
  • Understanding of relational databases like MySQL and MariaDB.

Responsabilidades

  • Maintain and enhance Red Hat private cloud solutions.
  • Perform deep dive troubleshooting from Kubernetes error messages.
  • Develop automation scripts to prevent recurring issues.
  • Collaborate with developers and product engineers for seamless delivery.

Conhecimentos

Linux Expertise
Networking Foundations
Problem-Solving Mindset
Scripting and Automation
Containerization & Virtualization
Storage Systems
Database Expertise
Monitoring and Logging
Advanced written and spoken English

Ferramentas

Kubernetes
Ansible
Bash
Python
ELK
Prometheus
Grafana
Ceph
Descrição da oferta de emprego

Cloud Infrastructure SRE / Tier 4 Engineer As a Cloud Infrastructure SRE / Tier 4 Engineer, you will join a specialized engineering task force dedicated to preventing and resolving the most critical and strategic customer issues encountered in the field.

Our global team of experienced engineers has a deep understanding of the lower layers of private cloud infrastructure. Together, we work on practical solutions, continuously learning and adapting. We collaborate closely, share knowledge, and tackle challenges head‑on.

If you're an engineer eager to understand the core of cloud technologies and looking for a team that values hands‑on expertise, you'll fit right in with us.

What you will learn and contribute to

Maintain and enhance Red Hat private cloud: You’ll work extensively with Nokia Container Services (NCS) and CloudBand Infrastructure Software (CBIS), private cloud solutions based on Kubernetes andStack.

Deep Dive Troubleshooting: Starting from high‑level Kubernetes error messages, you’ll navigate through multiple layers until pinpointing issues—even down to kernel‑level bugs.

Automate and Develop: Spend 30–50% of your time developing automation to prevent recurring issues. Solve it once, automate for the future. Use your Python skills to create health checks and improve reliability.

Continuous Learning: In our rapidly evolving tech landscape, ongoing learning is a cornerstone.

Collaborate with Developers and Engineers: Work closely with developers and product engineers to bridge infrastructure and software, ensuring seamless product delivery.

Mode of Operation: This role requires participation in a follow‑the‑sun scheme with 12‑hour shifts (within regulatory limits) or participation in on‑call rotations to ensure business continuity during emergencies.

Your skills and experience

Linux Expertise: Strong familiarity with Linux distributions; we primarily use Red Hat and CentOS.

Networking Foundations: Solid understanding of networking fundamentals (VLANs, IP routing). Experience with Calico, Multus, and Open vSwitch is a plus.

Problem‑Solving Mindset: Excellent troubleshooting skills and analytical thinking to address complex challenges.

Scripting and Automation: Proficiency in Bash and Python, or willingness to learn, plus familiarity with Ansible.

Containerization & Virtualization: Knowledge of Podman, Kubernetes, Helm, and / or OpenStack; experience with KVM / QEMU is advantageous.

Storage Systems: Experience with Ceph and Rook is highly valued.

Database Expertise: Understanding of relational databases (MySQL, MariaDB) and experience with etcd.

Monitoring and Logging: Familiarity with tools like ELK (Elasticsearch, Logstash, Kibana), Prometheus, and Grafana.

Advanced written and spoken English.

It would be nice if you also had

Proactive thinking and ownership mindset.

Strong focus on quality and reliability.

Passion for delivering training or knowledge‑sharing sessions to operations teams.

Obtém a tua avaliação gratuita e confidencial do currículo.
ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.