Enable job alerts via email!

Site Reliability Engineer (Expert) 0630

Executiveplacements.Com - The Job Portal

Gauteng

On-site

ZAR 800 000 - 1 000 000

Full time

Yesterday
Be an early applicant

Job summary

A leading technology firm seeks an Expert Site Reliability Engineer to join their engineering team. You'll drive operational excellence, design scalable infrastructure, mentor peers, and improve reliability across platforms. Requires 10+ years of experience in SRE or similar roles, strong AWS knowledge, and skills in Python, Go, and Kubernetes. Apply now with your detailed CV.

Qualifications

  • 10+ years in SRE, DevOps, or similar roles.
  • Proficient in AWS and cloud-native technologies.
  • Experience with Docker, Kubernetes, CI/CD, and GitOps.

Responsibilities

  • Design and implement scalable infrastructure solutions.
  • Architect and maintain monitoring & alerting systems.
  • Develop automated workflows to reduce manual effort.
  • Lead major incident response and drive continuous improvement.
  • Mentor team members and influence engineering decisions.

Skills

SRE methodology
Networking fundamentals
AWS proficiency
Experience with Python
Experience with Go
Experience with JavaScript/TypeScript
Docker
Kubernetes
CI/CD pipelines
GitOps

Tools

Grafana
Prometheus
Terraform
PostgreSQL
MongoDB
Job description
Site Reliability Engineer (Expert)

Employer: Open Source (Pty) Ltd
Location: Midrand, South Africa

We're looking for an Expert Site Reliability Engineer (SRE) to join our multi‑member engineering team, driving operational excellence across our platform.

Why Join Us?

Work on high‑availability, multi‑region deployments. Shape our observability strategy and implement automation at scale. Collaborate with development teams to enhance service reliability. Lead incident response and drive systematic improvements.

Key Responsibilities
  • Design and implement scalable infrastructure solutions to ensure system reliability.
  • Architect and maintain monitoring & alerting systems for observability.
  • Develop automated workflows to reduce manual effort.
  • Lead major incident response and drive continuous improvement.
  • Mentor team members and influence engineering decisions as a technical leader.
  • Build internal tools to increase operational efficiency.
  • Establish and enforce SRE methodologies and best practices.
Qualifications & Experience
  • 10+ years in SRE, DevOps, or similar roles.
  • Strong networking fundamentals.
  • Proficient in AWS and cloud‑native technologies.
  • Experience with Python, Go, or JavaScript/TypeScript.
  • Experience with Docker, Kubernetes, CI/CD, and GitOps (Flux/ArgoCD).
  • Knowledge of monitoring tools (Grafana, Prometheus, Loki, Tempo).
  • Bonus: Advanced Kubernetes certification (CKA/CKAD), Terraform, PostgreSQL, MongoDB experience.
  • Expertise in performance optimization, cost management, security hardening, and compliance.
Tech Stack
  • Containerization: Kubernetes, Docker
  • Observability: Grafana + Prometheus stack
  • Infrastructure: Cloud‑native technologies
  • Programming: Go, Python, TypeScript/JavaScript
  • CI/CD: Modern pipeline tools
  • Multi‑region deployments & microservices architecture

Apply now with your latest and detailed CV!

Referrals increase your chances of interviewing at The JOB Portal by 2x.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.