Enable job alerts via email!

Site Reliability Engineer (Expert) 0630

Executiveplacements.Com - The Job Portal

Gauteng

On-site

ZAR 800 000 - 1 000 000

Full time

Yesterday

Be an early applicant

Job summary

A leading technology firm seeks an Expert Site Reliability Engineer to join their engineering team. You'll drive operational excellence, design scalable infrastructure, mentor peers, and improve reliability across platforms. Requires 10+ years of experience in SRE or similar roles, strong AWS knowledge, and skills in Python, Go, and Kubernetes. Apply now with your detailed CV.

Qualifications

10+ years in SRE, DevOps, or similar roles.
Proficient in AWS and cloud-native technologies.
Experience with Docker, Kubernetes, CI/CD, and GitOps.

Responsibilities

Design and implement scalable infrastructure solutions.
Architect and maintain monitoring & alerting systems.
Develop automated workflows to reduce manual effort.
Lead major incident response and drive continuous improvement.
Mentor team members and influence engineering decisions.

Skills

SRE methodology

Networking fundamentals

AWS proficiency

Experience with Python

Experience with Go

Experience with JavaScript/TypeScript

Docker

Kubernetes

CI/CD pipelines

GitOps

Tools

Grafana

Prometheus

Terraform

PostgreSQL

MongoDB

Site Reliability Engineer (Expert)

Employer: Open Source (Pty) Ltd
Location: Midrand, South Africa

We're looking for an Expert Site Reliability Engineer (SRE) to join our multi‑member engineering team, driving operational excellence across our platform.

Why Join Us?

Work on high‑availability, multi‑region deployments. Shape our observability strategy and implement automation at scale. Collaborate with development teams to enhance service reliability. Lead incident response and drive systematic improvements.

Key Responsibilities

Design and implement scalable infrastructure solutions to ensure system reliability.
Architect and maintain monitoring & alerting systems for observability.
Develop automated workflows to reduce manual effort.
Lead major incident response and drive continuous improvement.
Mentor team members and influence engineering decisions as a technical leader.
Build internal tools to increase operational efficiency.
Establish and enforce SRE methodologies and best practices.

Qualifications & Experience

10+ years in SRE, DevOps, or similar roles.
Strong networking fundamentals.
Proficient in AWS and cloud‑native technologies.
Experience with Python, Go, or JavaScript/TypeScript.
Experience with Docker, Kubernetes, CI/CD, and GitOps (Flux/ArgoCD).
Knowledge of monitoring tools (Grafana, Prometheus, Loki, Tempo).
Bonus: Advanced Kubernetes certification (CKA/CKAD), Terraform, PostgreSQL, MongoDB experience.
Expertise in performance optimization, cost management, security hardening, and compliance.

Tech Stack

Containerization: Kubernetes, Docker
Observability: Grafana + Prometheus stack
Infrastructure: Cloud‑native technologies
Programming: Go, Python, TypeScript/JavaScript
CI/CD: Modern pipeline tools
Multi‑region deployments & microservices architecture

Apply now with your latest and detailed CV!

Referrals increase your chances of interviewing at The JOB Portal by 2x.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.