Site Reliability Engineering Officer

Takamol Holding

Riyadh

On-site

SAR 200,000 - 300,000

Full time

Yesterday

Job summary

A prominent technology firm is seeking a skilled Site Reliability Engineering Officer to join its Infrastructure team in Riyadh. The role centers on managing cloud and containerized platforms to ensure reliability and operational excellence. Key responsibilities include maintaining cloud infrastructure, deploying containerized workloads, and implementing GitOps practices. Candidates should have at least 2 years of relevant experience, proficiency with tools such as Kubernetes and Terraform, and strong automation skills. This is an opportunity for motivated professionals to improve service reliability in a dynamic environment.

Skills

DevOps
Site Reliability Engineering
Kubernetes
Docker
GitLab CI/CD
Terraform
Ansible
Prometheus
Observability
Scripting (Bash, Python)

Tools

Git
SonarQube

Job description

We are seeking a skilled and motivated Site Reliability Engineering Officer (SRE) to join our Infrastructure team. This role focuses on operating, automating, and ensuring the reliability, scalability, and security of cloud and containerized platforms.

You will play a key role in maintaining high service availability and driving operational excellence across our cloud environments.

Key Responsibilities
  • Operate, monitor, and maintain cloud-native infrastructure across OCI, GCP, and private cloud environments, ensuring high availability and scalability.
  • Deploy, manage, and support containerized workloads using Kubernetes and Docker.
  • Implement and manage GitOps practices using GitLab CI/CD for automated and auditable deployments.
  • Build, manage, and maintain Infrastructure as Code (IaC) using Terraform, ensuring compliance with best practices.
  • Automate operational tasks and configuration management using Ansible.
  • Implement and maintain monitoring, logging, and observability solutions using Prometheus, ELK, and alerting frameworks.
  • Develop and maintain operational runbooks, automation scripts, and technical documentation.
  • Participate in incident response, root cause analysis, and post-incident reviews.
  • Collaborate closely with development, security, and platform teams to improve service reliability and efficiency.
  • Apply SRE principles including SLIs, SLOs, and error budgets to continuously improve system reliability.
  • Enforce cloud, Kubernetes, and security best practices in line with governance and compliance requirements.

Required Skills & Qualifications
  • 2+ years of experience in DevOps, Platform Engineering, or Site Reliability Engineering roles.
  • Hands-on experience with OCI, GCP, and private cloud environments.
  • Strong experience with Kubernetes and Docker.
  • Proficiency in GitLab CI/CD, Git, and GitOps workflows.
  • Solid experience using Terraform for infrastructure provisioning.
  • Strong automation and configuration management skills using Ansible.
  • Experience with monitoring and observability tools such as Prometheus and ELK.
  • Proficiency in scripting languages such as Bash and Python.
  • Good understanding of cloud and Kubernetes security best practices.

Preferred Qualifications
  • Experience with hybrid-cloud or multi-cloud architectures.
  • Practical knowledge of SRE practices (SLIs, SLOs, error budgets, incident management).
  • Experience with code quality and static analysis tools such as SonarQube.
  • Relevant certifications such as CKA, CKAD, Terraform Associate, or OCI or GCP cloud certifications.