Enable job alerts via email!

Site Reliability Engineer

Apple

Singapore

On-site

SGD 80,000 - 120,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading technology company in Singapore is seeking a Site Reliability Engineer (SRE) who excels at reliability, software development, and automation. This role involves operating and monitoring production environments, implementing advanced automation tools, and working collaboratively with various teams to enhance operational efficiency. Ideal candidates will have a strong background in modern SRE practices and programming skills in Python, Go, or Swift. Opportunities for personal development and innovation are plentiful in this role.

Qualifications

  • Proven experience in Site Reliability Engineering or a related field.
  • Experience with CI/CD problem diagnosis and troubleshooting.
  • Experience working with cloud compute environments like AWS, GCP or Azure.

Responsibilities

  • Operate, monitor, and triage all aspects of our production and non-production environments.
  • Pioneer and implement the next generation telemetry system for AIS services.
  • Automate deployment and orchestration of services into the cloud environment.

Skills

Python
Go
Swift
Site Reliability Engineering
Cloud environments (AWS, GCP, Azure)
Infrastructure as code (IaC)
Automation
Shell scripting

Education

Bachelor’s degree in Computer Science or related field

Tools

Terraform
Ansible
Docker
Kubernetes
Splunk
Grafana
Prometheus
Job description
Summary

Imagine what you could accomplish here. Bring your passion, creativity, and dedication, and there will be no limit to what you can achieve. This is not just another SRE role — it’s a chance to help redefine how reliability engineering is practiced at hyper-scale. Our team is building the platforms that will autonomously operate Apple’s core information security systems, setting a new bar for how critical services are managed.

Description

We are seeking exceptional engineers who thrive at the intersection of reliability, software development and automation — individuals driven to push the boundaries of what’s possible. The ideal candidate has a strong foundation in modern SRE practices and a proven ability to design and implement software that solves operational challenges. You’ll break new ground using the most advanced tools and approaches available, developing automation that doesn’t just keep pace with scale but anticipates, reacts and stays ahead of it. You will work closely with Security Engineering, Threat Detection, Incident Response and other internal functions to ensure the scalability, availability and security of the tools and infrastructure that support Apple’s cybersecurity mission. Join us, and help build the future of self-managing systems at one of the most innovative companies in the world.

Responsibilities
  • Our team is highly collaborative, working closely with partner teams to deliver the best results for Apple. We strive to find the best solution while also considering the need to get things done efficiently for each engineering challenge we face. Good ideas are valued and rewarded.
  • As an SRE in Apple Information Security, you will:
  • Operate, monitor, and triage all aspects of our production and non-production environments
  • Pioneer and implement the next generation telemetry system for AIS services
  • Establish alert handling procedures, run-books, and collaborate with our global security team
  • Automate deployment and orchestration of services into the cloud environment as well as other routine processes
  • Actively participate in capacity planning and disaster recovery exercises
  • Interact with and support partner teams across the enterprise
  • Cultivate and maintain relationships with internal and external third party vendors
Minimum Qualifications
  • Bachelor’s degree in Computer Science, or a related field, or equivalent practical experience
  • Proven experience in Site Reliability Engineering or a related field
  • Strong programming skills: Python, Go or Swift
  • Experience working with cloud compute environments like AWS, GCP or Azure
  • Experience with infrastructure as code (IaC), configuration management, CI/CD, and automation, e.g., Terraform, Pulumi, CloudFormation, Ansible, Chef, Puppet, Jenkins
  • Cloud deployment and CI/CD problem diagnosis and troubleshooting
Preferred Qualifications
  • Experience or experimentation building systems that leverage Agentic AI principles, tools, platforms and frameworks
  • Strong understanding and experience in implementing monitoring and observability tools like Splunk, Grafana, Prometheus
  • Building and operating container orchestrating systems (Docker, Kubernetes, Vagrant and micro-services)
  • Experience administering and troubleshooting Linux systems including the usage of standard Linux utilities
  • Experience in shell scripting (e.g., bash/zsh) and system administration
  • Experience with measuring, analyzing, and optimizing system performance
  • Passion for high-quality code, tests, documentation and production services
  • Participation in an on-call rotation

Apple is an equal opportunity employer that is committed to inclusion and diversity, and thus we treat all applicants fairly and equally. Apple is committed to working with and providing reasonable accommodation to applicants with physical and mental disabilities.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.