Enable job alerts via email!

Site Reliability Engineer Manager

APPLE SOUTH ASIA PTE. LTD.

Singapore

On-site

SGD 90,000 - 130,000

Full time

Today
Be an early applicant

Job summary

A leading technology company in Singapore is seeking an experienced Site Reliability Engineer (SRE) to lead a high-performing team dedicated to automating and scaling core security platforms. The ideal candidate will have over 5 years of experience in SRE, strong cloud and automation skills, and excellent communication abilities. Join us to redefine reliability engineering practices at a global scale.

Qualifications

  • 5+ years of experience in SRE or Service Infrastructure roles with leadership experience.
  • Strong understanding of modern SRE practices including observability, automation, and reliability.
  • Excellent communication skills for collaboration across teams.

Responsibilities

  • Inspire and mentor a high-performing team of SREs.
  • Build resilient monitoring and alerting to minimize downtime.
  • Partner with InfoSec stakeholders to translate security needs into solutions.

Skills

SRE practices
Automation
Cloud platforms (AWS, GCP)
Communication skills
Container technologies (Docker, Kubernetes)

Education

Bachelor's degree in Computer Science or related field

Tools

Terraform
Ansible
Job description
Summary

Imagine what you could accomplish here. Bring your passion, creativity, and dedication, and there will be no limit to what you can achieve. This is not just another SRE role—it’s a chance to help redefine how reliability engineering is practiced at hyper-scale. Our team is building the platforms that will autonomously operate Apple’s core information security systems, setting a new bar for how critical services are managed.

Description

We are seeking exceptional engineers who thrive at the intersection of reliability, software development and automation — individuals driven to push the boundaries of what’s possible. The ideal candidate has a strong foundation in modern SRE practices and a proven ability to design and implement software that solves operational challenges. You’ll break new ground using the most advanced tools and approaches available, developing automation that doesn’t just keep pace with scale but anticipates, reacts and stays ahead of it. You will work closely with Security Engineering, Threat Detection, Incident Response and other internal functions to ensure the scalability, availability and security of the tools and infrastructure that support our cybersecurity mission. Join us, and help build the future of self-managing systems at one of the most innovative companies in the world.

Responsibilities
  • Inspire, mentor, and grow a high-performing team of SREs dedicated to automating and scaling Apple’s core security platforms.
  • Champion operational excellence by building resilient monitoring, alerting, and automated remediation practices that minimize downtime and manual effort.
  • Advance infrastructure-as-code and automation to eliminate toil, improve consistency, and accelerate delivery of secure, reliable services.
  • Partner closely with InfoSec stakeholders to translate security requirements into scalable, supportable, and performant solutions.
  • Own the reliability of critical security systems—including SIEM, SOAR, telemetry, and vulnerability management—ensuring availability, performance, and capacity keep pace with business demand.
  • Lead incident response with confidence, driving resolution of outages and infrastructure issues while fostering a blameless, learning-oriented culture.
  • Define and enforce SLOs/SLIs for InfoSec services, using data to measure success and continuously improve.
  • Collaborate across engineering and IT to embed best practices in CI/CD, containerization, and service orchestration.
  • Uphold strong security hygiene and compliance, aligning with both internal standards and external regulatory requirements.
  • Set direction and priorities for the team, managing resources, timelines, and initiatives to maximize impact.
Minimum Qualifications
  • 5+ years of experience in SRE or Service Infrastructure roles, including 2+ years in a leadership or managerial role
  • Strong understanding of modern SRE practices, including observability, automation, and reliability engineering
  • Experience with cloud platforms (AWS, GCP) and infrastructure-as-code tools (Pulumi, Terraform, Ansible, etc.)
  • Familiarity with container technologies (Docker, Kubernetes) and CI/CD pipelines
  • Excellent communication skills with an ability to collaborate across technical and non-technical teams
Preferred Qualifications
  • Bachelorʼs degree in Computer Science, or a related field, or equivalent practical experience
  • Prior experience working in or closely with Information Security teams
  • The ability to contribute and review code in Python, Go, Swift or other scripting languages
  • Experience operating with Scrum/Agile development methodologies
  • Ability to cultivate an environment that emphasizes collaboration, accountability, and excellence
  • Experience managing systems that support InfoSec functions (e.g., security monitoring, log aggregation, scanning tools)
  • Ability to work under pressure and manage difficult situations in a dynamic work environment
  • Passion for high-quality code, unit-tests, documentation, and production services
  • Previous experience working on a global team with 24/7 support model

Apple is an equal opportunity employer that is committed to inclusion and diversity, and thus we treat all applicants fairly and equally. Apple is committed to working with and providing reasonable accommodation to applicants with physical and mental disabilities.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.