AIML - Site Reliability Engineer (SRE), Siri Knowledge Platforms

Apple

London

On-site

GBP 70,000 - 90,000

Full time

26 days ago

Job summary

A leading technology company in London is seeking an SRE for its AI/ML organisation. You will manage the infrastructure for Siri and other user-facing solutions, ensuring stability and security. A strong background in Kubernetes, automation with Go or Python, and configuration management is required. The role includes mentoring team members and participating in on-call rotations.

Qualifications

  • Strong sense of ownership and integrity.
  • Sophisticated knowledge of Kubernetes and containerisation.
  • Proficiency in Go, Python or similar for automation.
  • Experience with configuration management tools.

Responsibilities

  • Responsible for infrastructure powering Siri and other solutions.
  • Improve stability, security, efficiency, and scalability.
  • Build and maintain documentation reflecting configuration.
  • Mentor new team members and participate in on-call rotations.

Skills

Ownership and integrity
Kubernetes
Containerisation systems
Public cloud infrastructure
Go
Python
Configuration management
Troubleshooting

Tools

Puppet
Chef
Ansible
Spinnaker

Job description

Play a meaningful role in revolutionising how people use their computers and mobile devices. Build groundbreaking technology for algorithmic search, machine learning, natural language processing, and artificial intelligence, and work with the teams building the most scalable big-data systems in existence.

Description

As an SRE in the AI/ML organisation within Apple, you will be directly responsible for the infrastructure that powers Siri, search, and other high-impact user-facing solutions running on millions of Apple devices worldwide. We strive to improve the stability, security, efficiency, and scalability of a 24/7 global service. On-call rotations are shared across geographically distributed SRE teams to provide follow-the-sun support. You will use your strong troubleshooting ability daily to isolate issues and resolve their root causes through investigative analysis. The role also requires building and maintaining accurate, up-to-date documentation reflecting configuration, providing code reviews, and mentoring new team members. The ideal candidate is a focused, independent problem-solver who can deftly handle multiple competing priorities and deliver solutions in a timely manner.

Minimum Qualifications

  • A strong sense of ownership and integrity demonstrated through clear communication and collaboration.
  • Sophisticated knowledge of one or more of the following: Kubernetes, containerisation systems, and/or public cloud infrastructure (AWS, GCP).
  • Proficiency in Go, Python, or similar language to automate tasks.
  • Hands-on experience handling large numbers of diverse systems with configuration management or software delivery platforms (such as Puppet, Chef, Ansible, and Spinnaker).

Preferred Qualifications

  • Working knowledge of multi-tier applications and their dependencies including load balancing, TCP/IP networking, web services, LDAP and DNS.
  • Proficiency with web server administration including Apache and Nginx.
  • Knowledge of database design, support and administration including Postgres, MySQL, and HBase.
  • Network administration and troubleshooting.
  • Good interpersonal skills shown through previous projects or assignments.