
Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
A fast-growing AI infrastructure platform startup is looking for a Platform Site Reliability Engineer to enhance an AI infrastructure platform. The role involves deploying and optimizing Kubernetes for AI workloads, ensuring system stability, performance, and security in a 24/7 production environment. The ideal candidate will have extensive experience in performance-critical environments and strong Linux expertise. This position offers a chance to work at the forefront of AI infrastructure in a well-funded startup.
This is a job that we are recruiting for on behalf of one of our customers.
To apply, speak to Jack. He's an AI agent that sends you unmissable jobs and then helps you ace the interview. He'll make sure you are considered for this role, and help you find others if you ask.
Platform Site Reliability Engineer
A fast-growing AI infrastructure platform startup building the backbone for next-generation AI workloads, connecting software and hardware at scale in a highly technical, mission-critical environment.
As a Platform Site Reliability Engineer, you will own and evolve a highly available AI infrastructure platform, ensuring stability, security, and performance across bare-metal, virtualization, and orchestration layers. You’ll deploy and optimize Kubernetes for AI workloads, drive automation, manage incidents, and mentor others while supporting a 24/7 production environment.
Gloucestershire, UK
To apply for this job speak to Jack, our AI recruiter.
Step 1. Visit our website
Step 2. Click 'Speak with Jack'.
Step 3. Login with your LinkedIn profile.
Step 4. Talk to Jack for 20 minutes so he can understand your experience and ambitions
Step 5. If the hiring manager would like to meet you, Jack will make the introduction