Enable job alerts via email!
A fast-growing AI startup in London is seeking a Site Reliability Engineer to manage performance and reliability as user traffic climbs. This role requires hands-on experience in scaling systems, defining SLOs, and automating operations on Kubernetes. Ideal candidates will thrive in a collaborative startup environment and prioritize impactful work. The position offers a competitive salary, equity, and a hybrid working model.
Gizmo is an AI startup on a mission to make learning so easy that anyone can learn anything. We're building Duolingo for anything - a platform that uses gamification and social mechanics to make learning fun.
With over 1 million monthly active users and $4M in annual recurring revenue, we’re already one of the fastest-growing startups in the UK. Backed by leading investors, we recently raised $16M in Series A funding to accelerate our vision of helping 1 billion people learn.
Role Overview
Reporting to the founders, you will own capacity, performance and reliability for Gizmo’s full-stack platform as daily traffic climbs from hundreds of thousands to millions of users. You’ll write code across the stack, but your charter is classic SRE: defend SLOs, eliminate toil, and raise the ceiling on scale before it becomes a hard limit.
Key Responsibilities
Nice-to-haves: experience with Hasura internals, Cloudflare Workers edge optimisation, or running OpenSearch clusters at scale.