
A pioneering AI company in London seeks a Senior AI Engineer – Reinforcement Learning Lead to innovate in software quality. The role involves shipping ML to production, focusing on reinforcement learning, and developing autonomous agents. You'll work closely with experienced leaders, pushing boundaries in AI technology. Join a mission to redefine software testing while enjoying equity and substantial resources to aid your development in a fast-paced startup environment.
Change Software Forever
QA slows the world down. Flaky tests kill trust, stall releases, and bleed engineering velocity.
Duku AI is ending that era.
We’re building autonomous agents that think like engineers: they run every critical user journey, catch failures before users do, and self-heal as the codebase evolves. Real AI teammates, not test scripts that break on impact.
We’re venture-backed and led by operators who’ve scaled Meta’s testing infrastructure, launched Uber’s global playbooks, and grown Deliveroo from zero to hypergrowth. We know what elite execution looks like and we’re hunting for one more builder to help us rewrite the rules of software quality.
Why This Role is Different
Most “AI engineer” jobs are just applying models someone else built. This isn’t that. This is about pushing RL to its edge:
If you’ve ever wanted to take RL out of papers and into the wild, this is it.
What You’ll Achieve
In your first three months, you’ll see your reinforcement learning prototypes running live inside real applications, surfacing bugs no human ever noticed.
By six months, those agents will have evolved, scaling across multiple environments, learning and adapting in ways that prove this isn’t theory but reality.
And within a year, the intelligence you’ve built will sit at the heart of every release for our first customers, powering their ability to ship AI-generated code with confidence.
What You Bring (Non‑Negotiables)
The Stuff That Matters
Why Join Now
The Challenge
Big tech tried to brute‑force this problem and hit a wall. Most startups never got past brittle scripts. The reason is simple: building true autonomy takes more than patching frameworks; it takes intelligence. That’s the path we’re on. Your system will need to:
It won’t be easy. That’s the point.
What You Get
To win this space, we’re looking for the best people in London, with 10/10 ambition and work ethic, to join us and build a product people love.