Enable job alerts via email!
A leading AI research firm in London is seeking a skilled researcher with a strong background in reinforcement learning and large-scale ML. You will conduct innovative research, lead data initiatives, and work alongside world-class professionals. Ideal candidates possess a publication record and are driven to stay ahead of AI advancements. This role offers competitive compensation and benefits.
Reflection’s mission is to build open superintelligence and make it accessible to all.
We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Stanford and beyond.
We’re hiring in San Francisco, New York, and London.
Many of the problems we tackle require real-time alignment and fast resolution. We’ve found this is best achieved in-person. That said, we care most about working with the best people so we’d love to hear from you wherever you are.
Conduct cutting-edge research in:
Reinforcement learning for reasoning and planning (long-horizon, hierarchical control)
Agentic capabilities and generalization
Lead Data and RL environment initiatives:
Curate dataset mixtures and design learning curricula (exploration, rewards, scaling laws)
Implement ML data pipelines for large-scale RL training
Scrape, collect and curate training and evaluation data
Build and maintain custom RL environments and evaluation benchmarks
Collaborate with a world-class research team to publish and open-source impactful work
Keep up with the latest advancements in agentic and LLM-based research, and bring relevant ideas into our systems
Strong background in LLMs and/or reinforcement learning
Demonstrated ability to carry out end-to-end ML research (problem formulation, experimentation, analysis)
Experience training large-scale models or working with distributed training infrastructure
A publication record in top ML conferences (NeurIPS, ICML, ICLR, etc.) is a strong plus
Familiarity with RL environments is a plus
The opportunity to work at the forefront of AI research and data collection for training cutting-edge models.
Collaboration with a team of world-class researchers and engineers from top AI labs and companies.
Competitive compensation and benefits, with opportunities for professional growth.