Enable job alerts via email!

Technical Staff, Data - London

Reflection

City Of London

On-site

GBP 70,000 - 100,000

Full time

Today

Be an early applicant

Job summary

A leading AI research firm in London is seeking a skilled researcher with a strong background in reinforcement learning and large-scale ML. You will conduct innovative research, lead data initiatives, and work alongside world-class professionals. Ideal candidates possess a publication record and are driven to stay ahead of AI advancements. This role offers competitive compensation and benefits.

Benefits

Opportunity for professional growth

Competitive salary

World-class research collaboration

Responsibilities

Conduct cutting-edge research in reinforcement learning and agentic capabilities.
Lead initiatives on data and RL environments.
Collaborate with a research team to publish impactful work.
Stay informed on advancements in agentic and LLM-based research.

Skills

Strong background in LLMs and/or reinforcement learning

End-to-end ML research capability

Experience with large-scale model training

Publication record in top ML conferences

Familiarity with RL environments

About Reflection AI

Reflection’s mission is to build open superintelligence and make it accessible to all.

We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Stanford and beyond.

We’re hiring in San Francisco, New York, and London.

Many of the problems we tackle require real-time alignment and fast resolution. We’ve found this is best achieved in-person. That said, we care most about working with the best people so we’d love to hear from you wherever you are.

What you will do

Conduct cutting-edge research in:
- Reinforcement learning for reasoning and planning (long-horizon, hierarchical control)
- Agentic capabilities and generalization
Lead Data and RL environment initiatives:
- Curate dataset mixtures and design learning curricula (exploration, rewards, scaling laws)
- Implement ML data pipelines for large-scale RL training
- Scrape, collect and curate training and evaluation data
- Build and maintain custom RL environments and evaluation benchmarks
Collaborate with a world-class research team to publish and open-source impactful work
Keep up with the latest advancements in agentic and LLM-based research, and bring relevant ideas into our systems

Qualifications

Strong background in LLMs and/or reinforcement learning
Demonstrated ability to carry out end-to-end ML research (problem formulation, experimentation, analysis)
Experience training large-scale models or working with distributed training infrastructure
A publication record in top ML conferences (NeurIPS, ICML, ICLR, etc.) is a strong plus
Familiarity with RL environments is a plus

What We Offer:

The opportunity to work at the forefront of AI research and data collection for training cutting-edge models.
Collaboration with a team of world-class researchers and engineers from top AI labs and companies.
Competitive compensation and benefits, with opportunities for professional growth.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Technical Staff, Data - London

Reflection

City Of London

On-site

GBP 70,000 - 100,000

Full time

Job summary

Benefits

Responsibilities

Skills

Company

Services

Free resources

Support

Technical Staff, Data - London

Reflection

City Of London

On-site

GBP 70,000 - 100,000

Full time

Job summary

Benefits

Responsibilities

Skills

Follow us

Company

Services

Free resources

Support