Enable job alerts via email!

Technical Staff, Data - London

Reflection

City Of London

On-site

GBP 70,000 - 100,000

Full time

Today
Be an early applicant

Job summary

A leading AI research firm in London is seeking a skilled researcher with a strong background in reinforcement learning and large-scale ML. You will conduct innovative research, lead data initiatives, and work alongside world-class professionals. Ideal candidates possess a publication record and are driven to stay ahead of AI advancements. This role offers competitive compensation and benefits.

Benefits

Opportunity for professional growth
Competitive salary
World-class research collaboration

Responsibilities

  • Conduct cutting-edge research in reinforcement learning and agentic capabilities.
  • Lead initiatives on data and RL environments.
  • Collaborate with a research team to publish impactful work.
  • Stay informed on advancements in agentic and LLM-based research.

Skills

Strong background in LLMs and/or reinforcement learning
End-to-end ML research capability
Experience with large-scale model training
Publication record in top ML conferences
Familiarity with RL environments
Job description
About Reflection AI

Reflection’s mission is to build open superintelligence and make it accessible to all.

We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Stanford and beyond.

We’re hiring in San Francisco, New York, and London.

Many of the problems we tackle require real-time alignment and fast resolution. We’ve found this is best achieved in-person. That said, we care most about working with the best people so we’d love to hear from you wherever you are.

What you will do
  • Conduct cutting-edge research in:

    • Reinforcement learning for reasoning and planning (long-horizon, hierarchical control)

    • Agentic capabilities and generalization

  • Lead Data and RL environment initiatives:

    • Curate dataset mixtures and design learning curricula (exploration, rewards, scaling laws)

    • Implement ML data pipelines for large-scale RL training

    • Scrape, collect and curate training and evaluation data

    • Build and maintain custom RL environments and evaluation benchmarks

  • Collaborate with a world-class research team to publish and open-source impactful work

  • Keep up with the latest advancements in agentic and LLM-based research, and bring relevant ideas into our systems

Qualifications
  • Strong background in LLMs and/or reinforcement learning

  • Demonstrated ability to carry out end-to-end ML research (problem formulation, experimentation, analysis)

  • Experience training large-scale models or working with distributed training infrastructure

  • A publication record in top ML conferences (NeurIPS, ICML, ICLR, etc.) is a strong plus

  • Familiarity with RL environments is a plus

What We Offer:
  • The opportunity to work at the forefront of AI research and data collection for training cutting-edge models.

  • Collaboration with a team of world-class researchers and engineers from top AI labs and companies.

  • Competitive compensation and benefits, with opportunities for professional growth.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.