Evals Research Scientist / Engineer

COL Limited

London

On-site

GBP 40,000 - 70,000

Full time

30+ days ago

Job summary

COL Limited is seeking research scientists, research engineers, and software engineers to join its team in London. The roles involve evaluating large language models and improving AI capabilities through empirical research and software development. Ideal candidates have skills in LLM steering, prompting, and software engineering, and will join an inclusive and diverse workplace.

Benefits

Flexible work hours and schedule

Unlimited vacation

Unlimited sick leave

Lunch, dinner, and snacks provided

Paid work trips and conferences

Yearly professional development budget (£1,000)

Qualifications

Candidates with self-taught skills are encouraged to apply.
Experience with LLMs is essential.
Strong software engineering and empirical research background preferred.

Responsibilities

Focus on evaluating large language models (LLMs).
Conduct empirical research and model evaluations.
Develop software tools for LLM evaluation.

Skills

Large Language Model (LLM) steering

Prompting

Software engineering

Empirical Research

Generalist skills

Applications deadline: We are reviewing applications on a rolling basis. It might take a few weeks until you hear from us.

ABOUT APOLLO RESEARCH

The capabilities of current AI systems are evolving at a rapid pace. While these advancements offer tremendous opportunities, they also present significant risks, such as the potential for deliberate misuse or the deployment of sophisticated yet misaligned models. At Apollo Research, our primary concern lies with deceptive alignment, a phenomenon where a model appears to be aligned but is, in fact, misaligned and capable of evading human oversight.

Our approach focuses on behavioral model evaluations, which we then use to audit real-world models. We also combine black-box approaches with applied interpretability. In our evaluations, we focus on LM agents, i.e. LLMs with agentic scaffolding similar to AIDE or SWE-agent. We also study model organisms in controlled environments (see our security policies), e.g. to better understand capabilities related to scheming.

At Apollo, we aim for a culture that emphasizes truth-seeking, being goal-oriented, giving and receiving constructive feedback, and being friendly and helpful. If you’re interested in more details about what it’s like working at Apollo, you can find more information here.

ABOUT THE TEAM

The current evals team consists of Mikita Balesni, Jérémy Scheurer, Alex Meinke, Rusheb Shah, Bronson Schoen, and Axel Højmark. Marius Hobbhahn manages and advises the evals team, though team members lead individual projects. You will mostly work with the evals team, but you will likely sometimes interact with the interpretability team, e.g. for white-box evaluations, and with the governance team to translate technical knowledge into concrete recommendations. You can find our full team here.

ABOUT THE ROLE

We’re looking for research scientists, research engineers, and software engineers who are excited to work on the projects listed below and similar ones. We intend to hire people with a broad range of experience and encourage applications even if you don’t yet have experience in any of our current team efforts. We welcome applicants of all ethnicities, genders, sexes, ages, abilities, religions, and sexual orientations, regardless of pregnancy or maternity, marital status, or gender reassignment.

EVALS TEAM WORK. The evals team focuses on the following efforts:

Conceptual work on safety cases for scheming, for example, our work on evaluation-based safety cases.
Building evaluations for scheming-related properties, such as situational awareness or deceptive reasoning.
Conducting evaluations on frontier models and publishing the results either to the general public or a target audience such as AI developers or governments, for example, our work in OpenAI’s o1-preview system card.
Creating model organisms and demonstrations of behavior related to deceptive alignment, e.g. exploring the influence of goal-directedness on scheming.
Applied interpretability work that directly informs our evaluations, e.g. Detecting Strategic Deception Using Linear Probes.
Designing and evaluating AI control protocols. We have not started these efforts yet but intend to work on them starting Q2 2025.
Building a high-quality software stack to support all of these efforts. We have recently switched to Inspect as our primary evals framework.

CANDIDATE CHARACTERISTICS

For all skills, we don’t require a formal background or industry experience and welcome self-taught candidates.

Large Language Model (LLM) steering: The core skill of our evals research scientist role is steering LLMs. This can take many different forms, such as:
Prompting: eliciting specific behavior through clever word choice.
LM agents & scaffolding: chaining inputs and outputs from various models in a structured way, making them more goal-directed and agentic.
Fluent LLM usage: With increasing capabilities, we can use LLMs to speed up all parts of our pipeline. We welcome candidates who have integrated LLMs into their workflow.
Supervised fine-tuning: creating datasets and then fine-tuning models to improve a specific capability or to study aspects of learning/generalization.
RL(HF/AIF): using other models, programmatic reward functions, or custom reward models as a source of feedback for fine-tuning an existing LLM.

Software engineering: Model evaluators benefit from a solid foundation in software engineering. This can include developing APIs (ideally around LLMs or eval tasks), data science, system design, data engineering, and front-end development.
Generalist: Most evals tasks require a wide range of skills ranging from LLM fine-tuning to developing front-end labeling interfaces. Therefore, we're seeking individuals with diverse skill sets, a readiness to acquire new skills rapidly, and a strong focus on results.
Empirical Research Experience: We’re looking for candidates with prior empirical research experience. This includes the design and execution of experiments as well as writing up and communicating these findings. Optimally, the research included working with LLMs. This experience can come from academia, industry, or independent research.
Scientific mindset: Evals results are easy to overinterpret, so we think a core skill of a good evals engineer or scientist is keeping track of potential alternative explanations for findings. Ideally, any candidate should be able to propose and test these alternative hypotheses in new experiments.
Values: We’re looking for team members who thrive in a collaborative environment and are results-oriented. You can find out more about our culture here.

Additionally, “nice to have” skills include experience related to AI control and cyber security.
Depending on your preferred role, we will weigh these characteristics differently, e.g. software engineers don’t have to have research experience, but must have strong software engineering skills.

LOGISTICS

Start Date: Target of 2-3 months after the first interview.
Time Allocation: Full-time.
Location: The office is in London, and the building is shared with the London Initiative for Safe AI (LISA) offices. This is an in-person role. In rare situations, we may consider partially remote arrangements on a case-by-case basis.
Work Visas: We can sponsor UK visas.

BENEFITS:

Salary: a competitive UK-based salary.
Flexible work hours and schedule.
Unlimited vacation.
Unlimited sick leave.
Lunch, dinner, and snacks are provided for all employees on workdays.
Paid work trips, including staff retreats, business trips, and relevant conferences.
A yearly $1,000 (USD) professional development budget.

We want to emphasize that people who feel they don’t fulfill all of these characteristics but think they would be a good fit for the position nonetheless are strongly encouraged to apply. We believe that excellent candidates can come from a variety of backgrounds and are excited to give you opportunities to shine.

Equality Statement: Apollo Research is an Equal Opportunity Employer. We value diversity and are committed to providing equal opportunities to all, regardless of age, disability, gender reassignment, marriage and civil partnership, pregnancy and maternity, race, religion or belief, sex, or sexual orientation.

How to apply: Please complete the application form with your CV. A cover letter is optional. Please also feel free to share links to relevant work samples.

About the interview process: Our multi-stage process includes a screening interview, a take-home test (approx. 2 hours), 3 technical interviews, and a final interview with Marius (CEO). The technical interviews will be closely related to tasks the candidate would do on the job. There are no leetcode-style general coding interviews. If you want to prepare for the interviews, we suggest working on hands-on LLM evals projects (e.g. as suggested in our starter guide), such as building LM agent evaluations in Inspect.

* This role is supported by AI Futures Grants, a UK Government program designed to help the next generation of AI leaders meet the costs of relocating to the UK. AI Futures Grants provide financial support to reimburse relocation costs such as work visa fees, the immigration health surcharge, and travel/subsistence expenses. Successful candidates for this role may be able to get up to £10,000 to meet associated relocation costs, subject to terms and conditions.
