London, UK
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We aim for AI to be safe and beneficial for users and society. Our team is a growing group of researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
As a Research Engineer on the Reinforcement Learning Fundamentals team, you will work with researchers and engineers to advance the capabilities and safety of large language models through fundamental research in reinforcement learning. This includes enhancing reasoning abilities in areas such as code generation and mathematics, and exploring reinforcement learning for agentic and open-ended tasks.
Deadline to apply: Rolling review; no fixed deadline.
The expected salary range is provided separately.
Education: Bachelor's degree or equivalent experience in a related field.
Location-based hybrid policy: Expectation of being in the office at least 25% of the time; some roles may require more.
Visa sponsorship: We sponsor visas when possible; we will make reasonable efforts to assist if you are offered the role.
We encourage applicants from diverse backgrounds and those who may not meet every qualification to apply.
We focus on big science and impactful research, and we value collaboration, communication, and impact in AI safety and steerability. Research we have contributed to includes GPT-3, interpretability, multimodal neurons, scaling laws, and more.
Anthropic is headquartered in San Francisco, and we offer competitive compensation and benefits, flexible working hours, and a collaborative environment.