Enable job alerts via email!
Boost your interview chances
Create a job specific, tailored resume for higher success rate.
A leading company in AI research is seeking a Research Engineer in Machine Learning to advance large language models through reinforcement learning. The role involves collaboration with researchers to improve AI capabilities and safety. Candidates should have a strong software engineering background and proficiency in Python. The position offers competitive compensation and a hybrid work environment.
London, UK
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
As a Research Engineer on the Reinforcement Learning Fundamentals team, you will collaborate with a diverse group of researchers and engineers to advance the capabilities and safety of large language models through fundamental research in reinforcement learning, improving reasoning abilities in areas such as code generation and mathematics, and exploring reinforcement learning for agentic / open-ended tasks.
Deadline to apply: None. Applications will be reviewed on a rolling basis.
The expected salary range for this position is:
Education requirements: Bachelor’s degree in a related field or equivalent experience.
Location-based hybrid policy: Currently, all staff are expected to be in the office at least 25% of the time, with some roles requiring more.
Visa sponsorship: We sponsor visas! We will make every effort to assist with visa processes if we make an offer.
We encourage you to apply even if you do not meet every qualification. Diversity and representation are important to us, and we value different perspectives in our team.
We believe impactful AI research is big science, focusing on large-scale efforts with high impact, akin to empirical sciences like physics and biology. We value collaboration, impact, and communication, hosting frequent discussions to pursue high-impact work.
Our recent research includes GPT-3, interpretability, multimodal neurons, scaling laws, AI & compute, safety, and human preferences.
Anthropic is headquartered in San Francisco, offering competitive compensation, benefits, equity donation matching, generous leave, flexible hours, and a collaborative office environment.
* indicates a required field