Enable job alerts via email!
Boost your interview chances
Create a job specific, tailored resume for higher success rate.
An innovative firm is seeking a Research Engineer to join their Frontier Red Team, focusing on creating evaluation systems for AI safety. This role involves designing robust infrastructure to measure capabilities and risks, collaborating with domain experts, and contributing to industry standards. Ideal candidates will have strong software engineering skills, particularly in Python, and a passion for responsible AI development. If you're excited about tackling unprecedented technical challenges and making a significant impact in the AI field, this opportunity is perfect for you.
We're building a team to develop and run "gold standard" evaluations for catastrophic risks, to make sure we release models that are safe for the world to use.This work is at the core of implementing our Responsible Scaling Policy (RSP), which defines the technical and operational measures for safely training and deploying frontier AI models.
As a Research Engineer on the Frontier Red Team, you'll be creating evaluation systems that will help us understand and control some of the most capable AI systems ever built. You will collaborate with domain experts across multiple workstreams including biosecurity, autonomous replication, cybersecurity, and national security. You'll build, scale, and run evaluations to measure dangerous capabilities in models and determine if and when they cross ASL thresholds and require heightened security measures. Your work will directly inform decisions at the highest levels of the company and help establish standards that could influence the entire AI industry.
We are looking for engineers who can execute rapidly, maintain high throughput, and bring a strong builder mindset to solving complex problems. The ideal candidate will be able to quickly prototype and iterate on evaluation infrastructure while maintaining high engineering standards. You'll be building systems to evaluate capabilities that have never existed before, requiring creative solutions and rigorous implementation.
Deadline to apply: None. Applications will be reviewed on a rolling basis.