
A leading AI research organization in London is seeking a Research Scientist with expertise in developing safeguards for open-source models. The successful candidate will focus on mitigating risks related to AI-generated harmful content, engaging with various stakeholders across government and industry. Ideal candidates will possess strong technical skills in machine learning, particularly with open-weight models. This position offers competitive compensation, unique opportunities for influence in AI governance, and the flexibility of hybrid working arrangements.
London, UK
The AI Security Institute is the world's largest and best-funded team dedicated to understanding advanced AI risks and translating that knowledge into action. We’re in the heart of the UK government with direct lines to No. 10 (the Prime Minister's office), and we work with frontier developers and governments globally.
We’re here because governments are critical for advanced AI going well, and UK AISI is uniquely positioned to mobilise them. With our resources, unique agility and international influence, this is the best place to shape both AI development and government action.
Societal Resilience is a multidisciplinary team that studies how advanced AI models can impact people and society. We research the prevalence and severity of high‑impact societal risks caused by frontier AI deployment, and develop mitigations to address these risks. Core research topics include the use of AI to assist with criminal activities, undermine trust in information, jeopardise psychological wellbeing, or conduct malicious social engineering, as well as preventing critical overreliance on insufficiently robust systems. We are interested in both immediate and medium‑term risks.
One emerging risk area we are concerned with is the use of open‑weight models to generate child sexual abuse material (CSAM) and non‑consensual intimate imagery (NCII). AISI has previously published research on methods for making open‑weight models more robust against malicious tampering. In this role, you’ll join a highly collaborative technical research team to help design and develop technical safeguards for open‑weight models that reduce the risk of CSAM, NCII, and other harms. We do not expect this role to handle this kind of content directly.
This is a research scientist position focused on developing technical safeguards against tampering with open‑weight models. The role will focus on mitigating AI‑generated CSAM and NCII by targeting the real‑world supply chain driving harm: open‑weight models, adaptation artifacts (LoRAs, guides), and downstream distribution infrastructure (hosting platforms, app stores, operating systems).
Our approach prioritises downstream mitigations and actors beyond frontier model developers. This role will build technical tools, protocols, and evidence that platforms and OS/app ecosystems can adopt.
This work belongs inside UK government because effective mitigation requires cross‑agency coordination (Home Office, DSIT, Ofcom), engagement with regulated platforms under the Online Safety Act, and credible evidence to inform policy trade‑offs across innovation, competition, and child protection.
This role will synthesise threat intelligence on how AI‑generated CSAM and NCII are developed, create scalable screening methodologies that platforms can realistically run, and publish best‑practice protocols with NGOs to raise the floor across the ecosystem.
You’ll work closely with engineers and domain experts across AISI, as well as external research collaborators at Home Office, Internet Watch Foundation, and Ofcom. Researchers on this team have substantial freedom to shape independent research agendas, lead collaborations, and initiate projects that push the frontier of what evaluations can reveal.
Your work will raise safety standards across hosting and distribution layers, reduce the availability of CSAM/NCII‑generating artifacts (e.g., LoRAs) on major platforms, inform industry protocols and possibly standards, and provide actionable evidence for government decisions.
Crucially, we do not expect this role to handle NCII or CSAM material.
We’re flexible on the exact profile and expect successful candidates will meet many (but not necessarily all) of the criteria below. Depending on experience, we will consider candidates at either the RS or Senior RS level.
Annual salary is benchmarked to role scope and relevant experience. Most offers land between £65,000 and £145,000 (base plus technical allowance), with 27% employer pension and other benefits on top (details in the “what we offer” section of our careers page).
This role sits outside of the DDaT pay framework, as its scope requires in-depth technical expertise in frontier AI safety, robustness, and advanced AI architectures.
In accordance with the Civil Service Commission rules, the following list contains all selection criteria for the interview process. The interview process may vary from candidate to candidate, but you should expect a typical process to include technical proficiency tests, discussions with a cross‑section of our team at AISI (including non‑technical staff), and conversations with your team lead. The process will culminate in a conversation with members of the senior team here at AISI.
Candidates should expect to go through some or all of the following stages once an application has been submitted:
The Civil Service Code sets out the standards of behaviour expected of civil servants. The Civil Service embraces diversity and promotes equal opportunities. We run a Disability Confident Scheme for candidates with disabilities who meet the minimum selection criteria. The Civil Service also offers a Redeployment Interview Scheme to civil servants who are at risk of redundancy, and who meet the minimum requirements for the advertised vacancy.