A dynamic startup is seeking an Applied Scientist specializing in AI evaluations to contribute to the development of adversarial tests for Large Language Models (LLMs). This role offers the chance to work closely with a fresh team of innovators, providing significant input on the project roadmap and a competitive compensation package that includes salary, bonuses, and share options. Flexibility in work arrangements is encouraged, catering to both on-site and remote applicants within the UK or EU.
Trismik
Kingston upon Hull, East Yorkshire, United Kingdom
7
06.06.2025
21.07.2025
At Trismik, we're a team of tech geeks from the University of Cambridge, Salesforce, and Amazon looking to push the boundaries of AI through science-led evaluations. If you want to make a difference to the future of AI, have a PhD in NLP, and like to turn ideas into reality, we'd love to hear from you.
Role
We are developing adversarial tests for Large Language Models (LLMs). We are looking for a passionate, talented, and innovative applied scientist with a strong background in algorithms to help build an industry-leading evaluation engine for LLMs and bring it to market. As one of our first hires, you will have a high-impact, high-ownership role and a strong say in how our science and product roadmap evolves.
Our mission is to provide the fastest and most accurate testing environment for AI engineers who want to deploy applications built on LLMs. We do this to advance the state of the art (SoTA) in Artificial Intelligence and to maximize the chance of a human-aligned AGI.
Key job responsibilities
As an Applied Scientist in a startup, you will have a key role in our team. You will work with the Chief Scientist to design, develop, and deploy evaluation technologies that create high-value insights for AI engineers. This will involve assessing several technologies built on Large Language Models, including retrieval-augmented generation (RAG) systems, recommender systems, and agentive systems. You will:
Essential skills
Our offer