Enable job alerts via email!

Applied Scientist (AI Evaluation)

JR United Kingdom

United Kingdom

Hybrid

GBP 60,000 - 100,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative startup is seeking a talented Applied Scientist to join their team and help advance AI through cutting-edge evaluation technologies. This role offers the opportunity to work closely with top experts in the field, designing and launching new features that will shape the future of AI. You'll be at the forefront of developing algorithms and optimizing models while collaborating with engineers and product partners. With a flexible work environment and competitive compensation, this position is perfect for those passionate about pushing the boundaries of AI technology.

Benefits

Competitive salary
Bonus and share options
Company-issued laptop
Tech allowance
Flexible/remote work environment
Travel allowances

Qualifications

  • PhD in relevant fields is essential.
  • Experience with deep learning and LLMs is critical.

Responsibilities

  • Design and deploy evaluation technologies for AI applications.
  • Collaborate with the Chief Scientist on product development.

Skills

PhD in Computer Science
Deep Learning Architectures
LLM Benchmarking
Programming in Python
Algorithm Development
Machine Learning Tools
Teamwork Skills
Dataset Creation
Publications in Conferences
AWS Deployment

Education

PhD in NLP/ML

Tools

Git
HuggingFace

Job description

Social network you want to login/join with:

col-narrow-left

Client:

Trismik

Location:
Job Category:

Other

-

EU work permit required:

Yes

col-narrow-right

Job Views:

4

Posted:

08.05.2025

Expiry Date:

22.06.2025

col-wide

Job Description:

At Trismik, we're a team of tech enthusiasts from the University of Cambridge, Salesforce, and Amazon, aiming to advance AI through science-led evaluations. If you're ambitious about shaping the future of AI, hold a PhD in NLP/ML, and enjoy transforming ideas into reality, we'd love to hear from you.

Role

We are developing adversarial tests for Large Language Models (LLMs). We seek a passionate, talented, and innovative applied scientist with a strong algorithm background to help build an industry-leading evaluation engine for LLMs and bring it to market. As one of our first hires, this role offers high impact and ownership. You will influence our science and product roadmap.

Our mission is to provide the fastest and most accurate testing environment to support AI engineers deploying AI applications with LLMs, pushing the state-of-the-art in AI and aiming for human-aligned AGI.

Key job responsibilities

As an Applied Scientist in our startup, you will work closely with the Chief Scientist to design, develop, and deploy evaluation technologies that generate valuable insights for AI engineers. Your assessments will cover technologies involving Large Language Models, including retrieval augmented systems (RAGs), recommender systems, and agentive systems. Your tasks will include:

  • Inventing, experimenting with, and launching new features, products, and systems based on machine learning and MLLM.
  • Constructing and analyzing large-scale multi-modal datasets for our products.
  • Building algorithms, conducting offline and A/B testing, optimizing, and deploying models in production alongside software engineers.
  • Automating large-scale data analysis, machine learning development, validation, and serving processes.
  • Communicating results and insights to technical and non-technical audiences through presentations and reports, and publishing work in white papers and conferences.
  • Collaborating with engineers, product partners, and engaging with customers, including marketing-led discussions.

Essential skills

  • PhD in Computer Science, Computational Engineering, Machine Learning, Statistics, or related fields.
  • Experience in designing and optimizing deep learning architectures, including model pruning.
  • Experience with LLM benchmarking.
  • Strong programming skills in Python, Java, C++, or similar.
  • Expertise in algorithm development for optimization problems.
  • Familiarity with ML tools like Git, IDEs, libraries, HuggingFace.
  • Teamwork skills and ability to thrive in fast-paced, high-risk environments.
  • Experience creating datasets and coordinating product development with stakeholders.
  • Publications in top-tier conferences/journals (e.g., NeurIPS, EMNLP, ACL).
  • Experience deploying solutions on AWS or other cloud platforms.
  • Excellent communication, strong work ethic, and a commitment to production-quality coding.

Our offer

  • Competitive salary, bonus, and share options.
  • Company-issued laptop, workstation, and tech allowance.
  • Flexible/remote work environment, based in Cambridge/London, with options for remote work within the UK/EU and travel allowances.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Pharmacovigilance Scientist

AL Solutions

Remote

GBP 50,000 - 80,000

4 days ago
Be an early applicant

User Experience Researcher

Harvey Nash

Greater London

Remote

GBP 50,000 - 80,000

12 days ago

User Experience Researcher

JR United Kingdom

London

Remote

GBP 50,000 - 90,000

Today
Be an early applicant

Senior Pharmacovigilance Scientist

JR United Kingdom

Remote

GBP 50,000 - 80,000

3 days ago
Be an early applicant

Senior Machine Learning Scientist

TN United Kingdom

Manchester

Remote

GBP 60,000 - 100,000

3 days ago
Be an early applicant

Clinical Safety Scientist

TN United Kingdom

Chesterfield

Remote

GBP 40,000 - 80,000

10 days ago

Scientist, Crop Modeling

TN United Kingdom

Remote

USD 97,000 - 130,000

12 days ago

Clinical Safety Scientist

Cypartners

Brentford

Remote

GBP 60,000 - 80,000

2 days ago
Be an early applicant

Data Scientist (GenAI)

Starling Bank

Cardiff

Hybrid

GBP 45,000 - 75,000

Today
Be an early applicant