Job Search and Career Advice Platform

Enable job alerts via email!

Remote AI Agent Evaluation Scenario Designer

Mindrift

Remote

NOK 600,000 - 800,000

Part time

24 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A forward-thinking AI company is seeking a talented individual to design structured evaluation scenarios for LLM-based agents. This remote freelance role involves creating test cases to evaluate AI decision-making. Candidates should have a strong analytical mindset, with a Bachelor’s or Master’s degree in Computer Science or related fields. The position offers flexible hours, with compensation up to $52/hour based on expertise and project needs. Ideal for those looking to influence AI development and gain valuable experience in the field.

Benefits

Flexible scheduling
Competitive hourly rates
Valuable portfolio experience

Qualifications

  • Degree in Computer Science, Software Engineering, Data Science, or related fields.
  • Background in QA, software testing, or data analysis.
  • Good understanding of test design principles.

Responsibilities

  • Design structured test scenarios based on real-world tasks.
  • Define acceptable agent behavior and scoring logic.
  • Review agent outputs and adapt tests.

Skills

Analytical mindset
Attention to detail
Written communication skills in English
Curiosity about AI

Education

Bachelor's/Master's Degree in Computer Science or related fields

Tools

Python
JavaScript
Job description
A forward-thinking AI company is seeking a talented individual to design structured evaluation scenarios for LLM-based agents. This remote freelance role involves creating test cases to evaluate AI decision-making. Candidates should have a strong analytical mindset, with a Bachelor’s or Master’s degree in Computer Science or related fields. The position offers flexible hours, with compensation up to $52/hour based on expertise and project needs. Ideal for those looking to influence AI development and gain valuable experience in the field.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.