Enable job alerts via email!

Remote AI Agent Evaluation Scenario Designer

Mindrift

Remote

NOK 600,000 - 800,000

Part time

24 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A forward-thinking AI company is seeking a talented individual to design structured evaluation scenarios for LLM-based agents. This remote freelance role involves creating test cases to evaluate AI decision-making. Candidates should have a strong analytical mindset, with a Bachelor’s or Master’s degree in Computer Science or related fields. The position offers flexible hours, with compensation up to $52/hour based on expertise and project needs. Ideal for those looking to influence AI development and gain valuable experience in the field.

Benefits

Flexible scheduling

Competitive hourly rates

Valuable portfolio experience

Qualifications

Degree in Computer Science, Software Engineering, Data Science, or related fields.
Background in QA, software testing, or data analysis.
Good understanding of test design principles.

Responsibilities

Design structured test scenarios based on real-world tasks.
Define acceptable agent behavior and scoring logic.
Review agent outputs and adapt tests.

Skills

Analytical mindset

Attention to detail

Written communication skills in English

Curiosity about AI

Education

Bachelor's/Master's Degree in Computer Science or related fields

Tools

Python

JavaScript

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.