Job Search and Career Advice Platform

Activez les alertes d’offres d’emploi par e-mail !

Remote: Evaluation Scenario Designer for AI Agent Testing

Mindrift

À distance

EUR 40 000 - 60 000

Temps partiel

Aujourd’hui
Soyez parmi les premiers à postuler

Générez un CV personnalisé en quelques minutes

Décrochez un entretien et gagnez plus. En savoir plus

Résumé du poste

A leading AI innovation company is seeking candidates to design structured evaluation scenarios for LLM-based agents. The role involves creating test cases that simulate human-performed tasks, defining evaluation logic, and ensuring production readiness of scenarios. Requirements include a degree in a related field, strong analytical skills, and familiarity with coding formats. This flexible freelance project allows you to contribute from anywhere in the world, with pay rates up to $50/hour based on experience.

Prestations

Flexible scheduling
Competitive pay rates
Work from anywhere

Qualifications

  • Bachelor’s and/or Master’s Degree in Computer Science, Software Engineering, or related fields.
  • Good understanding of test design principles (e.g., reproducibility, coverage, edge cases).
  • Strong written communication skills in English.

Responsabilités

  • Create structured test cases that simulate complex human workflows.
  • Define gold-standard behavior and scoring logic to evaluate agent actions.
  • Analyze agent logs and failure modes.

Connaissances

Test design principles
Analytical mindset
Strong written communication in English
Basic experience with Python
Basic experience with JavaScript

Formation

Bachelor’s and/or Master’s Degree in Computer Science or related fields

Outils

JSON
YAML
Description du poste
A leading AI innovation company is seeking candidates to design structured evaluation scenarios for LLM-based agents. The role involves creating test cases that simulate human-performed tasks, defining evaluation logic, and ensuring production readiness of scenarios. Requirements include a degree in a related field, strong analytical skills, and familiarity with coding formats. This flexible freelance project allows you to contribute from anywhere in the world, with pay rates up to $50/hour based on experience.
Obtenez votre examen gratuit et confidentiel de votre CV.
ou faites glisser et déposez un fichier PDF, DOC, DOCX, ODT ou PAGES jusqu’à 5 Mo.