
Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
A leading AI innovation company is seeking a flexible remote role focused on evaluating LLM-based agents. You will design realistic evaluation scenarios and create structured test cases that simulate human workflows. Applicants require a Bachelor's or Master's in Computer Science or related fields, with strong analytical skills and comfort in using tools like Python and JSON. This role offers the chance to influence AI model understanding while contributing to innovative projects.