A leading company in AI software development is looking for a contractor to evaluate code generated by Large Language Models (LLMs). The role involves reviewing code responses, ensuring high standards of quality and correctness, and collaborating with AI researchers. Candidates should have more than 7 years of software engineering experience and strong analytical and communication skills. This is a remote role requiring approximately 20 hours per week.
Turing is one of the world’s fastest-growing AI companies, pushing the boundaries of AI-assisted software development. Our mission is to empower the next generation of AI systems to reason about and work with real-world software repositories. You’ll be working at the intersection of software engineering, open-source ecosystems, and frontier AI.
Project Overview
We're building high-quality evaluation and training datasets to improve how Large Language Models (LLMs) interact with realistic software engineering tasks. A key focus of this project is curating verifiable software engineering challenges from public GitHub repository histories using a human-in-the-loop process.
Why This Role Is Unique
Required Skills
Bonus Points