Remote Senior Software Engineer (LLM) - 34953

Turing

São Paulo

Remote

BRL 80.000 - 120.000

Part-time

Posted 3 days ago

Job Summary

A leading AI company is seeking a Remote Senior Software Engineer specializing in LLMs. The role involves curating evaluation datasets for AI, collaborating with researchers, and assessing model-generated code. Candidates should have over 7 years of software engineering experience, strong code evaluation skills, and excellent communication abilities.

Qualifications

  • 7+ years of professional software engineering experience.
  • Strong fundamentals in coding best practices.
  • Exceptional written communication skills.

Responsibilities

  • Review and compare model-generated code responses.
  • Evaluate code diffs for correctness and quality.
  • Provide clear rationales for ranking decisions.

Skills

Software design
Debugging
Code quality assessment
Written communication

Job Description

Turing is one of the world’s fastest-growing AI companies, pushing the boundaries of AI-assisted software development. Our mission is to empower the next generation of AI systems to reason about and work with real-world software repositories. You’ll be working at the intersection of software engineering, open-source ecosystems, and frontier AI.

Project Overview

We're building high-quality evaluation and training datasets to improve how Large Language Models (LLMs) interact with realistic software engineering tasks. A key focus of this project is curating verifiable software engineering challenges from public GitHub repository histories using a human-in-the-loop process.

Why This Role Is Unique

  • Collaborate directly with AI researchers shaping the future of AI-powered software development.
  • Work with high-impact open-source projects and evaluate how LLMs perform on real bugs, issues, and developer tasks.
  • Influence dataset design that will train and benchmark next-gen LLMs.

What Day-to-Day Looks Like

  • Review and compare 3–4 model-generated code responses for each task using a structured ranking system.
  • Evaluate code diffs for correctness, code quality, style, and efficiency.
  • Provide clear, detailed rationales explaining the reasoning behind each ranking decision.
  • Maintain high consistency and objectivity across evaluations.
  • Collaborate with the team to identify edge cases and ambiguities in model behavior.

Required Skills

  • 7+ years of professional software engineering experience, ideally at top-tier product companies (e.g., Stripe, Datadog, Snowflake, Dropbox, Canva, Shopify, Intuit, PayPal) or in research groups at IBM, GE, Honeywell, Schneider, etc.
  • Strong fundamentals in software design, coding best practices, and debugging.
  • Excellent ability to assess code quality, correctness, and maintainability.
  • Proficient with code review processes and reading diffs in real-world repositories.
  • Exceptional written communication skills to articulate evaluation rationale clearly.
  • Prior experience with LLM-generated code or evaluation work is a plus.

Bonus Points

  • Experience in LLM research, developer agents, or AI evaluation projects.
  • Background in building or scaling developer tools or automation systems.

Engagement Details

  • Commitment: ~20 hours/week (partial PST overlap required)
  • Type: Contractor (no medical benefits or paid leave)
  • Duration: 1 month (starting next week; potential extensions based on performance and fit)
  • Seniority level: Mid-Senior level
  • Employment type: Contract
  • Job function: Information Technology and Engineering
  • Industries: Software Development
