A leading tech company is seeking a Software Engineer for AI to train large-language models to write production-grade code. Responsibilities include comparing code snippets, repairing AI-generated code, and providing detailed feedback. Required qualifications include 4+ years of software engineering experience, strong Python skills, and an ability to communicate clearly. This remote contract role offers competitive pay, with flexible hours ranging from 15 to 40 hours per week.
Help train large‑language models (LLMs) to write production‑grade code across a wide range of programming languages:
End result: the model learns to propose, critique, and improve code the way you do.
RLHF in one line
Generate code → expert engineers rank, edit, and justify → convert that feedback into reward signals → reinforcement learning tunes the model toward code you'd actually ship.
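For a rough feel of the "convert that feedback into reward signals" step, here is a minimal sketch in Python. It assumes a pairwise (Bradley-Terry style) preference loss, which is a common choice for reward models but is not something this posting specifies. The function names `ranking_to_pairs` and `pairwise_loss`, the snippet IDs, and the scores are all hypothetical.

```python
import math
from itertools import combinations


def ranking_to_pairs(ranked_ids):
    """Turn a best-to-worst ranking into (preferred, rejected) pairs.

    combinations() keeps input order, so the first item of each pair
    is always the one the reviewer ranked higher.
    """
    return list(combinations(ranked_ids, 2))


def pairwise_loss(rewards, pairs):
    """Mean of -log(sigmoid(r_preferred - r_rejected)) over all pairs.

    Assumed loss (Bradley-Terry style): it is small when the reward
    scores agree with the human ranking and large when they disagree.
    """
    total = 0.0
    for preferred, rejected in pairs:
        margin = rewards[preferred] - rewards[rejected]
        # -log(sigmoid(m)) is the same as log(1 + exp(-m))
        total += math.log1p(math.exp(-margin))
    return total / len(pairs)


# Hypothetical example: an engineer ranks three snippets, best first.
pairs = ranking_to_pairs(["snippet_b", "snippet_a", "snippet_c"])

# Hypothetical reward-model scores for the same snippets.
agrees = {"snippet_b": 2.0, "snippet_a": 0.5, "snippet_c": -1.0}
disagrees = {"snippet_b": -1.0, "snippet_a": 0.5, "snippet_c": 2.0}

print(pairwise_loss(agrees, pairs) < pairwise_loss(disagrees, pairs))
```

In a real pipeline the scores would come from a trained reward model, and the loss would drive that model's updates before the reinforcement learning step. This sketch only shows how a ranking becomes a numeric training signal.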
Much of this role involves explaining why one approach is better than another. This cannot be overstated.
If you can review and critique code clearly, we’ll teach you the rest.
We are looking for engineers with a strong command of Python.