Software Engineer, AI (Python)

G2I Inc.

Remote

MXN 400,000 - 600,000

Part-time

Posted today

Job Summary

A leading technology firm seeks a remote Software Engineer to help train large language models to write production-grade code. The ideal candidate has 4+ years of professional experience in Python, strong code-review skills, and excellent attention to detail. Responsibilities include comparing and ranking code snippets, refactoring AI-generated code, and injecting feedback into the training process. The position offers flexible hours, with compensation ranging from $30 to $70 per hour, depending on location and seniority.

Qualifications

  • 4+ years of professional software engineering experience in Python.
  • Strong instincts for spotting logic errors, performance traps, and security issues.
  • Extreme attention to detail and excellent written communication skills.

Responsibilities

  • Help train large language models to write production-grade code.
  • Compare & rank multiple code snippets, explaining which is best and why.
  • Repair & refactor AI-generated code for correctness, efficiency, and style.

Skills

  • Professional software engineering experience in Python
  • Strong code-review instincts
  • Attention to detail
  • Excellent written communication skills
  • Ability to read documentation and language specs

Job Description

Software Engineer, AI — Code Evaluation & Training (Remote)

Help train large language models (LLMs) to write production-grade code across a wide range of programming languages:

  • Compare & rank multiple code snippets, explaining which is best and why.
  • Repair & refactor AI-generated code for correctness, efficiency, and style.
  • Inject feedback (ratings, edits, test results) into the RLHF pipeline and keep it running smoothly.

End result: the model learns to propose, critique, and improve code the way _you_ do.

RLHF in one line

Generate code → expert engineers rank, edit, and justify → convert that feedback into reward signals → reinforcement learning tunes the model toward code you'd actually ship.
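
As a rough illustration of the "convert that feedback into reward signals" step, one widely used approach is to expand a reviewer's ranking into pairwise preferences that a reward model could later be trained on. This is only a sketch under that assumption: the posting does not describe the actual pipeline, and the names `PreferencePair` and `ranking_to_pairs` are hypothetical.

```python
from itertools import combinations
from typing import NamedTuple


class PreferencePair(NamedTuple):
    """One comparison (hypothetical record shape, not from the posting)."""
    chosen: str    # snippet the reviewer ranked higher
    rejected: str  # snippet the reviewer ranked lower


def ranking_to_pairs(ranked_snippets: list[str]) -> list[PreferencePair]:
    """Expand a best-first ranking into every (chosen, rejected) pair."""
    # combinations() preserves input order, so the first element of each
    # pair always ranks above the second.
    return [PreferencePair(a, b) for a, b in combinations(ranked_snippets, 2)]


# A reviewer ranks three candidate snippets, best first.
ranking = ["snippet_a", "snippet_c", "snippet_b"]
pairs = ranking_to_pairs(ranking)
for pair in pairs:
    print(f"{pair.chosen} preferred over {pair.rejected}")
```

A ranking of n snippets yields n(n-1)/2 pairs, so a single careful ranking carries more training signal than one "best of" vote.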

What You'll Need:
  • 4+ years of professional software engineering experience in Python
  • (Constraint programming experience is a bonus, but not required)
  • Strong code-review instincts: you can spot logic errors, performance traps, and security issues quickly.
  • Extreme attention to detail and excellent written communication skills.
  • You enjoy reading documentation and language specs and thrive in an asynchronous, low-oversight environment.

Much of this role involves explaining _why_ one approach is better than another.

This cannot be overstated.

What You Don't Need:
  • Prior RLHF (Reinforcement Learning from Human Feedback) or AI training experience.
  • Deep machine learning knowledge.

If you can review and critique code clearly, we'll teach you the rest.

Tech Stack:

We are looking for engineers with a strong command of Python.

Logistics:
  • Location: Fully remote, work from anywhere
  • Compensation: $30/hr to $70/hr, depending on location and seniority
  • Hours: Minimum 15 hrs/week, up to 40 hrs/week available
  • Engagement: Contract