Job Search and Career Advice Platform

¡Activa las notificaciones laborales por email!

Deep Reinforcement Learning Engineer (Principal)

Friday Systems

Madrid

Presencial

EUR 70.000 - 90.000

Jornada completa

Hace 13 días

Genera un currículum adaptado en cuestión de minutos

Consigue la entrevista y gana más. Más información

Descripción de la vacante

A leading AI development company in Madrid is seeking an expert in Reinforcement Learning to own the DRL stack from formulation to deployment. You will design algorithms and ensure product excellence in a small, impactful team. Candidates must have a strong background in Deep Learning and experience in shipping RL systems from start to finish. This role offers a unique opportunity to directly influence product strategy and innovation.

Servicios

Salary + equity
Direct impact on product
No bureaucracy

Formación

  • Proven experience shipping RL systems from idea to production.
  • Strong expertise in PyTorch and multiple DRL algorithms implementation.
  • Solid understanding of mathematical principles in reinforcement learning.

Responsabilidades

  • Own the DRL stack end-to-end from formulation to deployment.
  • Design DRL algorithms for complex control tasks.
  • Collaborate with C-level team for product excellence.

Conocimientos

Reinforcement Learning
Deep Learning
PyTorch
Docker
AWS
Python
Linux
Descripción del empleo

Friday Systems builds AI that allows industrial robots to adapt to dynamic warehouse environments. We focus on high-throughput palletizing and related tasks where classical approaches break down. Our stack is built around Deep Reinforcement Learning with modern sequence models.

Tiny team, zero bureaucracy, direct impact, salary + equity.

THE ROLE

Own the DRL stack end-to-end : formulation → algorithm design → large-scale training → evaluation → deployment. You’ll work directly with the CTO to turn cutting-edge DRL into production throughput at customer sites.

YOU WILL

Design & ship DRL algorithms (PPO / SAC / DDQN and variants, based on encoders / cross-attention / pointer networks) for complex control & combinatorial optimization.

Tackle stability & sample-efficiency : GAE, normalization, entropy / KL control, distributional / value-loss tuning, curriculum learning and reward shaping, …

Launch multi-GPU training, parallel rollouts, efficient replay / storage, and reproducible experiment tooling.

Productionize : clean PyTorch code, profiling, Dockerized services (FastAPI), AWS deployments, experiment tracking, dashboards.

Collaborate with the C-Level Team to ensure product excellence and alignment with business strategy. Forge strong relationships with clients, effectively translating their needs into unique technology solutions.

Build and nurture a high-performing team by attracting top talent. Provide mentorship and leadership to foster a culture of quality and innovation.

YOU HAVE

Track record shipping RL beyond academic demos : you’ve led at least one end-to-end RL system from idea to production or a state-of-the-art benchmark in the last 3–5 years.

Extensive Deep Learning, Reinforcement Learning & PyTorch expertise : You can implement several DRL algorithms from scratch, reason about root-cause performance drops and make informed decisions about next steps.

Systems know-how : Python, Linux, Docker, Multi-GPU, Cloud (AWS).

Math maturity : MDPs / Bellman operators, policy gradients, trust-region / KL, GAE / λ-returns, stability / regularization in on-policy vs off-policy regimes.

Ownership : you’re comfortable being the primary owner for experiments, code quality, and results in a small team.

Location / time zone : EU-based (CET±2) and able to travel occasionally to customer warehouses.

We are not considering entry-level or coursework-only profiles for this role.

HIRING PROCESS

30-min intro & mutual fit

Deep technical session with CTO on your past RL work (no LeetCode, no homework)

Two one-hour “Traits & Skills” conversations with our other Co-founders.

Meet the team & offer

Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.