¡Activa las notificaciones laborales por email!

Sr. Ai Engineer

buscojobs España

Islas Baleares

A distancia

EUR 60.000 - 80.000

Jornada completa

Hace 12 días

Mejora tus posibilidades de llegar a la entrevista

Elabora un currículum adaptado a la vacante para tener más posibilidades de triunfar.

Descripción de la vacante

A leading company is seeking an experienced AI Engineer to enhance their Private AI platform. The role involves developing AI systems, optimizing models, and collaborating with teams to drive innovation in a fully remote setting. The ideal candidate will have extensive experience in AI engineering and a strong focus on customer needs.

Servicios

Competitive salary and equity package

Work with cutting-edge AI technologies

Flexible schedule (40h/week)

23 days of PTO

Necessary equipment provided

Periodic on-site team-building events

Formación

5+ years of experience in software or AI engineering roles.
Experience with on-premise deployment of AI systems.
Strong background in integrating and fine-tuning open-source LLMs.

Responsabilidades

Design, develop, and deploy AI systems for on-premise environments.
Optimize Small Language Models for private deployments.
Collaborate with Product & Engineering teams on implementation.

Conocimientos

Problem Solving

Attention to Detail

Communication

Customer Focus

Herramientas

Docker

Kubernetes

Python

PyTorch

LlamaIndex

LangChain

transforms company knowledge into immediate productivity gains through a secure, private AI workspace. Our on-premise deployment ensures complete data sovereignty, cost control, and full customization by running local AI models with no vendor lock-in. As creators and maintainers of the popular open-source project PrivateGPT (privategpt.dev) with more than 55K Github stars, we are committed to leveraging the latest AI technologies to drive innovation and deliver exceptional value to our customers.

As an early-stage startup serving clients in financial, manufacturing, engineering, and other regulated industries, we're looking for talented individuals to help drive our growth in a company where we celebrate diversity and are committed to creating an inclusive environment for all employees.

Role Overview

We're seeking an experienced AI Engineer to join our team and play a crucial role in developing and enhancing our Private AI platform. You'll work on the whole AI stack, from GPU management and inference server to leveraging and optimizing open-source AI models for on-premise bare-metal deployments, ensuring complete control of the data pipeline and the agentic system on top of it. All of this while maintaining high performance and security for our enterprise clients in regulated industries.

Key Responsibilities

Design, develop, and deploy AI systems that operate efficiently in on-premise environments with local models, independent of third-party providers like OpenAI or Amazon Bedrock.
Optimize Small Language Models (SLMs) for private deployments, focusing on performance and resource constraints.
Contribute to the architecture of our AI platform, ensuring scalability and security.
Implement advanced prompt engineering techniques and secure data processing pipelines for knowledge extraction and transformation.
Research, design, and implement agentic strategies to improve AI model interaction and user experience.
Stay current with advancements in GenAI research and identify opportunities for product enhancement, developing a deep understanding of various AI models' capabilities and limitations.
Collaborate with Product & Engineering teams in implementation, product definition, prioritization, scoping, and validation.
Work closely with client success teams to troubleshoot and resolve technical issues.

Requirements

5+ years of experience in software or AI engineering roles, with a focus on GenAI in the last 2 years.
Experience with on-premise deployment of AI systems.
Strong background in integrating and fine-tuning open-source LLMs.
Proficiency in Python and relevant ML/AI frameworks (PyTorch, LlamaIndex, LangChain, etc.).
Familiarity with agent strategies, agentic frameworks, and RAG implementations.
Experience with vector databases and efficient retrieval systems.
Knowledge of containerization and deployment technologies (Docker, Kubernetes).
Strong problem-solving skills and attention to detail.
Excellent English communication skills and ability to collaborate effectively.
Eager to stay updated with new techniques and models in GenAI.
Lean, creative mindset, ready to iterate, learn, and adapt.
Customer-focused mindset.

Nice to Have

Experience with vLLM + NVIDIA Triton architecture.
Understanding of NVIDIA drivers and CUDA platform.
Knowledge of evaluation and observability frameworks (LangSmith, Opik, Arize Phoenix, Ragas, etc.).
Knowledge of Model Context Protocol (MCP) for building agentic systems.
Experience with model quantization and optimization techniques.
Experience in startups or early-stage companies.
Background in financial, manufacturing, or engineering domains.

What We Offer