¡Activa las notificaciones laborales por email!

Sr. Ai Engineer

buscojobs España

Lugo

A distancia

EUR 50.000 - 90.000

Jornada completa

Ayer
Sé de los primeros/as/es en solicitar esta vacante

Mejora tus posibilidades de llegar a la entrevista

Elabora un currículum adaptado a la vacante para tener más posibilidades de triunfar.

Descripción de la vacante

A leading company in innovative AI technologies is seeking an AI Engineer to develop their Private AI platform. In this pivotal role, you'll work with cutting-edge technologies to optimize AI systems for enterprise clients in regulated industries, ensuring high performance and security. Join a dynamic and diverse team, and contribute significantly to the company's growth in a fully remote environment.

Servicios

Competitive salary and equity package
23 days of PTO
Full-remote position within Europe
Necessary equipment provided
Periodic team-building events

Formación

  • 5+ years in software/AI engineering with focus on GenAI last 2 years.
  • Proficiency in Python and experience with on-premise AI systems.
  • Strong background in integrating open-source LLMs.

Responsabilidades

  • Design and deploy efficient AI systems in on-premise environments.
  • Contribute to AI platform architecture ensuring scalability and security.
  • Collaborate with Product & Engineering teams for implementation.

Conocimientos

Python
Problem Solving
Communication
Attention to Detail
Adaptability

Herramientas

PyTorch
Docker
Kubernetes

Descripción del empleo

About Us
Zylon (https://www.Zylon.Ai) transforms company knowledge into immediate productivity gains through a secure, private AI workspace. Our on-premise deployment ensures complete data sovereignty, cost control, and full customization by running local AI models with no vendor lock-in. As creators and maintainers of the popular open-source project PrivateGPT (https://privategpt.dev) with more than 55K Github stars, we are committed to leveraging the latest AI technologies to drive innovation and deliver exceptional value to our customers. As an early-stage startup serving clients in financial, manufacturing, engineering, and other regulated industries, we're looking for talented individuals to help drive our growth in a company where we celebrate diversity and are committed to creating an inclusive environment for all employees.

Role Overview

We're seeking an experienced AI Engineer to join our team and play a crucial role in developing and enhancing our Private AI platform. You'll work on the entire AI stack, from GPU management and inference server to leveraging and optimizing open-source AI models for on-premise bare metal deployments, ensuring high performance and security for our enterprise clients in regulated industries.

Key Responsibilities

  • Design, develop, and deploy AI systems that operate efficiently in on-premise environments with local models, independent of third-party providers like OpenAI or Amazon Bedrock.
  • Optimize Small Language Models (SLMs) for private deployments focusing on performance and resource constraints.
  • Contribute to the architecture of our AI platform, ensuring scalability and security.
  • Implement advanced prompt engineering techniques and secure data processing pipelines for knowledge extraction and transformation.
  • Research, design, and implement agentic strategies to enhance AI model interaction and user experience.
  • Stay current with advancements in GenAI research to bring additional value to our product and deepen team understanding of AI models' capabilities and limitations.
  • Collaborate with Product & Engineering teams on implementation, product definition, prioritization, scoping, and validation.
  • Work with client success teams to troubleshoot and resolve technical issues.

Requirements

  • 5+ years of experience in software or AI engineering roles, with a focus on GenAI in the last 2 years.
  • Experience with on-premise deployment of AI systems.
  • Strong background in integrating and fine-tuning open-source LLMs.
  • Proficiency in Python and ML/AI frameworks (PyTorch, LlamaIndex, LangChain, etc.).
  • Familiarity with agent strategies, RAG implementations, vector databases, and retrieval systems.
  • Knowledge of containerization and deployment technologies (Docker, Kubernetes).
  • Strong problem-solving skills, attention to detail, and excellent communication skills in English.
  • Eagerness to stay updated with new techniques, models, and advances in GenAI.
  • Creative, lean mindset, adaptable, and customer-focused.

Nice to Have

  • Experience with vLLM + NVIDIA Triton architecture, CUDA, and NVIDIA drivers.
  • Knowledge of evaluation and observability frameworks (LangSmith, Opik, Arize Phoenix, Ragas, etc.).
  • Understanding of Model Context Protocol (MCP) for agentic systems.
  • Experience with model quantization and optimization techniques.
  • Background in startups or early-stage companies, especially in financial, manufacturing, or engineering domains.

What We Offer

  • The opportunity to shape the future of private AI in regulated industries.
  • Competitive salary and equity package.
  • Work with cutting-edge AI technologies.
  • Direct impact on product development and company growth.
  • Collaborative, innovative team environment.
  • Full-remote position within Europe, with a flexible schedule (40h/week).
  • 23 days of PTO, necessary equipment, and periodic team-building events.

Location: Lugo, Kingdom Of Spain, ES

Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.