¡Activa las notificaciones laborales por email!

Sr. Ai Engineer

buscojobs España

Málaga

Presencial

EUR 50.000 - 80.000

Jornada completa

Hace 19 días

Genera un currículum adaptado en cuestión de minutos

Consigue la entrevista y gana más. Más información

Empieza desde cero o carga un currículum

Descripción de la vacante

A leading company in AI technology is seeking an experienced AI Engineer to enhance their Private AI platform. The role involves developing and deploying AI systems, ensuring performance and security for enterprise clients. The successful candidate will enjoy a competitive salary and the opportunity to work with cutting-edge AI technologies in a collaborative environment.

Servicios

Competitive salary and equity package
23 days of PTO
Necessary equipment provided
Full-remote position within Europe
Flexible schedule

Formación

  • 5+ years of experience in software and/or AI engineering.
  • Experience with on-premise deployment of AI systems.
  • Background in integrating open-source LLMs.

Responsabilidades

  • Design and deploy AI systems for on-premise environments.
  • Optimize Small Language Models for private deployment.
  • Collaborate with cross-functional teams.

Conocimientos

Proficiency in Python
Strong problem-solving skills
Attention to detail
Excellent communication skills

Herramientas

PyTorch
Docker
Kubernetes

Descripción del empleo

About Us

Zylon (https://www.Zylon.Ai/) transforms company knowledge into immediate productivity gains through a secure, private AI workspace. Our on-premise deployment ensures complete data sovereignty, cost control, and full customization by running local AI models with no vendor lock-in. As creators and maintainers of the popular open-source project PrivateGPT (https://privategpt.dev/) with over 55K GitHub stars, we are committed to leveraging the latest AI technologies to drive innovation and deliver exceptional value to our customers.

As an early-stage startup serving clients in financial, manufacturing, engineering, and other regulated industries, we're looking for talented individuals to help propel our growth. We celebrate diversity and are dedicated to creating an inclusive environment for all employees.

Role Overview

We're seeking an experienced AI Engineer to join our team and play a crucial role in developing and enhancing our Private AI platform. You will work on the entire AI stack—from GPU management and inference servers to optimizing open-source AI models for on-premise, bare-metal deployments. Your work will ensure complete control over the data pipeline and the agentic systems operating on top while maintaining high performance and security for our enterprise clients in regulated industries.

Key Responsibilities
  1. Design, develop, and deploy AI systems that operate efficiently in on-premise environments using local models, avoiding reliance on third-party providers like OpenAI or Amazon Bedrock.
  2. Optimize Small Language Models (SLMs) for private deployment, focusing on performance and resource constraints.
  3. Contribute to the architecture of our AI platform, ensuring scalability and security.
  4. Implement advanced prompt engineering techniques and secure data processing pipelines for knowledge extraction and transformation.
  5. Research, design, and implement agentic strategies to enhance AI model interaction and user experience.
  6. Stay current with the latest GenAI research, identify opportunities for added value, and deepen the team’s understanding of various AI models’ capabilities and limitations.
  7. Collaborate with Product & Engineering teams on implementation, product definition, prioritization, scoping, and validation.
  8. Work closely with client success teams to troubleshoot and resolve technical issues.
Requirements
  1. 5+ years of experience in software and/or AI engineering roles, with a focus on GenAI for at least the last 2 years.
  2. Experience with on-premise deployment of AI systems.
  3. Strong background in integrating and fine-tuning open-source LLMs.
  4. Proficiency in Python and relevant ML/AI frameworks (PyTorch, LlamaIndex, LangChain, etc.).
  5. Familiarity with agent strategies, agentic frameworks, and RAG implementations.
  6. Experience with vector databases and efficient retrieval systems.
  7. Knowledge of containerization and deployment technologies (Docker, Kubernetes).
  8. Strong problem-solving skills and attention to detail.
  9. Excellent communication skills in English and ability to collaborate in cross-functional teams.
  10. Eager to stay up-to-date with new techniques, models, and advancements in GenAI.
  11. Creative, lean mindset, and adaptability for fast iteration and learning.
  12. Customer-focused attitude.
Nice to Have
  1. Experience with vLLM and NVIDIA Triton architecture.
  2. Understanding of NVIDIA drivers, CUDA platform, and powering GPUs for AI.
  3. Knowledge of evaluation and observability frameworks (e.g., LangSmith, Opik, Arize Phoenix, Ragas).
  4. Familiarity with Model Context Protocol (MCP) for building agentic systems.
  5. Experience with model quantization and optimization techniques.
  6. Background in startups or early-stage companies.
  7. Experience working in financial, manufacturing, or engineering domains.
What We Offer
  1. The opportunity to shape the future of private AI in regulated industries.
  2. Competitive salary and equity package.
  3. Work with cutting-edge AI technologies.
  4. Direct impact on product development and company growth.
  5. Collaborative, innovative team environment.
  6. Full-remote position within Europe.
  7. Flexible schedule (40h/week).
  8. 23 days of PTO.
  9. Necessary equipment provided.
  10. Periodic on-site team-building events.
Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.