Senior Machine Learning Engineer (LLM & GPU Architecture)

European Tech Recruit

País Vasco

On-site

EUR 50.000 - 80.000

Full-time

Posted 14 days ago

Job summary

A leading tech start-up in Spain is seeking a Senior Machine Learning Engineer specializing in LLM and GPU architecture. The role involves leveraging cutting-edge AI and quantum technologies to enhance model design and implementation while collaborating with cross-functional teams. Candidates should hold a Master's or Ph.D. and possess extensive experience in deep learning and neural networks.

Experience

  • 3+ years of hands-on experience with deep learning models.
  • Solid mathematical foundations in deep learning algorithms.
  • Experience with cloud platforms and containerization technologies.

Responsibilities

  • Design and develop techniques to compress Large Language Models.
  • Conduct evaluations and benchmarks of model performance.
  • Act as a domain expert in LLMs and quantum AI.

Skills

Deep Learning
Neural Networks
Problem-solving
Programming in Python
Quantum Computing

Education

Master's or Ph.D. in Artificial Intelligence, Computer Science, Data Science, or related fields

Tools

HuggingFace Transformers
PyTorch
Docker
AWS

Job description

This is a great opportunity to work with one of the fastest-growing tech start-ups based in Spain. The company is well funded and one of the best-known quantum software companies in Europe. It provides hyper-efficient software to companies seeking to gain an edge with quantum computing and artificial intelligence across finance, energy, manufacturing, defence, cybersecurity, life sciences, and chemistry, delivering practical applications and tangible value through its new AI and, more recently, LLM product.

Job Overview

In this role you will leverage cutting-edge quantum and AI technologies to lead the design, implementation, and improvement of our language models, and work closely with cross-functional teams to integrate these models into our products. You will work on challenging projects, contribute to research, and help shape the future of LLM and NLP technologies.

Responsibilities

  • Design and develop new techniques to compress Large Language Models based on quantum-inspired technologies to solve challenging use cases in various domains.
  • Conduct rigorous evaluations and benchmarks of model performance, identify areas for improvement, and fine-tune and optimise LLMs for enhanced accuracy, robustness, and efficiency.
  • Use your expertise to assess the strengths and weaknesses of models, propose enhancements, and develop novel solutions to improve performance and efficiency.
  • Act as a domain expert in the field of LLMs, understanding domain-specific problems and identifying opportunities for quantum AI-driven innovation.
  • Maintain comprehensive documentation of LLM development processes, experiments, and results.
  • Participate in code reviews and provide constructive feedback to team members.

Required Qualifications

  • Master's or Ph.D. in Artificial Intelligence, Computer Science, Data Science, or related fields.
  • 3+ years of hands-on experience with deep learning models and neural networks, preferably working with Large Language Models and Transformer architectures, or computer vision models.
  • Hands-on experience using LLMs and Transformer models, with excellent command of libraries such as HuggingFace Transformers, Accelerate, and Datasets.
  • Solid mathematical foundations and expertise in deep learning algorithms and neural networks, both training and inference.
  • Excellent problem-solving, debugging, performance analysis, test design, and documentation skills.
  • Strong understanding of the fundamentals of GPU architectures.
  • Excellent programming skills in Python and experience with relevant libraries (PyTorch, HuggingFace, etc.).
  • Experience with cloud platforms (ideally AWS), containerization technologies (Docker), and deploying AI solutions in a cloud environment.
  • Excellent written and verbal communication skills, with the ability to work collaboratively in a fast-paced team environment and communicate complex ideas effectively.
  • Previous research publications in deep learning are a plus.

Keywords: Large Language Models / LLM / Machine Learning / AI / Quantum Computing / GPU Architecture / GPGPU / GPU Farms / Multi-GPU / AWS / Kubernetes Clusters / DeepSpeed / SLURM / Ray / Transformer Models / Fine-tuning / Mistral / Llama

By applying to this role you understand that we may collect your personal data and store and process it on our systems. For more information please see our Privacy Notice (https://eu-recruit.com/about-us/privacy-notice/).