Senior Machine Learning Engineer (LLM & GPU Architecture)

NLP PEOPLE

Elche

EUR 45.000 - 85.000

Descripción del empleo

Senior Machine Learning Engineer (LLM & GPU Architecture)

This is a great opportunity to work with one of the biggest growing tech start-ups based in Spain. They are well-funded and one of the most well-known quantum software companies in Europe, providing hyper-efficient software to companies seeking to gain an edge with quantum computing and artificial intelligence across various sectors.

Job Overview

In this role, you will leverage cutting-edge quantum and AI technologies to lead the design, implementation, and improvement of our language models. You will work closely with cross-functional teams to integrate these models into our products, contributing to challenging projects and shaping the future of LLM and NLP technologies.

Responsibilities

Design and develop new techniques to compress Large Language Models based on quantum-inspired technologies.
Conduct rigorous evaluations and benchmarks of model performance, identifying areas for improvement.
Fine-tune and optimize LLMs for enhanced accuracy, robustness, and efficiency.
Assess strengths and weaknesses of models, propose enhancements, and develop novel solutions.
Act as a domain expert in LLMs, identifying opportunities for quantum AI-driven innovation.
Maintain comprehensive documentation of LLM development processes, experiments, and results.
Participate in code reviews and provide constructive feedback to team members.

Required Qualifications

Master’s or Ph.D. in Artificial Intelligence, Computer Science, Data Science, or related fields.
3+ years of hands-on experience with deep learning models and neural networks, preferably with Large Language Models and Transformer architectures.
1+ year of hands-on experience using LLM and Transformer models, with excellent command of libraries such as HuggingFace Transformers, Accelerate, Datasets, etc.
Solid mathematical foundations and expertise in deep learning algorithms.
Excellent problem-solving, debugging, performance analysis, test design, and documentation skills.
Strong understanding of GPU architectures.
Excellent programming skills in Python and experience with relevant libraries (PyTorch, HuggingFace, etc.).
Experience with cloud platforms (ideally AWS), containerization technologies (Docker), and deploying AI solutions in a cloud environment.
Excellent written and verbal communication skills, with the ability to work collaboratively in a fast-paced team environment.
Previous research publications in deep learning is a plus.

For this position, you will need to be fluent in Spanish.

Key Words: Large Language Models, LLM, Machine Learning, AI, Quantum Computing, GPU Architecture, GPGPU, GPU Farms, Multi-GPU, AWS, Kubernetes Clusters, DeepSpeed, SLURM, RAY, Transformer Models, Fine-tuning, Mistral, Llama.

Company: European Tech Recruit