Senior Machine Learning Engineer (LLM & GPU Architecture)
This is a great opportunity to work with one of the biggest growing tech start-ups based in Spain. They are well-funded and one of the most well-known quantum software companies in Europe, providing hyper-efficient software to companies seeking to gain an edge with quantum computing and artificial intelligence across various sectors.
Job Overview
In this role, you will leverage cutting-edge quantum and AI technologies to lead the design, implementation, and improvement of our language models. You will work closely with cross-functional teams to integrate these models into our products, contributing to challenging projects and shaping the future of LLM and NLP technologies.
Responsibilities
- Design and develop new techniques to compress Large Language Models based on quantum-inspired technologies.
- Conduct rigorous evaluations and benchmarks of model performance, identifying areas for improvement.
- Fine-tune and optimize LLMs for enhanced accuracy, robustness, and efficiency.
- Assess strengths and weaknesses of models, propose enhancements, and develop novel solutions.
- Act as a domain expert in LLMs, identifying opportunities for quantum AI-driven innovation.
- Maintain comprehensive documentation of LLM development processes, experiments, and results.
- Participate in code reviews and provide constructive feedback to team members.
Required Qualifications
- Master’s or Ph.D. in Artificial Intelligence, Computer Science, Data Science, or related fields.
- 3+ years of hands-on experience with deep learning models and neural networks, preferably with Large Language Models and Transformer architectures.
- 1+ year of hands-on experience using LLM and Transformer models, with excellent command of libraries such as HuggingFace Transformers, Accelerate, Datasets, etc.
- Solid mathematical foundations and expertise in deep learning algorithms.
- Excellent problem-solving, debugging, performance analysis, test design, and documentation skills.
- Strong understanding of GPU architectures.
- Excellent programming skills in Python and experience with relevant libraries (PyTorch, HuggingFace, etc.).
- Experience with cloud platforms (ideally AWS), containerization technologies (Docker), and deploying AI solutions in a cloud environment.
- Excellent written and verbal communication skills, with the ability to work collaboratively in a fast-paced team environment.
- Previous research publications in deep learning is a plus.
For this position, you will need to be fluent in Spanish.
Key Words: Large Language Models, LLM, Machine Learning, AI, Quantum Computing, GPU Architecture, GPGPU, GPU Farms, Multi-GPU, AWS, Kubernetes Clusters, DeepSpeed, SLURM, RAY, Transformer Models, Fine-tuning, Mistral, Llama.
Company: European Tech Recruit