Job Search and Career Advice Platform

Enable job alerts via email!

Graduate AI Engineer

Reply

Greater London

On-site

GBP 80,000 - 100,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading AI tech consultancy in the UK is seeking a Graduate AI Engineer to design and develop bespoke large language models for innovative organizations. You will work on fine-tuning models, building AI systems, and collaborating with cross-functional teams to deliver high-impact AI solutions. This role requires a Bachelor's or Master's degree in a related field and strong experience in Python and machine learning frameworks. Additional skills in NLP and data preprocessing are a plus. Opportunities for travel within the UK and EU are available.

Qualifications

  • Strong experience with Python and ML frameworks such as PyTorch or TensorFlow.
  • Hands-on experience training, fine-tuning, or deploying LLMs.
  • Solid understanding of NLP, transformers, and attention mechanisms.
  • Experience with data preprocessing, tokenization, and dataset pipelines.
  • Familiarity with REST APIs, microservices, and MLOps tools.

Responsibilities

  • Design, develop, and train large language models and AI systems.
  • Fine-tune pre-trained LLMs for specific use cases.
  • Monitor model performance and manage model drift.

Skills

Python
Machine Learning frameworks
NLP
Data preprocessing
Collaboration skills
Problem-solving skills

Education

Bachelor's or Master's degree in Computer Science, AI, or related field

Tools

PyTorch
TensorFlow
MLflow
Kubeflow
Airflow
Weights & Biases
AWS
GCP
Azure
Job description
Overview

Graduate AI Engineer

Sail Reply is an AI tech innovation consultancy that delivers experience-led, value-focused solutions for some of the world's most forward-thinking organisations. Our mission is democratising LLMs to any business process by turning proprietary knowledge into competitive advantage with bespoke LLMs built for Clients domain and deployed at scale. We build bespoke LLM solutions tailored to the client's business processes, delivering enterprise-grade performance comparable to leading off-the-shelf model. Providing a solution designed for high relevance, low latency, and compliance.

Role Overview

As an AI engineer, you will help deliver experience-led, value-focused solutions for innovative organizations by building bespoke LLMs tailored to client business processes. Your work will focus on turning proprietary knowledge into competitive advantage by deploying custom LLMs at scale, achieving enterprise-grade performance on par with leading off-the-shelf models. These solutions are designed for high relevance, low latency, and strict compliance, ensuring maximum impact for our clients.

We are recruiting for Autumn 2026.

Responsibilities
  • Design, develop, and train large language models and AI systems.
  • Fine-tune pre-trained LLMs (e.g., GPT, LLaMA, Mistral, Falcon) for specific use cases.
  • Build and optimize prompting strategies, Retrieval-Augmented Generation (RAG), and agent-based systems.
  • Prepare, clean, and manage large-scale datasets for model training.
  • Implement model evaluation, benchmarking, and performance optimization.
  • Deploy models into production using scalable and secure architectures.
  • Collaborate with cross-functional teams to translate business needs into AI solutions.
  • Monitor model performance, manage model drift, iterate improvements, and stay current with the latest research and advancements in AI and LLMs.
About the Candidate
  • Bachelor's or Master's degree (2:1 or higher) in Computer Science, AI, Machine Learning, or a related field (or equivalent experience) is essential.
  • Strong experience with Python and ML frameworks such as PyTorch or TensorFlow, and hands-on experience training, fine-tuning, or deploying LLMs.
  • Solid understanding of NLP, transformers, attention mechanisms, embeddings, and experience with data preprocessing, tokenization, and dataset pipelines.
  • Familiarity with REST APIs, microservices, model serving, and MLOps tools (e.g., MLflow, Kubeflow, Airflow, Weights & Biases).
  • Experience with cloud platforms (AWS, GCP, Azure), distributed training, model parallelism, inference optimization, and GPU/TPU infrastructure.
  • Knowledge of vector databases (e.g., FAISS, Pinecone), security, privacy, and responsible AI practices.
  • Strong problem-solving, analytical, and communication skills, with a positive, team-oriented attitude and a passion for continuous learning.
  • Additional advantages include experience with RLHF, open-source contributions, building AI copilots/chatbots, client and stakeholder management, and use of Atlassian tools like Jira and Confluence.
  • Willingness to travel within the UK and EU for client engagements as required.

Reply is an Equal Opportunities Employer and committed to embracing diversity in the workplace. We provide equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type regardless of age, sexual orientation, gender, identity, pregnancy, religion, nationality, ethnic origin, disability, medical history, skin colour, marital status or parental status or any other characteristic protected by the Law.

Reply is committed to making sure that our selection methods are fair to everyone. To help you during the recruitment process, please let us know of any Reasonable Adjustments you may need.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.