Enable job alerts via email!

GenAI Engineer - Retrieval-Augmented Generation (RAG)

all.health

London

On-site

GBP 60,000 - 100,000

Full time

4 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm is seeking a talented Machine Learning Engineer to revolutionize healthcare through cutting-edge technology. In this pivotal role, you will design and optimize retrieval-augmented generation (RAG) pipelines, enhancing how patients and clinicians interact with health data. Your expertise in machine learning, natural language processing, and deep learning frameworks will directly contribute to creating a safer, more efficient healthcare platform. If you are passionate about leveraging technology to improve patient care and thrive in a dynamic environment, this opportunity is perfect for you.

Qualifications

  • 3+ years of experience in machine learning/NLP roles focused on LLMs.
  • Strong proficiency in Python and deep learning frameworks.

Responsibilities

  • Design and implement RAG architectures using LLMs.
  • Build and maintain retrieval pipelines over health data.
  • Integrate RAG outputs into user-facing applications.

Skills

Machine Learning
Natural Language Processing (NLP)
Python
Deep Learning (PyTorch, TensorFlow)
GenAI Libraries (LangChain, LlamaIndex, Transformers)
Vector Search
Embedding Models
Prompt Engineering
Problem-Solving Skills

Education

Masters Degree

Tools

PyTorch
TensorFlow
FAISS
Weaviate
Pinecone
BioBERT
ClinicalBERT

Job description

all.health is at the forefront of revolutionizing healthcare for millions of patients worldwide. Combining more than 20 years of proprietary wearable technology with clinically relevant signals, all.health connects patients and physicians like never before with continuous, data-driven dialogue. This unique position of daily directed guidance stands to redefine primary care while helping people live happier, healthier, and longer.

Education

    • Masters Degree
About the Role
    • You will design, build, and optimize RAG pipelines that combine large language models (LLMs) with domain-specific retrieval systems, enabling natural language understanding and reasoning over patient data, clinical guidelines, and health records. Your work will directly impact how patients and clinicians interact with our platform, enabling safe, accurate, and context-aware content surfacing.
Responsibilities
    • Design and implement RAG architectures using open-source and potentially proprietary LLMs (e.g., LLaMA, Mistral, OpenAI, Anthropic).
    • Build and maintain retrieval pipelines over structured and unstructured health data (EHRs, patient notes, device logs, clinical documentation).
    • Develop indexing strategies using vector databases (e.g., FAISS, Weaviate, Pinecone) and embedding models (e.g., BioBERT, ClinicalBERT).
    • Integrate RAG outputs into user-facing applications, ensuring responses are grounded, reliable, and privacy-compliant.
    • Work closely with product, clinical, and data science teams to fine-tune prompts, evaluate responses, and iterate on model performance.
    • Build evaluation pipelines for factuality, relevance, and safety using synthetic and real-world datasets.
    • Contribute to infrastructure for scalable GenAI deployments and model versioning.
    • Stay up to date with the latest research in GenAI and health tech applications of LLMs.
Requirements
    • 3+ years of experience working in machine learning / NLP roles, with recent focus on LLMs and/or GenAI.
    • Strong proficiency in Python, deep learning frameworks (PyTorch or TensorFlow), and GenAI libraries (LangChain, LlamaIndex, Transformers).
    • Hands-on experience with vector search, embedding models, and retrieval pipelines.
    • Familiarity with prompt engineering, prompt tuning, and evaluation of generative model outputs.
    • Experience working with healthcare or sensitive data (HIPAA/GDPR compliance awareness).
    • Strong problem-solving skills and ability to move fast in a startup environment.
    • Bonus: Experience with MLOps, Kubernetes, AWS/GCP, and deploying models in production.
Work Permit
    • UK work permit required
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

GenAI Engineer – Retrieval-Augmented Generation (RAG)

all.health

London

On-site

GBP 50,000 - 90,000

22 days ago