Enable job alerts via email!

Senior Principal, Data Engineering

BioTalent Ltd

United Kingdom

On-site

GBP 60,000 - 90,000

Full time

30 days ago

Job summary

A global biopharmaceutical organization is seeking a skilled Data Engineer to lead initiatives across R&D. You will design and optimize data pipelines using AWS technologies while ensuring compliance with global data regulations. The ideal candidate has expertise in Python, R, SQL, and AWS, along with a relevant degree and extensive experience in healthcare data.

Qualifications

  • 5–7 years’ experience in data engineering, especially in healthcare or clinical data.
  • Expert proficiency in data processing languages like Python, R, and SQL.
  • Knowledge of healthcare data standards such as CDISC, HL7, and FHIR.

Responsibilities

  • Lead the design, development, and maintenance of data pipelines.
  • Create and optimise ETL/ELT processes for structured and unstructured data.
  • Collaborate with researchers and data scientists to provide tailored solutions.

Skills

Python
R
SQL
AWS S3
Redshift
Docker
Kubernetes
Data privacy regulations (GDPR, HIPAA)

Education

Bachelor’s degree in a relevant field
Master’s degree in a relevant field

Tools

AWS Glue
AWS Lambda
Job description

We are partnered with a global biopharmaceutical organisation dedicated to transforming the lives of patients and their families. With a strong commitment to delivering life-changing medicines for serious conditions — often where limited or no therapeutic options exist — this company combines a diverse portfolio of marketed therapies with a growing pipeline in oncology and neuroscience. Headquartered in Europe with operations, laboratories, and manufacturing facilities worldwide, they are driven by a patient-focused and science-led approach.

This role will offer you:

  • A leading position in driving data engineering initiatives across multiple Research & Development areas, including Clinical, Pre-Clinical, Non-Clinical, Omics, Real World Data, and more.
  • The opportunity to design and optimise advanced data pipelines, models, and repositories using cutting-edge AWS technologies.
  • Cross-functional collaboration with international teams of scientists, researchers, and stakeholders.
  • A high-impact role supporting innovation in both neuroscience and oncology.

Responsibilities:

  • Lead the design, development, and maintenance of data pipelines for diverse R&D data sources.
  • Create and optimise ETL/ELT processes for structured and unstructured data using Python, R, SQL, and AWS services.
  • Build and maintain repositories and data warehousing solutions.
  • Develop and maintain data quality frameworks, validation processes, and KPIs.
  • Implement data versioning, lineage tracking, and regulatory compliance measures.
  • Document data processes, architectures, and workflows in line with best DevOps practices.
  • Collaborate with R&D researchers, data scientists, and stakeholders to deliver tailored solutions.
  • Ensure compliance with global data privacy regulations such as GDPR and HIPAA.

You will bring:

  • Expert proficiency in Python, R, and SQL for data processing.
  • Advanced knowledge of AWS services, particularly S3, Redshift, FSx, Glue, and Lambda.
  • Strong skills in relational database design and modelling, with experience in NoSQL and Graph databases.
  • Experience with containerisation (Docker, Kubernetes/EKS).
  • Knowledge of healthcare data standards such as CDISC, HL7, FHIR, SNOMED CT, OMOP, and DICOM.
  • Familiarity with big data technologies, MLOps, and model deployment.
  • Bachelor’s degree in a relevant field (Master’s preferred) and 5–7 years’ experience in data engineering, including work with healthcare, research, or clinical data.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs