Enable job alerts via email!

Principal, Data Engineering (Remote)

Jazz Pharmaceuticals, Inc.

City Of London

Remote

GBP 80,000 - 100,000

Full time

Yesterday
Be an early applicant

Job summary

A leading biopharmaceutical company in London is seeking a Principal Data Engineer to support and innovate data projects across the Research and Development sectors. The ideal candidate will have extensive experience in data engineering, particularly focusing on healthcare-related data and a strong background in AWS technologies. This role demands a proficient understanding of data modeling, machine learning operations, and excellent problem-solving skills. Competitive salary and benefits are offered.

Qualifications

  • Strong knowledge of data engineering tools such as Python, R, and SQL for data processing.
  • 3-5 years of experience in data engineering with a healthcare focus.

Responsibilities

  • Support data pipeline design and maintenance for various data sources.
  • Optimize ETL/ELT processes for structured and unstructured data.
  • Develop data quality frameworks and validation processes.

Skills

Data engineering tools
AWS services
Python
R
SQL
Data modeling
Communication skills
MLOps
Agile development

Education

Bachelor's Degree in Computer Science, Statistics, Mathematics, Life Sciences
Master's Degree preferred

Tools

Docker
Amazon Redshift
AWS S3
Job description

The Principal will be responsible for supporting complex or leading singular projects related to data engineering requirements and initiatives across Jazz Research and Development. The Principal will support data projects from across the business including Clinical, Pre-Clinical, Non-Clinical, Chemistry, RWD and Omics.

Essential Functions:

  • Support the design, development and maintenance of data pipelines for processing Research and Development data from diverse sources (Clinical Trials, Medical Devices, Pre-Clinical, Omics, Real World Data) utilizing the AWS technology platform.
  • Create and optimize ETL/ELT processes for structured and unstructured data using Python, R, SQL, AWS services and other tools.
  • Build and maintain data repositories using AWS S3 and FSx technologies. Establish data warehousing solutions using Amazon Redshift.
  • Build and maintain standard data models.
  • Develop data quality frameworks, validation processes and KPIs to ensure accuracy and consistency of data pipelines.
  • Implement data versioning and lineage tracking to support data traceability, regulatory compliance and audit requirements.
  • Create and maintain documentation for data processes, architectures, and workflows.
  • Implement modern software development best practices (e.g. Code Versioning, DevOps, CD/CI).
  • Support collaboration with RnD Researchers, Data scientists and Stakeholders to understand data requirements and deliver appropriate solutions in a global working model.
  • Maintain compliance with data privacy regulations such as HIPAA, GDPR
  • May be required to develop, deliver or support data literacy training across R&D.

Requirements:

  • Strong knowledge of data engineering tools such as Python, R and SQL for data processing.
  • Strong proficiency with AWS services particularly S3, Redshift, FSx, Glue, Lambda.
  • Strong proficiency with relational databases.
  • Strong background in data modeling and database design.
  • Familiarity with unstructured database technologies (e.g. NoSQL) and other database types (e.g. Graph).
  • Familiarity with Containerization such as Docker and EKS/Kubernetes.
  • Familiarity with one or more RnD research process and associated regulatory requirements.
  • Exposure to healthcare data standards (CDISC, HL7, FHIR, SNOMED CT, OMOP, DICOM).
  • Exposure to big data technologies and handling.
  • Knowledge of machine learning operations (MLOps) and model deployment.
  • Strong problem-solving and analytical abilities.
  • Excellent communication skills for collaborating with stakeholders.
  • Experience working in an Agile development environment.

Education and Experience:

  • Bachelor's Degree in Computer Science, Statistics, Mathematics, Life Sciences, or other relevant scientific fields; Master's Degree preferred
  • 3-5 years of experience in data engineering, with at least 1.5 years focusing on healthcare, research or clinical related data

About Jazz Pharmaceuticals:

Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and their families. We are dedicated to developing life-changing medicines for people with serious diseases - often with limited or no therapeutic options. We have a diverse portfolio of marketed medicines, including leading therapies for sleep disorders and epilepsy, and a growing portfolio of cancer treatments.

Jazz Pharmaceuticals is an equal opportunity/affirmative action employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any characteristic protected by law.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.