Enable job alerts via email!

Principal Data Engineer

MedStar Health

Brisbane (CA)

Remote

USD 181,000 - 235,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

MedStar Health is seeking a Principal Data Engineer to develop clinical data engineering solutions that improve transplant patient outcomes. The role involves designing scalable data pipelines, utilizing AI, and collaborating with cross-functional teams in healthcare. Ideal candidates will have a PhD and extensive experience in data engineering, alongside strong programming skills.

Benefits

Competitive base salary and incentive compensation
Health and welfare benefits including gym reimbursement
401(k) savings plan match
Employee Stock Purchase Plan
Pre-tax commuter benefits
Paid leave for organ donors

Qualifications

  • 10+ years in data engineering and AI, specializing in healthcare.
  • Expert in ETL pipelines and database management.
  • 5+ years leveraging cloud platforms for data solutions.

Responsibilities

  • Design, optimize, and manage end-to-end ETL pipelines for large-scale datasets.
  • Build and deploy NLP pipelines to enhance precision medicine capabilities.
  • Lead the development of AI-driven tools tailored for healthcare innovation.

Skills

Python
SQL
R
Bash
Git
Natural Language Processing
AI

Education

PhD in Computer Science, Bioinformatics, or related field

Tools

Databricks
Azure
PyTorch
TensorFlow
Scikit-learn

Job description

CareDx, Inc. is a leading precision medicine solutions company focused on the discovery, development, and commercialization of clinically differentiated, high-value healthcare solutions for transplant patients and caregivers. CareDx offers products, testing services, and digital healthcare solutions along the pre- and post-transplant patient journey, and is the leading provider of genomics-based information for transplant patients.

We are seeking a Principal Data Engineer to contribute to developing clinical data engineering solutions. This role will assist in designing and implementing scalable data pipelines and AI-driven tools to support our mission of improving transplant patient outcomes. The ideal candidate has foundational knowledge in data engineering, OMOP, and natural language processing (NLP), with an interest in cloud computing and precision medicine.

Key Responsibilities:

  • Scalable Data Pipelines: Design, optimize, and manage end-to-end ETL pipelines to ingest, transform, normalize, and integrate large-scale EMR datasets from diverse sources, ensuring robust and scalable data architectures.
  • Natural Language Processing (NLP): Build and deploy NLP pipelines to extract and standardize longitudinal clinical features from unstructured data, enhancing CareDx's precision medicine capabilities and enabling actionable insights.
  • AI/ML Innovation: Spearhead developing and implementing cutting-edge AI-driven NLP systems tailored to internal stakeholder needs, accelerating the creation of transformative healthcare AI products.
  • Cloud & Distributed Computing: Utilize cloud platforms (Databricks/Azure) and distributed computing frameworks to deploy and automate AI solutions, optimizing for scalability, cost-efficiency, and high availability in diagnostic and research applications.
  • Genomics & Precision Medicine: Partner with bioinformaticians, data scientists, machine learning experts, and clinical teams to integrate multi-omics data into AI models, driving improved patient transplant outcomes.
  • System Reliability: Uphold data integrity, security, and disaster recovery standards across distributed systems, ensuring operational resilience for CareDx's clinical and research initiatives.
  • Leadership & Mentorship: Provide technical guidance to cross-functional teams, championing best practices in data engineering and AI development to foster a culture of excellence and collaboration.
  • Innovation: Research and implement state-of-the-art techniques to advance CareDx's leadership in transplant innovation, delivering impactful solutions at the forefront of healthcare technology.

Qualifications:

  • Education: PhD in Computer Science, Bioinformatics, Biomedical Informatics, or a related field.
  • Experience: 10+ years in data engineering and AI, specializing in processing large-scale clinical, genomic, or molecular datasets within healthcare or diagnostics
  • Programming: Expert in Python, SQL, R, Bash, and Git with a strong command of modern development workflows
  • Data Engineering: Extensive experience on ETL pipelines, database management, and data modeling.
  • Cloud Platforms: 5+ years leveraging Databricks or Azure for deploying robust, cloud-based solutions
  • Clinical NLP: Experience or coursework in NLP techniques for processing clinical text.
  • OMOP CDM: Expertise with the OMOP Common Data Model and clinical data standardization.
  • AI Development: Demonstrated expertise in building AI models with PyTorch/TensorFlow or Scikit-learn and their application to structured/unstructured data to drive innovative solutions.
  • Real World Data: Experience with real-world evidence studies, experience with EMR QC is a must.

San Francisco Bay Area:

The anticipated base salary range for candidates who will work in Brisbane, California is $181,000 to $235,000. The final salary offered to a successful candidate will be dependent on several factors that may include but are not limited to the type and length of experience within the job, type and length of experience within the industry, education, etc. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units. CareDx is a multi-state employer, and this salary range may not reflect positions that work in other states.

REMOTE: US only

The anticipated base salary range in the United States is $162,000 to $210,000. The final salary offered to a successful candidate will be dependent on several factors that may include but are not limited to the type and length of experience within the job, type and length of experience within the industry, education, etc. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units. CareDx is a multi-state employer, and this salary range may not reflect positions that work in other states.

Additional Details:

Every individual at CareDx has a direct impact on our collective mission to improve the lives of organ transplant patients worldwide. We believe in taking great care of our people, so they take even greater care of our patients.

Our competitive Total Rewards package includes:

  • Competitive base salary and incentive compensation
  • Health and welfare benefits including a gym reimbursement program
  • 401(k) savings plan match
  • Employee Stock Purchase Plan
  • Pre-tax commuter benefits
  • And more!
  • Please refer to our page to view detailed benefits at https://caredx.com/company/careers/

In addition, we have a Living Donor Employee Recovery Policy that allows up to 30 days of paid leave annually to a full-time employee who makes the selfless act of donating an organ or bone marrow.

With products that are making a difference in the lives of transplant patients today and a promising pipeline for the future, it's an exciting time to be part of the CareDx team. Join us in partnering with transplant patients to transform our future together.

CareDx, Inc. is an Equal Opportunity Employer and participates in the E-Verify program.

By proceeding with our application and submitting your information, you acknowledge that you have read our U.S. Personnel Privacy Notice and consent to receive email communication from CareDx.

******** We do not accept resumes from headhunters, placement agencies, or other suppliers that have not signed a formal agreement with us.

#LI-Remote

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Lead Data Engineer

Jobot

Salt Lake City null

Remote

Remote

USD 150,000 - 190,000

Full time

Today
Be an early applicant

Lead Data Engineer

Jobot

Grand Prairie null

Remote

Remote

USD 150,000 - 190,000

Full time

Today
Be an early applicant

Lead Data Engineer

Trust & Will

California null

Remote

Remote

USD 166,000 - 215,000

Full time

2 days ago
Be an early applicant

Principal Data Engineer

Sky Mavis

null null

Remote

Remote

USD 210,000 - 220,000

Full time

5 days ago
Be an early applicant

Data Engineer- Manager

PwC - Global

null null

Remote

Remote

USD 100,000 - 232,000

Full time

2 days ago
Be an early applicant

[Hiring] Principal Data Engineer @The Nuclear Company

The Nuclear Company

null null

Remote

Remote

USD 198,000 - 228,000

Full time

13 days ago

Principal Data Engineer

b.well Connected Health

Chicago null

Remote

Remote

USD 175,000 - 210,000

Full time

8 days ago

Chief Data Scientist (Remote)

MedStar Health

San Francisco null

Remote

Remote

USD 150,000 - 250,000

Full time

Yesterday
Be an early applicant

Chief Data Scientist (Remote)

TRIMEDX

San Francisco null

Remote

Remote

USD 150,000 - 230,000

Full time

Yesterday
Be an early applicant