Enable job alerts via email!

Principal Data Engineer

Careabout

New York (NY)

Remote

USD 90,000 - 150,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative healthcare organization is seeking a Principal Data Engineer to lead the design and implementation of scalable data infrastructures. This role involves collaborating with various teams to ensure data quality and compliance with HIPAA regulations. The successful candidate will utilize their expertise in Python, SQL, and AWS to create robust data pipelines and mentor junior engineers. This is an exciting opportunity to contribute to transforming healthcare through data-driven solutions in a remote work environment, where your efforts will directly impact patient outcomes and healthcare efficiency.

Benefits

Health Insurance
Dental Insurance
Vision Insurance
401K with Employer Contribution
PTO and Paid Holidays
Short and Long-term Disability Insurance
Life Insurance
Wellness Programs

Qualifications

  • 7+ years of experience in data engineering with large-scale systems.
  • Strong proficiency in Python, SQL, and data orchestration tools.
  • Experience in a HIPAA-compliant environment.

Responsibilities

  • Design and implement scalable data pipelines using Python and SQL.
  • Ensure data quality and reliability across multiple sources.
  • Mentor junior engineers and collaborate with cross-functional teams.

Skills

Python
SQL
Data Pipeline Orchestration
AWS
dbt
Data Integration
HIPAA Compliance
Mentoring

Education

Bachelor's Degree in Computer Science
Master's Degree in Computer Science

Tools

Dagster
Snowflake
AWS S3
AWS EC2
AWS Lambda

Job description

CareAbout Health is a managed services organization (MSO) that provides expert advice, resources, tools, and other support to its portfolio of medical groups and healthcare focused companies. CareAbout Health is helping align incentives to create a world where patients, providers, and payers work together in a seamless, coordinated manner toward common goals: higher quality, lower cost, better outcomes.

Role Title: Principal Data Engineer (Healthcare)

FLSA Category: Exempt

Role Location: Remote

Reporting Relationships:

This position reports to Director of Data Management.

Role Summary and Responsibilities:

As a Principal Data Engineer, you will play a critical role in designing and implementing scalable, secure, and efficient data infrastructure within a HIPAA-compliant context. Reporting to the Director of Data Management, you will serve as a technical leader on the team, mentoring engineers and collaborating closely with analytics, medical economics, product management, and data science teams. Your work will ensure that our data pipelines, models, and platforms support our mission to transform healthcare.

Key Responsibilities / Essential Functions:

Architect and implement robust, scalable data pipelines using Python, Dagster, dbt, and SQL.

Ensure data quality, consistency, and reliability across multiple data sources and domains.

Integration/ETL

Integrating data from multiple sources, including databases, APIs, and files, into a unified system.

Ensuring data is consistent, accurate, and available across systems.

Extract data from various sources, transform it (clean, format, aggregate), and load it into a target data system.

Data Modeling & Transformation

Develop and maintain data models, transformations, and orchestration logic in dbt.

Implement data governance and schema management practices in accordance with healthcare data standards.

Optimize cloud resource usage and data pipeline performance.

Technical Leadership & Mentorship

Provide guidance and best practices to junior and mid-level data engineers, fostering skill development and growth.

Collaborate with cross-functional teams (analytics, product, data science, and medical economics) to translate business requirements into technical solutions.

Compliance & Security

Uphold HIPAA and other relevant healthcare data privacy regulations, ensuring robust data protection and security measures.

Promote secure coding and data handling practices throughout the data engineering lifecycle.

Evaluate current data systems and recommend architectural improvements for long-term scalability, reliability, and performance.

Drive innovation by researching new technologies, frameworks, and methodologies that enhance our data platform.

Non-Essential Functions:

Leverage AWS services (e.g., S3, EC2, Lambda, ECS) and Snowflake to build highly performant data storage and processing solutions.

Optimize cloud resource usage and data pipeline performance.

Other duties, as assigned.

Qualifications:

Education & Experience

Bachelor’s or Master’s degree (preferred) in Computer Science, Engineering, or a related field (or equivalent experience).

7+ years of experience in data engineering, with a proven track record of designing and maintaining large-scale, high-volume data systems.

Technical Skills

Strong proficiency in Python, SQL, and data pipeline orchestration tools (preferably Dagster, or similar such as Airflow).

Hands-on experience with AWS (S3, EC2, Lambda, etc.) and Snowflake for cloud-based data solutions.

In-depth knowledge of dbt for data transformations and modeling.

Experience working with structured and semi-structured healthcare data, plus a deep understanding of data integration best practices.

Experience using source control systems and CI/CD pipelines.

Demonstrated ability to mentor and lead engineering teams while remaining hands-on in technical development.

Excellent communication skills, with the ability to collaborate effectively across cross-functional teams.

Strong problem-solving, organizational, and analytical thinking skills.

Experience working in a HIPAA/healthcare environment or other regulated industries is preferred.

Commitment to maintaining the highest standards of data security and privacy.

Preferred Qualifications:

Familiarity with other AWS services (e.g., EMR, Kinesis) or additional big data technologies.

Previous involvement in machine learning or advanced analytics initiatives.

Proficient with Scrum and Agile methodologies

Experience directly managing and mentoring a team.

Physical Requirements:

Mainly sedentary.

Sitting at the desk most of the day.

Standing or walking less than two hours per day.

Lifting no more than ten pounds on rare occasions.

Must be able to work at a computer and answer phone calls on a regular basis.

Featured Benefits:

Health, dental, and vision insurance.

401K with automatic employer contribution.

PTO and Paid Holidays.

Access to voluntary short and long-term disability insurance.

Access to additional life insurance.

Access to a variety of Wellness programs.

CareAbout Health is committed to providing an environment of mutual respect where equal opportunities are available to all applicants and employees without regard to actual or perceived race, color, creed, religion, national origin, ancestry, citizenship status, age, sex or gender (including pregnancy, childbirth, related medical conditions and lactation), gender identity or gender expression (including transgender status), sexual orientation, marital status, military service and veteran status, disability, protected medical condition as defined by applicable state or local law, genetic information, or any other characteristic protected by applicable federal, state, or local laws and ordinances (referred to as “protected characteristics”).

We are interested in every qualified candidate who is legally able to work in the United States without sponsorship. We cannot offer any visa sponsorship now at this time.

The compensation range for this position is

Compensation is based on the level and requirements of the role.

Salary within our ranges may also be determined by your education, experience, knowledge, skills, abilities, and location, as required by the role, as well as internal equity and alignment with market data.

About Us

CareAbout Health is a physician-led transformative care solution, creating an ecosystem that utilizes a technology-enabled platform and whole-person care delivery model to support our network of providers and their patients.

Our platform provides access to value-based care arrangements, is equipped with analytics, provider tools and workflows and the required care management support needed to succeed.

We continue the work to identify and solve seemingly impossible problems in healthcare; to align incentives between patients, providers, and payors; and to raise the floor in underserved communities. Ultimately, CareAbout Health envisions a world where healthcare professionals and systems work together in an efficient and effective manner to significantly and sustainably improve patients’ lives.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Principal Data Engineer - Remote US

ZipRecruiter

Columbus

Remote

USD 90,000 - 150,000

2 days ago
Be an early applicant

Lead Data Engineer - GenAI (Hybrid or Remote)

S&P Global

New York

Remote

USD 90,000 - 200,000

28 days ago

Lead Data Engineer

SoLo Funds Inc.

Los Angeles

Remote

USD 80,000 - 120,000

Today
Be an early applicant

Lead Data Engineer

RightClick

Remote

USD 90,000 - 150,000

2 days ago
Be an early applicant

Principal Data Engineer

Bentley iTwin Ventures

Exton

Remote

USD 90,000 - 150,000

5 days ago
Be an early applicant

Principal Data Engineer

Lantern

Remote

USD 120,000 - 180,000

2 days ago
Be an early applicant

Principal Data Engineer

Employer Direct Healthcare

Remote

USD 120,000 - 180,000

6 days ago
Be an early applicant

Lead Data Engineer (Remote)

Inspira Financial

Oak Brook

Remote

USD 125,000 - 150,000

2 days ago
Be an early applicant

Lead Data Engineer - GIBU

Initial Therapeutics, Inc.

Remote

USD 137,000 - 216,000

2 days ago
Be an early applicant