Enable job alerts via email!
Boost your interview chances
Create a job specific, tailored resume for higher success rate.
An innovative healthcare organization is seeking a Principal Data Engineer to lead the design and implementation of scalable data infrastructures. This role involves collaborating with various teams to ensure data quality and compliance with HIPAA regulations. The successful candidate will utilize their expertise in Python, SQL, and AWS to create robust data pipelines and mentor junior engineers. This is an exciting opportunity to contribute to transforming healthcare through data-driven solutions in a remote work environment, where your efforts will directly impact patient outcomes and healthcare efficiency.
CareAbout Health is a managed services organization (MSO) that provides expert advice, resources, tools, and other support to its portfolio of medical groups and healthcare focused companies. CareAbout Health is helping align incentives to create a world where patients, providers, and payers work together in a seamless, coordinated manner toward common goals: higher quality, lower cost, better outcomes.
Role Title: Principal Data Engineer (Healthcare)
FLSA Category: Exempt
Role Location: Remote
Reporting Relationships:
This position reports to Director of Data Management.
Role Summary and Responsibilities:
As a Principal Data Engineer, you will play a critical role in designing and implementing scalable, secure, and efficient data infrastructure within a HIPAA-compliant context. Reporting to the Director of Data Management, you will serve as a technical leader on the team, mentoring engineers and collaborating closely with analytics, medical economics, product management, and data science teams. Your work will ensure that our data pipelines, models, and platforms support our mission to transform healthcare.
Key Responsibilities / Essential Functions:
Architect and implement robust, scalable data pipelines using Python, Dagster, dbt, and SQL.
Ensure data quality, consistency, and reliability across multiple data sources and domains.
Integration/ETL
Integrating data from multiple sources, including databases, APIs, and files, into a unified system.
Ensuring data is consistent, accurate, and available across systems.
Extract data from various sources, transform it (clean, format, aggregate), and load it into a target data system.
Data Modeling & Transformation
Develop and maintain data models, transformations, and orchestration logic in dbt.
Implement data governance and schema management practices in accordance with healthcare data standards.
Optimize cloud resource usage and data pipeline performance.
Technical Leadership & Mentorship
Provide guidance and best practices to junior and mid-level data engineers, fostering skill development and growth.
Collaborate with cross-functional teams (analytics, product, data science, and medical economics) to translate business requirements into technical solutions.
Compliance & Security
Uphold HIPAA and other relevant healthcare data privacy regulations, ensuring robust data protection and security measures.
Promote secure coding and data handling practices throughout the data engineering lifecycle.
Evaluate current data systems and recommend architectural improvements for long-term scalability, reliability, and performance.
Drive innovation by researching new technologies, frameworks, and methodologies that enhance our data platform.
Non-Essential Functions:
Leverage AWS services (e.g., S3, EC2, Lambda, ECS) and Snowflake to build highly performant data storage and processing solutions.
Optimize cloud resource usage and data pipeline performance.
Other duties, as assigned.
Qualifications:
Education & Experience
Bachelor’s or Master’s degree (preferred) in Computer Science, Engineering, or a related field (or equivalent experience).
7+ years of experience in data engineering, with a proven track record of designing and maintaining large-scale, high-volume data systems.
Technical Skills
Strong proficiency in Python, SQL, and data pipeline orchestration tools (preferably Dagster, or similar such as Airflow).
Hands-on experience with AWS (S3, EC2, Lambda, etc.) and Snowflake for cloud-based data solutions.
In-depth knowledge of dbt for data transformations and modeling.
Experience working with structured and semi-structured healthcare data, plus a deep understanding of data integration best practices.
Experience using source control systems and CI/CD pipelines.
Demonstrated ability to mentor and lead engineering teams while remaining hands-on in technical development.
Excellent communication skills, with the ability to collaborate effectively across cross-functional teams.
Strong problem-solving, organizational, and analytical thinking skills.
Experience working in a HIPAA/healthcare environment or other regulated industries is preferred.
Commitment to maintaining the highest standards of data security and privacy.
Preferred Qualifications:
Familiarity with other AWS services (e.g., EMR, Kinesis) or additional big data technologies.
Previous involvement in machine learning or advanced analytics initiatives.
Proficient with Scrum and Agile methodologies
Experience directly managing and mentoring a team.
Physical Requirements:
Mainly sedentary.
Sitting at the desk most of the day.
Standing or walking less than two hours per day.
Lifting no more than ten pounds on rare occasions.
Must be able to work at a computer and answer phone calls on a regular basis.
Featured Benefits:
Health, dental, and vision insurance.
401K with automatic employer contribution.
PTO and Paid Holidays.
Access to voluntary short and long-term disability insurance.
Access to additional life insurance.
Access to a variety of Wellness programs.
CareAbout Health is committed to providing an environment of mutual respect where equal opportunities are available to all applicants and employees without regard to actual or perceived race, color, creed, religion, national origin, ancestry, citizenship status, age, sex or gender (including pregnancy, childbirth, related medical conditions and lactation), gender identity or gender expression (including transgender status), sexual orientation, marital status, military service and veteran status, disability, protected medical condition as defined by applicable state or local law, genetic information, or any other characteristic protected by applicable federal, state, or local laws and ordinances (referred to as “protected characteristics”).
We are interested in every qualified candidate who is legally able to work in the United States without sponsorship. We cannot offer any visa sponsorship now at this time.
The compensation range for this position is
Compensation is based on the level and requirements of the role.
Salary within our ranges may also be determined by your education, experience, knowledge, skills, abilities, and location, as required by the role, as well as internal equity and alignment with market data.
CareAbout Health is a physician-led transformative care solution, creating an ecosystem that utilizes a technology-enabled platform and whole-person care delivery model to support our network of providers and their patients.
Our platform provides access to value-based care arrangements, is equipped with analytics, provider tools and workflows and the required care management support needed to succeed.
We continue the work to identify and solve seemingly impossible problems in healthcare; to align incentives between patients, providers, and payors; and to raise the floor in underserved communities. Ultimately, CareAbout Health envisions a world where healthcare professionals and systems work together in an efficient and effective manner to significantly and sustainably improve patients’ lives.