An established industry player is seeking a Big Data & ML Infrastructure Engineer to revolutionize healthcare AI. In this pivotal role, you will design and optimize ETL pipelines, ensuring the infrastructure can handle vast amounts of healthcare data. Collaborating with a talented team, your work will directly impact patient outcomes and clinical research. This opportunity offers a flexible work environment and the chance to work with cutting-edge technology in a rapidly growing company. If you are passionate about data engineering and eager to make a difference in healthcare, this role is perfect for you.
Job Description
Our client is looking for a Big Data & ML Infrastructure Engineer to help them build the world’s largest AI training and validation platform for healthcare.
The Opportunity
As a Software Engineer (Big Data & ML Infrastructure), you will be at the core of this company’s mission, shaping the architecture that powers its AI ecosystem. This role is ideal for a data engineering expert passionate about building scalable, efficient, and secure infrastructure capable of handling complex healthcare data. You will collaborate with data scientists, product managers, and healthcare partners to design data pipelines that accelerate, secure, and enhance AI development.
Why This Role?
Cutting-Edge Work: Lead the development of cloud-based ML infrastructure at scale, managing structured and unstructured healthcare data.
High Impact: Your contributions will directly support AI models that improve patient outcomes and advance clinical research.
Elite Team: Work alongside industry-leading experts in AI, healthcare, and technology.
Growth Potential: Join a well-funded, rapidly growing company committed to learning, innovation, and technical excellence.
Flexible Location: Based in New York with remote options for the right candidate.
Key Responsibilities
Design and optimize ETL pipelines to process petabytes of healthcare data.
Create scalable solutions for data processing and storage, and for cloud-based machine learning models.
Maintain compliance with healthcare regulations and ensure top-tier data security.
Collaborate with health system stakeholders to enable seamless data transfer.
Document processes clearly to ensure transparency and auditability.
Ideal Candidate Profile
3+ years of Python development covering the entire software lifecycle.
Extensive experience with SQL and OLAP systems (Amazon Redshift, BigQuery, Snowflake).
Hands-on experience with Terraform, Docker, and cloud infrastructure (AWS, GCP, Azure).
Strong problem-solving skills and adaptability to fast-paced, ambiguous environments.
Experience with healthcare data, NLP, OCR, or AI tools is advantageous.
A collaborative, solutions-oriented team player.
Compensation & Benefits
Comprehensive benefits package
Equity opportunities
Flexible, mission-driven work environment