Enable job alerts via email!
Boost your interview chances
Create a job specific, tailored resume for higher success rate.
An innovative company is seeking a Big Data & ML Infrastructure Engineer to help build a cutting-edge AI training platform for healthcare. In this pivotal role, you'll design ETL pipelines and scalable data processing solutions, collaborating with top experts to improve patient outcomes. Join a rapidly expanding firm that values learning and innovation, offering a flexible work environment and comprehensive benefits. Your contributions will directly impact the future of AI in healthcare, making this an exciting opportunity for data engineering professionals passionate about technology and healthcare.
Job Description
Our client is looking for a Big Data & ML Infrastructure Engineer to help them on their mission to build the world’s largest AI training and validation platform for healthcare.
The Opportunity
As a Software Engineer (Big Data & ML Infrastructure), you will be at the core of this company’s mission, shaping the architecture that powers its AI ecosystem. This role is ideal for a data engineering expert passionate about building scalable, efficient, and secure infrastructure to handle complex healthcare data. You will collaborate with data scientists, product managers, and healthcare partners to design data pipelines that accelerate AI development, ensuring safety and impact.
Why This Role?
Cutting-Edge Work: Lead the development of cloud-based ML infrastructure at scale, managing structured and unstructured healthcare data.
High Impact: Your work will contribute directly to AI models that improve patient outcomes and support clinical research.
Elite Team: Work alongside industry-leading experts in AI, healthcare, and technology.
Growth Potential: Join a well-funded, rapidly expanding company committed to learning, innovation, and excellence.
Flexible Location: Based in New York with remote options for suitable candidates.
Key Responsibilities
Design and optimize ETL pipelines to process petabytes of healthcare data.
Create scalable solutions for data processing, storage, and cloud-based ML models.
Maintain compliance with healthcare regulations and uphold top-tier data security.
Collaborate with health system stakeholders to enable seamless data flow.
Document processes clearly to ensure transparency and auditability.
Ideal Candidate Profile
3+ years of Python development across the full software lifecycle.
Extensive experience with OLAP systems (AWS Redshift, BigQuery, Snowflake) and SQL.
Practical expertise with Terraform, Docker, and cloud platforms (AWS, GCP, Azure).
Strong problem-solving skills and the ability to thrive in fast-paced, ambiguous environments.
Experience in healthcare data, NLP, OCR, or AI tools is advantageous.
A collaborative, solutions-oriented mindset.
Compensation & Benefits
Comprehensive benefits package
Equity opportunities
Flexible, mission-driven work environment