An innovative company is seeking a Big Data & ML Infrastructure Engineer to shape the architecture of a cutting-edge AI training platform for healthcare. This role offers the opportunity to work on scalable, efficient infrastructure that directly impacts patient outcomes and clinical research. Collaborate with industry experts while enjoying a flexible work environment. If you're passionate about leveraging technology to improve healthcare, this position is perfect for you. Join a rapidly expanding team and contribute to groundbreaking projects in AI and healthcare.
Job Description
Our client is looking for a Big Data & ML Infrastructure Engineer to help them build the world’s largest AI training and validation platform for healthcare.
The Opportunity
As a Software Engineer (Big Data & ML Infrastructure), you will be at the core of this company’s mission, shaping the architecture that powers its AI ecosystem. This role is ideal for a data engineering expert passionate about building scalable, efficient, and secure infrastructure capable of handling complex healthcare data. You will collaborate with data scientists, product managers, and healthcare partners to design data pipelines that accelerate safe, impactful AI development.
Why This Role?
Cutting-Edge Work: Lead the development of cloud-based ML infrastructure at scale, managing structured and unstructured healthcare data.
High Impact: Contribute directly to AI models that improve patient outcomes and support clinical research.
Elite Team: Work with industry-leading experts in AI, healthcare, and technology.
Growth Potential: Join a well-funded, rapidly expanding company with a culture of innovation and technical excellence.
Flexible Location: Based in New York with remote options for the right candidate.
Key Responsibilities
Design and optimize ETL pipelines to process petabytes of healthcare data.
Create scalable solutions for data processing, storage, and cloud-based machine learning models.
Maintain compliance with healthcare regulations and ensure top-tier data security.
Collaborate with health system stakeholders for seamless data integration.
Document processes clearly to ensure transparency and auditability.
Ideal Candidate Profile
3+ years of Python development across the full software lifecycle.
Extensive experience with SQL and OLAP systems (Amazon Redshift, BigQuery, Snowflake).
Hands-on experience with Terraform, Docker, and cloud infrastructure (AWS, GCP, Azure).
Strong problem-solving skills and adaptability in fast-paced, ambiguous environments.
Experience with healthcare data, NLP, OCR, or AI tools is advantageous.
A collaborative, solutions-oriented approach is essential.
Compensation & Benefits
Comprehensive benefits package
Equity opportunities
Flexible, mission-driven work environment