Enable job alerts via email!
Boost your interview chances
A leading company is seeking a seasoned Lead MLOps Engineer to operationalize machine learning workloads. You will be instrumental in designing and maintaining cutting-edge infrastructure in AWS for ML applications. Your role will involve automating ML workflows, enhancing model reproducibility, and collaborating closely with data scientists to ensure optimal model performance.
Job Title: Lead MLOps Engineer/Architect
Location: New York, NY (Remote)
Experience Required: 12+ years in MLOps, DevOps, or related fields
Educational Qualifications: Bachelor's or Master's degree in Computer Science, Engineering, or a related field
We're seeking an experienced Lead MLOps Engineer to lead the operationalization of our Machine Learning workloads.
As a key team member, you'll be responsible for designing, building, and maintaining infrastructure required for efficient development, deployment, and monitoring of machine learning workloads. Your close collaboration with data scientists will ensure that our models are reliable, scalable, and performing optimally.
This role requires expertise in automating ML workflows, enhancing model reproducibility, and ensuring continuous integration and delivery.
Key Responsibilities:Architect scalable, cost-efficient, reliable, and secure MLOps solutions
Design, implement, and deploy MLOps solutions in AWS
Select and justify appropriate ML technologies within AWS; identify suitable AWS services for MLOps implementation
Build and maintain infrastructure for efficient development, deployment, and monitoring of ML models
Implement CI/CD pipelines for ML applications for seamless development and deployment
Collaborate with data scientists to address requirements for model serving, versioning, and reproducibility
Monitor and optimize production model performance; proactively troubleshoot issues
Automate repetitive MLOps workflows to enhance efficiency and reduce human error
Maintain technical documentation and train team members on MLOps best practices
Stay current with the latest MLOps tools, technologies, and methodologies
10+ years of experience in MLOps with a proven track record of successful deployments
Deep expertise in MLOps tools and platforms such as Kubernetes, Docker, Jenkins, Git, MLflow, and JupyterHub
Strong working knowledge of AWS and Infrastructure-as-Code tools (AWS CDK, CloudFormation)
Proficiency with DevOps methodologies and CI/CD pipelines (e.g., GitHub Actions)
Strong understanding of the ML lifecycle, pipelines, model training, and monitoring
Solid Python programming skills
Experience with ML frameworks like TensorFlow, PyTorch, and/or scikit-learn
Experience with large language models (LLMs) and their operational challenges is a plus
Excellent communication, collaboration, and problem-solving skills
Ability to explain complex technical topics to non-technical stakeholders
Passion for innovation and streamlining ML/LLM workflows
AWS Certified Machine Learning Specialty
Experience with A/B testing and model performance monitoring