Get AI-powered advice on this job and more exclusive features.
Ardan Labs is seeking a skilled MLOps Engineer to support the deployment, automation, and scaling of machine learning systems for one of our fast-growing clients. The ideal candidate has a strong engineering foundation, deep DevOps knowledge, and hands-on experience bridging the gap between data science and production-grade ML infrastructure.
Responsibilities- Kubernetes & ML Workload Deployment: Design, deploy, and manage scalable ML workloads, including LLMs, on Kubernetes clusters. Implement best practices for container orchestration, ensuring high availability and fault tolerance.
- CI/CD Pipeline Development: Develop and maintain GitLab CI/CD pipelines for automated testing, integration, and deployment of APIs and ML models. Implement advanced release strategies, such as blue-green deployments and canary releases, to minimize downtime and ensure reliability.
- Infrastructure as Code (IaC): Utilize Terraform to provision and manage cloud infrastructure on Google Cloud Platform (GCP). Ensure infrastructure is modular, reusable, and adheres to best practices for scalability and security.
- Configuration Management: Employ Helm and Kustomize for managing Kubernetes manifests and application configurations. Streamline deployment processes and maintain consistency across environments.
- Continuous Deployment & Monitoring: Integrate Argo CD for continuous deployment, ensuring that applications are always in sync with the desired state. Monitor system performance, implement logging, and set up alerting mechanisms to proactively address issues.
- Collaboration & Documentation: Work closely with data scientists, software engineers, and DevOps teams to understand requirements and deliver solutions. Document processes, architectures, and workflows to facilitate knowledge sharing and onboarding.
Requirements- 5+ years of professional experience in software engineering, DevOps, or data infrastructure
- 3+ years of hands-on experience with MLOps, ML pipeline automation, or production ML systems
- Strong knowledge of Docker and Kubernetes (preferably with real-time ML use cases)
- Experience working with CI/CD tools for machine learning workflows
- Proficiency in Python and/or Go (experience writing CLI tools or ML-related services a big plus)
- Familiarity with model training and orchestration tools such as Airflow, Kubeflow, MLflow, SageMaker, or Vertex AI
- Experience with cloud platforms (AWS/GCP/Azure), especially for model deployment and scaling
- Excellent communication skills and ability to collaborate cross-functionally with data and engineering teams
Nice to Have
- Experience with feature stores, data validation frameworks, or data versioning tools like DVC
- Exposure to streaming data systems (Kafka, Pulsar) for online inference or feature pipelines
- Familiarity with security practices for ML systems and data compliance (GDPR, HIPAA)
- Contributions to open-source MLOps tools or platforms
Why Work With Ardan Labs- Collaborate with high-performing teams on cutting-edge systems and technologies
- Join a team that values clean architecture, automation, and high engineering standards
Seniority level
Employment type
Job function
- Engineering and Information Technology
Industries
Referrals increase your chances of interviewing at Ardan Labs by 2x
Get notified about new Engineer jobs in United States.