Enable job alerts via email!
Boost your interview chances
A leading company in the tech industry is looking for a Principal Machine Learning Operations Developer. This role involves designing and managing software systems that enhance ML capabilities, mentoring team members, and fostering multi-cloud architectures. Successful candidates will have extensive experience in MLOps, Data Engineering, and DevOps, with a strong focus on collaborative and innovative solutions.
Job Requisition ID # 25WD89028
Principal Machine Learning Operations Developer
Job Description
Position Overview
We are seeking an experienced Principal Software Engineer to join our AI/ML Platform (AMP) team. This team develops and maintains core components to accelerate ML/AI model development, including the model development studio, feature store, model serving, and observability tools. The ideal candidate will have a background in MLOps, Data Engineering, and DevOps, with experience in building scalable deployment architectures and observability systems. As a key member of our engineering team, you will help shape the future of our AI/ML capabilities, delivering innovative solutions that add value to our organization. You will report to a manager.
Responsibilities
System Design: Design, implement, and manage software systems for the AI/ML Platform, overseeing the full ML development lifecycle for partner teams.
Mentoring: Share knowledge, best practices, and conduct design reviews to elevate team expertise.
Multi-cloud Architecture: Define components leveraging multiple cloud platforms (e.g., AWS, Azure) to optimize performance, cost, and scalability.
AI/ML Observability: Develop systems for monitoring model performance, data drift, fairness/bias, and anomalies.
ML Solution Deployment: Create tools for building and deploying ML artifacts in production environments, ensuring smooth transition from development to deployment.
Big Data Management: Automate and orchestrate large-scale data transformation and processing tasks, building data stores for ML artifacts.
Scalable Services: Design low-latency, scalable prediction and inference services.
Cross-Functional Collaboration: Work with machine learning researchers, developers, product managers, and operations teams to foster collaboration.
End-to-End Ownership: Take ownership of components, including design, architecture, implementation, rollout, onboarding, support, testing, and investigations.
Minimum Qualifications
Educational Background: Bachelor’s degree in Computer Science or equivalent practical experience.
Experience: Over 8 years in software development and engineering, delivering production systems and services.
Prior experience with MLOps teams, ML model deployment, DevOps, and data engineering.
Hands-on skills in coding with Golang, Python, or Java.
Knowledge of DevOps practices, containerization, orchestration tools such as CI/CD, Terraform, Docker, Kubernetes, GitOps.
Experience with distributed data processing frameworks like Spark, Airflow, and data lake architectures using formats like Iceberg or Parquet.
Experience collaborating with Data Science teams to deploy models and implement ML observability for inference monitoring.
Exposure to building RAG-based applications in collaboration with product teams and AI engineers.
Creative problem-solving skills, with the ability to break down complex problems.
Knowledge of AWS and/or Azure for large-scale application deployment.
Excellent communication and teamwork skills.
Preferred Qualifications
Experience integrating with third-party vendors.
Latency optimization skills for serving systems.
Familiarity with tools like MLflow, Kubeflow, TensorBoard for model monitoring.
Experience with distributed model training/inference pipelines using KubeRay or similar tools.
Experience leveraging GPU computing (CUDA, OpenCL) for AI/ML workloads.
Familiarity with ML libraries such as PyTorch, TensorFlow, XGBoost, Pandas, and Scikit-Learn.
Learn More
About Autodesk
At Autodesk, we create software that transforms how things are made, from buildings to movies. We foster a culture of innovation, diversity, and belonging, where everyone can thrive and contribute to building a better future.
Salary Transparency
Starting base salary in Canada-BC ranges from $141,600 to $194,700, based on experience and location. Compensation includes bonuses, stock grants, and benefits.
Diversity & Belonging
We are committed to an inclusive culture. Learn more: https://www.autodesk.com/company/diversity-and-belonging
Existing Contractors or Consultants
If you are an existing contractor or consultant, please apply through internal channels.