Engineer - Machine Learning Ops

iHorizons

United States

Remote

USD 100,000 - 140,000

Full time

5 days ago

Job summary

A leading provider of business solutions, iHorizons is seeking an ML Ops Engineer to design and implement scalable machine learning pipelines. This role involves managing production environments, collaborating with data scientists, and ensuring the reliability of ML operations. Ideal candidates have strong programming skills, proficiency in CI/CD practices, and a solid background in cloud platforms.

Benefits

Growth and learning opportunities
Diverse work culture

Qualifications

  • 4 years of proven experience as an ML Ops Engineer or similar role.
  • Experience with Azure and containerization technologies.
  • Familiarity with machine learning frameworks (TensorFlow, PyTorch) is valuable.

Responsibilities

  • Design, build, and maintain ML pipelines to ensure efficient deployment.
  • Develop and manage APIs to support ML models and services.
  • Implement monitoring and logging solutions for ML model performance.

Skills

Python
CI/CD
Problem-solving
Communication
Collaboration

Education

Bachelor's or master's degree in Computer Science, Engineering, or Data Science

Tools

Docker
Kubernetes
API management tools (Kong)
Azure
AWS
CI/CD tools

Job description

iHorizons is a leading provider of business solutions and technology services in the Arab World. We work with prominent clients in Media, Telecom, Government, and Sports, among others, enabling large enterprises in their digital service migrations. The ultimate outcomes are radically improved customer experiences and increased operational efficiencies.

Our Team

At iHorizons, we focus on hiring and retaining the best resources available in the region. We challenge our staff to design and implement innovative solutions, and we stimulate their creative thinking by empowering them to help our clients solve their most difficult problems.

Our Culture

With a team spread across 3 continents and 9 nationalities, iHorizons embodies the spirit of a diverse and welcoming culture of grooming, growth, and learning. We aim to foster an environment where people thrive on their skill sets while adding value to our culture through their unique experiences, strengthening our core values.

You will be responsible for designing, building, and maintaining scalable machine learning pipelines, deploying models to production environments, and ensuring the reliability and scalability of ML operations. The role involves managing infrastructure, implementing CI/CD pipelines, containerization, API management, monitoring, security, collaboration with data scientists, and performance optimization.

Reporting Structure

· This job reports to the Manager – AI.

Job Objectives

· Design, build, and maintain scalable ML pipelines and deploy models to test and production environments.

· Set up and manage cloud and on-premises infrastructure to support ML operations.

· Develop and maintain CI/CD pipelines for ML models and automate build, test, and deployment processes.

· Utilize Docker and Kubernetes for deploying ML models and manage containers for smooth operation and scalability.

· Develop and manage APIs to support ML models, monitor and secure API calls, and ensure seamless integration with external applications.

Job Responsibilities

Pipeline & APIs Deployment and Management

· Design, build, and maintain scalable machine learning pipelines to ensure efficient data processing and model deployment.

· Develop and manage APIs to support machine learning models and services.

· Ensure seamless integration between machine learning models and external applications.

· Utilize API management tools to monitor and secure API calls, enforcing access control and data protection measures.

· Deploy machine learning models to various environments, including testing and production, ensuring seamless integration and functionality.

· Ensure the reliability, availability, and scalability of ML pipelines by implementing robust monitoring and alerting systems.

· Provision pipeline operations effectively, managing resources such as compute, storage, and networking to optimize performance and cost-efficiency.

CI/CD Implementation & Containerization

· Develop and maintain CI/CD pipelines tailored for ML models and applications.

· Automate the build, test, and deployment processes.

· Utilize containerization technologies such as Docker and Kubernetes for deploying ML models, ensuring consistency and portability across environments.

· Manage and orchestrate containers effectively to optimize resource utilization and maintain scalability.

Performance Monitoring and Optimization

· Implement comprehensive monitoring and logging solutions to track the performance of ML models and pipelines, enabling proactive issue detection and resolution.

· Set up robust alerting systems to detect and respond to issues and anomalies promptly, minimizing downtime and performance degradation.

· Ensure compliance with security standards and regulations, implementing measures to protect data privacy and model security.

· Continuously monitor and optimize the performance of ML models and infrastructure, identifying and resolving bottlenecks to improve system efficiency.

· Respond to and resolve incidents related to ML operations promptly.

Scalability and Resource Optimization

· Set up and manage both cloud and on-premises infrastructure to support ML operations.

· Optimize models and infrastructure for performance and scalability in production environments, ensuring efficient and reliable operations.

· Manage resource allocation to ensure cost-effective operations.

· Develop scripts and automation tools to streamline ML operations, automating repetitive tasks to improve operational efficiency.

Disaster Recovery and Incident Report

· Implement backup and disaster recovery plans for ML models and data.

· Ensure data and model availability in case of failures.

· Conduct root cause analysis and implement preventive measures to mitigate future occurrences.

Collaboration and Best Practices

· Collaborate closely with data scientists and engineers throughout the ML lifecycle, from model development and testing to deployment and maintenance.

· Collaborate with data scientists and AI researchers to develop and test machine learning models.

· Provide support and guidance on best practices for ML operations, facilitating effective teamwork and knowledge sharing.

· Implement best practices for model versioning, testing, and validation.

Job Requirements

Educational Qualification

· Bachelor’s or master’s degree in Computer Science, Engineering, Data Science, or a related field.

Previous Work Experience

· 4 years of proven experience as an ML Ops Engineer or similar role in a production environment.

· Experience with Azure cloud platform. AWS experience is a plus.

· Experience with containerization technologies (Docker, Kubernetes).

· Experience with API management tools (Kong).

Skills and Abilities

· Strong programming skills in Python.

· Proficiency in CI/CD tools.

· Familiarity with machine learning frameworks (TensorFlow, PyTorch).

· Strong understanding of DevOps practices and principles.

· Excellent problem-solving skills and attention to detail.

· Strong communication and collaboration skills.
