Enable job alerts via email!

Senior AI Infrastructure Engineer

Quantum Talent Group

Abu Dhabi

On-site

AED 120,000 - 200,000

Full time

5 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company is seeking a skilled senior AI infrastructure engineer to manage the deployment and maintenance of Red Hat OpenShift and AI platforms. The role involves optimizing Kubernetes environments and providing expertise in container orchestration, requiring strong analytical skills and a background in computer science.

Qualifications

  • 5+ years experience provisioning and administering container orchestration platforms.
  • Proven experience as a Red Hat OpenShift administrator or Kubernetes administrator.
  • Excellent communication skills and ability to articulate information.

Responsibilities

  • Deploy, configure, and manage OpenShift Container Platform and Kubernetes infrastructure.
  • Monitor and analyze performance metrics to prevent issues.
  • Provide best practice guidance on configuration for container orchestration.

Skills

Container orchestration
Virtualization
Configuration management
Networking
AI/ML concepts
Analytical skills
Problem-solving
Communication

Education

Bachelor's degree in Computer Science

Tools

Ansible
Terraform
Prometheus
Grafana

Job description

We are looking for a skilled senior AI infrastructure engineer to manage the provisioning, deployment, optimization and maintenance of Red Hat OpenShift Container Platform and Red Hat OpenShift AI platform to support AI/ML workflows. The ideal candidate is expected to deploy and maintain production Kubernetes environments, providing expertise in OpenShift to our clients and partners. Proficiency with virtualization, container orchestration, configuration management, and their capabilities are required, as well as familiarity with networking, databases, operating systems, and AI/ML concepts.

Responsibilities:

Administration of Red Hat OpenShift Solutions

  • Deploy, configure, and manage OpenShift Container Platform, OpenShift AI, and associated Kubernetes infrastructure for a variety of client environments.
  • Engage in all aspects of OpenShift administration, including the management of users and policies, resources, networking configuration, creation and management of applications, and configuration of pod scheduling and cluster scaling.
  • Perform routine upgrades, patching and maintenance to ensure infrastructure is secure and up to date.
  • Implement and maintain automated solutions for provisioning and configuring the OpenShift environment and its associated infrastructure.
  • Monitor and analyze performance metrics, working proactively to prevent issues before they impact operations.
  • Troubleshoot and resolve infrastructure issues, ensuring minimal downtime and high level of performance.
  • Provide best practice guidance on configuration for container orchestration platforms across multiple applications and projects.

Maintain thorough documentation for processes, platform architecture, system configurations, and troubleshooting steps.

Qualifications:

Required Skills & Experience:

  • Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent work experience)
  • 5+ years experience provisioning and administering container orchestration platforms to support mission critical workloads.
  • Proven experience as a Red Hat OpenShift administrator or Kubernetes administrator in a production environment.
  • Proficiency with Red Hat Enterprise Linux (RHEL), Red Hat Core OS (RHCOS), or similar Red Hat-based Linux distributions.
  • Experience with automation tools for infrastructure provisioning and configuration such as Ansible and Terraform.
  • Familiarity with monitoring and logging tools such as Prometheus, Grafana, or similar.
  • Understanding of networking principles and security best practices in a containerized environment.
  • Excellent communication skills and ability to articulate information to both technical and non-technical stakeholders.
  • Ability to work collaboratively in a cross-functional environment and adapt to evolving requirements and priorities.
  • Strong analytical, problem-solving, and critical thinking skills with a keen attention to detail

Preferred:

  • Certification(s): Red Hat Specialist in OpenShift Administration, Red Hat Certified Specialist in OpenShift AI and/or Certified Kubernetes Administrator (CKA)
  • Familiarity with cloud platforms and integrating OpenShift within hybrid/multi-cloud environments
  • Knowledge of AI/ML workload management on OpenShift AI is a plus, as well as familiarity with GPU management and training and inferencing applications
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.