Senior Solutions Architect, Cloud Infrastructure and DevOps - NVIS

Nvidia Corp

Saudi Arabia

On-site

USD 84,000 - 110,000

Full time

12 days ago

Job summary

A leading global corporation is seeking a Senior Cloud Infrastructure/DevOps Solutions Architect for its Infrastructure Specialist Team in Saudi Arabia. The role involves maintaining and optimizing HPC/AI clusters, developing CI/CD pipelines, and working closely with customers to deliver innovative solutions in AI and data analytics. Ideal candidates will have extensive experience in networking, Linux systems, and cloud technologies.

Benefits

Competitive salaries
Extensive benefits
Diverse and inclusive work environment

Qualifications

  • At least 8 years of professional experience in networking and data center architecture.
  • Hands-on experience with Kubernetes for AI/ML workloads.
  • Experience managing HPC clusters and strong Linux knowledge (Red Hat/CentOS, Ubuntu).

Responsibilities

  • Maintain large-scale HPC/AI clusters with monitoring and alerting.
  • Develop and maintain CI/CD pipelines.
  • Create tooling for automation and resource management.

Skills

Networking fundamentals
TCP/IP
Linux systems knowledge
Python
Bash scripting
Kubernetes
CI/CD pipelines

Education

BS/MS/PhD in Computer Science or related fields

Tools

Jenkins
Ansible
Puppet
Chef

Job description

NVIDIA is looking for a Senior Cloud Infrastructure/DevOps Solutions Architect to join its NVIDIA Infrastructure Specialist Team. Academic and commercial groups around the world are using NVIDIA products to revolutionize deep learning and data analytics, and to power data centers. Join the team building many of the largest and fastest AI/HPC systems in the world! We are seeking someone with excellent interpersonal skills to work on a dynamic, customer-focused team. This role involves interacting with customers, partners, and internal teams to analyze, define, and implement large-scale networking projects. The scope includes networking, system design, and automation, as well as serving as the primary point of contact for the customer.

What you'll be doing:

  • Maintain large-scale HPC/AI clusters with monitoring, logging, and alerting
  • Manage Linux job/workload schedulers and orchestration tools
  • Develop and maintain CI/CD pipelines
  • Create tooling to automate deployment and management of infrastructure, operational monitoring, alerting, and self-service resource consumption
  • Deploy monitoring solutions for servers, network, and storage
  • Perform troubleshooting from bare metal to application level
  • Develop, redefine, and document standard methodologies for internal teams
  • Support R&D activities and engage in POCs/POVs for future improvements

What we need to see:

  • BS/MS/PhD or equivalent in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related fields
  • At least 8 years of professional experience in networking fundamentals, TCP/IP, and data center architecture
  • Knowledge of HPC and AI solution technologies, including CPUs, GPUs, high-speed interconnects, and supporting software
  • Hands-on experience with Kubernetes, including container orchestration for AI/ML workloads, resource scheduling, scaling, and HPC integration
  • Experience managing and deploying HPC clusters, including troubleshooting and optimization
  • Strong Linux systems knowledge (Red Hat/CentOS, Ubuntu), including internals and protocols
  • Experience with storage solutions such as Lustre, GPFS, ZFS, and XFS, and familiarity with emerging storage technologies
  • Proficiency in Python and Bash scripting
  • Experience with automation/configuration tools like Jenkins, Ansible, Puppet, or Chef

Ways to stand out from the crowd:

  • Knowledge of CI/CD pipelines for deployment and automation
  • Experience with Kubernetes, microservice architectures, and GPU hardware/software (DGX, CUDA)
  • Background with RDMA fabrics like InfiniBand or RoCE

NVIDIA leads in AI, HPC, and Visualization. We offer competitive salaries, extensive benefits, and a diverse, inclusive, flexible work environment. We are committed to fostering a supportive workplace for all.
