Enable job alerts via email!

AI Infrastructure Engineer - C++ & Golang | Artificial Intelligence | Linux | Kubernetes | Dock[...]

Cisco Systems, Inc.

Bengaluru

On-site

INR 25,00,000 - 30,00,000

Full time

Today
Be an early applicant

Job summary

A leading tech company in Bengaluru is seeking an AI Infrastructure Engineer to design and implement next-gen AI products. The role requires 11+ years of engineering experience, strong skills in C/C++, Golang, and Kubernetes. You will optimize AI workloads and enhance system reliability to maintain leadership in AI infrastructure development. This position offers the chance to shape innovations that influence the AI community.

Qualifications

  • Proficiency in programming languages such as C/C++, Golang, Python, or eBPF.
  • Strong understanding of Linux and Kubernetes technologies.
  • 11+ years of relevant Engineering experience.

Responsibilities

  • Design and develop infrastructure components for high-performance AI workloads.
  • Benchmark and optimize AI infrastructure performance.
  • Manage installation and deployment of AI infrastructure on Kubernetes.
  • Collaborate across teams to shape AI infrastructure development.

Skills

C/C++
Golang
Python
Linux
Kubernetes

Education

Bachelor’s degree
Job description
AI Infrastructure Engineer - C++ & Golang | Artificial Intelligence | Linux | Kubernetes | Docker | 11-18 Yrs

Meet the TeamWe are an innovation team on a mission to transform how enterprises harness AI. Operating with the agility of a startup and the focus of an incubator, we’re building a tight-knit group of AI and infrastructure experts driven by bold ideas and a shared goal: to rethink systems from the ground up and deliver breakthrough solutions that redefine what's possible — faster, leaner, and smarter.

We thrive in a fast-paced, experimentation-rich environment where new technologies aren’t just welcome — they’re expected. Here, you'll work side-by-side with seasoned engineers, architects, and thinkers to craft the kind of iconic products that can reshape industries and unlock entirely new models of operation for the enterprise.

If you're energized by the challenge of solving hard problems, love working at the edge of what's possible, and want to help shape the future of AI infrastructure — we'd love to meet you.

IMPACT

Cisco is seeking a forward-thinkingAI Infrastructure Engineerto help design and implement the next-generation AI products. This role will focus on delivering high-performance, efficient, and reliable solutions that power AI workloads across Cisco's ecosystem.

As an AI Infrastructure Engineer at Cisco, you will play a pivotal role in shaping the AI systems that enable cutting-edge innovations. Your work will directly impact:

  • The performance and efficiency of AI workloads on the node.
  • The reliability and availability of AI systems for Cisco’s customers.
  • Advancements in AI and machine learning infrastructure, enabling better utilization and improving efficiency for applications across industries.
  • Collaboration across internal teams to bring system level innovation across different cisco products.

Your contributions will help Cisco maintain its leadership in AI infrastructure development and influence the broader AI and machine learning community.

Key Responsibilities

  • Design and develop node-level infrastructure components to support high-performance AI workloads.
  • Benchmark, analyze, and optimize the performance of AI infrastructure, including CUDA kernels and memory management for GPUs.
  • Minimize downtime through seamless config and upgrade architecture for software components.
  • Manage the installation and deployment of AI infrastructure on Kubernetes clusters, including the use of CRDs and operators.
  • Develop and deploy efficient telemetry collection systems for nodes and hardware components without impacting workload performance.
  • Work with distributed system fundamentals to ensure scalability, resilience, and reliability.
  • Collaborate across teams and time zones to shape the overall direction of AI infrastructure development and achieve shared goals.

Minimum Qualifications:

  • Proficiency in programming languages such as C/C++, Golang, Python, or eBPF.
  • Strong understanding of Linux operating systems, including user space and kernel-level components.
  • Experience with Linux user space development, including packaging, logging, telemetry and lifecycle management of processes.
  • Strong understanding of Kubernetes (K8s) and related technologies, such as custom resource definitions (CRDs).
  • Strong debugging and problem-solving skills for complex system-level issues.
  • Bachelor’s degree+ and relevant 11+ years of Engineering work experience.

Preferred Qualifications:

  • Linux kernel and device driver hands-on expertise is a plus.
  • Experience in GPU programming and optimization, including CUDA, UCX is a plus.
  • Experience with high-speed data transfer technologies such as RDMA.
  • Use of Nvidia GPU operators and Nvidia container toolkit and Nsight, CUPTI.
  • Nvidia MIG and MPS concepts for managing GPU consumption.

Cisco is an equal opportunities employer and welcomes applications from all qualified candidates. We are committed to providing a work environment that is free from discrimination and harassment.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.