¡Activa las notificaciones laborales por email!

Kubernetes DevOpsPlatform Engineer

Mirantis

Madrid

A distancia

EUR 60.000 - 90.000

Jornada completa

Hoy
Sé de los primeros/as/es en solicitar esta vacante

Descripción de la vacante

A leading cloud infrastructure company is seeking a skilled Kubernetes DevOps / Platform Engineer to drive custom integration across their platform. Responsibilities include designing and deploying Kubernetes clusters for AI workloads and providing operational support. Ideal candidates will have advanced Kubernetes expertise and strong automation skills. This full-time position offers remote work with a competitive compensation package.

Servicios

Professional development and training
Competitive compensation package
Company outings and hackathons

Formación

  • Hands-on experience operating production Kubernetes clusters.
  • Experience in troubleshooting complex multi-tenant environments.
  • Strong scripting and automation skills in Bash and Python.

Responsabilidades

  • Provide operational support by diagnosing and resolving system issues.
  • Design and deploy bare metal Kubernetes clusters for GPU workloads.
  • Implement GPU workload onboarding systems for training and inference.

Conocimientos

Kubernetes expertise
Troubleshooting
Cloud infrastructure
Automation scripting (Bash, Python, etc.)
CI/CD automation
GitOps practices
Virtualization technologies
Nvidia GPU infrastructure
InfiniBand networking
Software Defined Networking
Cluster API framework

Herramientas

Kubernetes
GitHub CI
Ansible
Terraform
KVM
Metal3
Descripción del empleo

We are looking for a skilled Kubernetes DevOps / Platform Engineer to drive end-to-end custom integration across our k0rdent-ai platform. You will collaborate with our engineering teams to design and deliver scalable GPU infrastructure orchestration based on Kubernetes stack.

Mirantis k0rdent AI empowers platform architects and MLOps engineers with open composable infrastructure management for AI workloads and scalable inference application hosting at scale.

It allows rapid deployment and execution of models alongside core application components and Mirantis-validated foundation services. Deployment can be performed on any cloud or infrastructure with zero lock-in all built on Kubernetes standards.

It also enables automated observation scaling and management to ensure optimal performance GPU utilization and cost efficiency.

Responsibilities
  • Provide operational support by diagnosing triaging and resolving complex system issues
  • Design and deploy bare metal Kubernetes clusters for GPU / AI workloads in customer datacenters
  • Design and implement datacenter networking with Nvidia Bluefield 3 DPUs
  • Configure and troubleshoot Infiniband fabrics for high-performance GPU interconnects
  • Implement Metal3-based bare metal provisioning pipelines for physical server infrastructure
  • Configure and integrate Kubevirt for VM-based workloads on Kubernetes
  • Deploy and manage k0rdent (Cluster API-based) tooling for Kubernetes cluster lifecycle management for tenant clusters
  • Implement GPU workload onboarding systems for training and inference
  • Build automation using GitHub CI for product integration testing
  • Work directly with product teams to collect and drive the requirements for future features / fixes
Qualifications
  • Advanced Kubernetes expertise - Hands-on experience operating production clusters including Deep understanding of Kubernetes architecture controllers and operators
  • Experience with Cluster API lifecycle management and upgrades
  • Troubleshooting complex multi-tenant environments
  • Custom Resource Definitions (CRDs) and operator patterns
  • Storage networking (NVMe-oF Ceph)
  • GitHub Actions / CI or similar automation platforms
  • Bare metal infrastructure management - Direct experience provisioning and managing physical servers BIOS / firmware management and hardware lifecycle automation
  • Virtualization technologies - Practical experience with KVM LibVirt and VM management on Linux
  • Software Defined Networking (SDN) - Understanding of overlay networks network policies and SDN controllers in Kubernetes and VM environments
  • Golang proficiency - Ability to read debug and contribute to Kubernetes operator code and controllers
  • CI / CD automation - Strong scripting and automation skills (Bash Python Ansible Terraform) and experience building infrastructure-as-code pipelines
  • GitOps practices - Experience with declarative infrastructure management and Git-based workflows
  • InfiniBand networking experience
  • Cluster API framework experience
  • Nvidia GPU infrastructure (NVLink)
  • SmartNIC experience (Nvidia Bluefield or similar)
  • OVN (Open Virtual Network) or other SDN platforms
  • Metal3 or similar baremetal provisioning tools
Benefits
  • Work with an established Silicon Valley leader in the cloud infrastructure industry;
  • Work with exceptionally passionate talented and engaging colleagues helping Fortune 500 and Global 2000 customers implement next-generation cloud technologies;
  • Be a part of cutting-edge open-source innovation;
  • Thrive in the high-energy environment of a young company where openness collaboration risk-taking and continuous growth are valued;
  • Professional development and training;
  • Attend conferences and working groups;
  • Company outings happy hours hackathons and tech talks;
  • Receive a competitive compensation package with a strong benefits plan.
Other Details
  • Remote Work: Yes
  • Employment Type: Full-time
  • Key Skills:
  • Experience: years
  • Vacancy: 1
Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.