Enable job alerts via email!

Senior AI Infrastructure Engineer

T-Mobile

Frisco (TX)

On-site

USD 113,000 - 205,000

Full time

6 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a talented engineer to drive the future of on-premises infrastructure. This role involves designing and maintaining high-performance computing environments optimized for AI and machine learning workloads. You will work with cutting-edge technologies, including NVIDIA DGX servers and container orchestration tools, to enhance performance and reliability. Join a forward-thinking team dedicated to empowering customers through innovative technology solutions, and enjoy a competitive compensation package that includes stock options and comprehensive benefits. If you have a passion for technology and a desire to make a difference, this is the opportunity for you.

Benefits

Annual stock grant
Employee stock purchase plan
401(k)
Free money coaches
Medical benefits
Dental benefits
Vision benefits
Paid time off
Parental leave
Tuition assistance

Qualifications

  • 5+ years of technical engineering experience in multiple technology areas.
  • Expert understanding of AI/ML infrastructure components and GPU-based systems.
  • Hands-on experience with container orchestration and GPU workload management.

Responsibilities

  • Design, deploy, and maintain high-performance computing environments optimized for AI workloads.
  • Build scalable infrastructure and provide self-service tooling.
  • Collaborate with teams to support AI-driven applications.

Skills

AI/ML infrastructure components
GPU-based systems
Linux/UNIX
Bash scripting
Python scripting
Ansible
Terraform
Kubernetes
Docker
Networking solutions

Education

Bachelor's degree in computer science
Experience in lieu of degree

Tools

NVIDIA DGX servers
Prometheus
Grafana
Git
Jenkins
PyTorch
TensorFlow

Job description

At T-Mobile, we invest in YOU! Our Total Rewards Package ensures that employees get the same big love we give our customers. All team members receive a competitive base salary and compensation package - this is Total Rewards. Employees enjoy multiple wealth-building opportunities through our annual stock grant, employee stock purchase plan, 401(k), and access to free, year-round money coaches. That’s how we’re UNSTOPPABLE for our employees!

Job Overview

Do you have a desire to help drive the future direction of T-Mobile’s On-Premises Infrastructure? Is there a passion within you for helping IT customers and developers achieve their technology goals? If so, this could be the position for you! T-Mobile’s Platform Delivery & Automation team is looking for the ideal candidate to join our team of passionate change agents, helping to ensure our developers are data-informed and AI enabled through innovative technology infrastructure solutions. If developing private cloud solutions, hands-on engineering, and customer focus is your thing, then apply!

Our Mission

Our mission is to accelerate the delivery of on-premises and private cloud solutions that empower our customers to self-serve on scalable, secure, cost-effective infrastructure faster and on-demand. We support and deliver complex virtualized and bare-metal workloads. We challenge the status quo and everything we do is focused on value-add to our customer base.

Role Responsibilities
  1. Design, deploy, and maintain high-performance computing environments optimized for AI and machine learning workloads.
  2. Build scalable infrastructure, ensure efficient workload management, and provide self-service and on-demand tooling.
  3. Collaborate with teams to support AI-driven applications and drive operational excellence.
  4. Work with diverse hardware and software solutions to enhance performance and reliability of on-premises AI/ML infrastructure.
Minimum Requirements
  • 5+ years of technical engineering experience in multiple technology areas.
  • Expert understanding of AI/ML infrastructure components, GPU-based systems, preferably in high-availability, large-scale environments.
  • Hands-on experience with NVIDIA DGX servers, BasePOD architectures, and advanced GPU technologies.
  • Proficiency in Linux/UNIX, scripting/automation tools (Bash, Python, Ansible, Terraform).
  • Understanding of AI infrastructure security best practices.
  • Experience with container orchestration (Kubernetes, Docker) and GPU workload management tools.
  • Strong knowledge of networking (InfiniBand/Ethernet) and storage solutions in AI/ML contexts.
Nice to Have
  • Understanding of CI/CD pipelines (Git, Artifactory, Jenkins).
  • Experience with AI/ML pipelines (PyTorch, TensorFlow, RAPIDS AI).
  • Experience with monitoring tools (Prometheus, Grafana, NVIDIA DGCM).
Education
  • Bachelor’s degree in computer science, information systems, engineering, or related field.
  • Experience in lieu of degree may be considered.
Additional Requirements
  • At least 18 years of age.
  • Legally authorized to work in the United States.
  • Travel required: Yes.
  • DOT Regulated Position: No.
  • Safety Sensitive Position: No.
Compensation & Benefits

Base pay range: $113,600 - $205,000. Corporate bonus target: 15%. The actual pay will depend on location, qualifications, and experience. Employees are eligible for bonuses, benefits include medical, dental, vision, 401(k), stock plans, paid time off, parental leave, family benefits, tuition assistance, disability, insurance options, discounts, and more. For details, visit www.t-mobilebenefits.com.

Equal Opportunity & Accommodation

T-Mobile is an Equal Opportunity Employer. Discrimination or harassment based on protected characteristics is not tolerated. For accommodation requests, contact ApplicantAccommodation@t-mobile.com or call 1-844-873-9500.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior AI Infrastructure Engineer - DGX Cloud

NVIDIA

Remote

USD 144,000 - 271,000

11 days ago

Senior Water Infrastructure Engineer

Ramboll US Corporation

Virginia

Remote

USD 90,000 - 150,000

4 days ago
Be an early applicant

Senior Infrastructure Engineer

Tyler Technologies

Town of Texas

Remote

USD 85,000 - 150,000

5 days ago
Be an early applicant

Senior AI Infrastructure Engineer

CRM Hike

San Francisco

Remote

USD 160,000 - 230,000

30+ days ago

Senior Microsoft Cloud Infrastructure Engineer

Kraft & Kennedy, Inc.

Town of Texas

Remote

USD 150,000 - 200,000

Yesterday
Be an early applicant

Senior Security Compliance & Infrastructure Engineer

Tekgence Inc

Remote

USD 80,000 - 130,000

Yesterday
Be an early applicant

Senior Microsoft Cloud Infrastructure Engineer

Kraft & Kennedy, Inc.

Pennsylvania

Remote

USD 150,000 - 200,000

Yesterday
Be an early applicant

Senior Microsoft Cloud Infrastructure Engineer

Kraft & Kennedy, Inc.

Orlando

Remote

USD 150,000 - 200,000

2 days ago
Be an early applicant

Senior/Staff Engineer, Infrastructure (DevOps)

Pryon

Washington

Remote

USD 180,000 - 215,000

Yesterday
Be an early applicant