Infrastructure Architect – GPU Test Automation Farm

Advanced Micro Devices

Markham

On-site

CAD 80,000 - 100,000

Full time

Yesterday

Job summary

A leading semiconductor company is seeking a skilled Systems Deployment Architect in Markham, Ontario, to design and lead a large-scale GPU test automation farm. The ideal candidate will have deep technical expertise in infrastructure design, extensive hands-on experience, and knowledge of datacenter operations. Responsibilities include defining best practices, collaborating across teams, and mentoring engineers. A Bachelor's or Master's degree in a relevant field is required, along with proven skills in automation tools and cluster environments. This role offers impactful work in a collaborative culture.

Benefits

Innovative work environment
Comprehensive benefits package

Qualifications

  • Proven expertise in GPU or HPC cluster environments.
  • Expertise in Windows and Linux administration.
  • Experience with automation tools and scripting.

Responsibilities

  • Architect and design a distributed, large-scale GPU test automation farm.
  • Lead the deployment of infrastructure in datacenter-like environments.
  • Collaborate with cross-functional teams for seamless test workflows.

Skills

Infrastructure design expertise
Large compute farms
Automation systems
Operational discipline
Architectural judgment
Performance tuning

Education

Bachelor’s or Master’s degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

Tools

Ansible
Terraform
CI/CD pipelines
MaaS platforms

Job description

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.

Together, we advance your career.

THE ROLE

AMD is looking for a highly skilled and experienced systems deployment architect to design, plan, and lead the deployment of a large‑scale GPU test automation farm in a datacenter‑style environment. This individual will translate AMD’s test and validation vision into a robust, modular, and scalable infrastructure capable of supporting continuous integration and validation for next‑generation products.

THE PERSON

The ideal candidate combines deep technical expertise in infrastructure design with hands‑on experience building large compute farms and automation systems, and has a strong understanding of datacenter operational constraints. They demonstrate strong architectural judgment, operational discipline, and a practical understanding of the technologies that enable scalable infrastructure.

KEY RESPONSIBILITIES

  • Architect and design a distributed, large‑scale GPU test automation farm optimized for performance, scalability, and reliability.
  • Lead the deployment and operation of infrastructure in datacenter‑like environments, ensuring compliance with standards for power, cooling, networking, and management systems.
  • Define and enforce best practices for system configuration, monitoring, and fault tolerance to ensure high availability and performance.
  • Collaborate with cross‑functional teams (QA, IT, software, datacenter ops, and engineering) to deliver seamless test workflows and system integration.
  • Evaluate and implement technologies that improve deployment efficiency, system observability, and scalability (containerization, virtualization, orchestration, MaaS, etc.).
  • Mentor engineers in infrastructure design principles and contribute to the overall architectural vision of AMD’s GPU validation environment.

PREFERRED EXPERIENCE

  • Proven expertise in GPU or HPC cluster environments, including system provisioning, scheduling, and performance tuning.
  • Expert background in Windows and Linux administration, including automation tools and scripting.
  • Experience with automation frameworks (Ansible, Terraform, etc.) and CI/CD pipelines for infrastructure deployment.
  • Hands‑on experience with MaaS (Metal‑as‑a‑Service) platforms for large‑scale bare‑metal provisioning.
  • Knowledge of network boot (PXE, iPXE, UEFI) configuration and automation.
  • Experience building or integrating inventory health management systems, including real‑time monitoring of servers, network devices, and supporting services.
  • Skilled in space allocation and racking strategies in datacenter or lab environments.
  • Deep understanding of power planning for dense compute environments.
  • Experience with network design and topology optimization for high‑throughput data paths.

ACADEMIC CREDENTIALS

  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent.

LOCATION

Markham, Ontario, Canada

Benefits offered are described in AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee‑based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third‑party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
