Enable job alerts via email!

Hardware Systems Engineer, AI NPI

Meta

Menlo Park (CA)

On-site

USD 163,000 - 225,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a dynamic team at an innovative company where your engineering expertise will drive the validation and deployment of cutting-edge AI and HPC hardware systems. In this role, you will engage in hands-on participation, exploring new use cases and troubleshooting complex failures. With a focus on automation and process improvement, you will work closely with cross-functional teams to enhance test methodologies and ensure successful product introductions. This is a fantastic opportunity to contribute to transformative technologies that shape the future of social connection and immersive experiences. If you're passionate about hardware systems and eager to make an impact, this role is for you!

Benefits

Bonus
Equity
Comprehensive benefits package

Qualifications

  • 12+ years in software, firmware, or hardware engineering.
  • Experience in ASIC development, board level debug, and system validation.

Responsibilities

  • Drive end-to-end system validation strategy for AI/HPC hardware systems.
  • Lead validation and deployment of hardware systems in large scale.

Skills

Python
C/C++
Rust
Linux
Debugging
System Validation
Test Specification Development
High-Performance Computing (HPC)

Education

Bachelor's degree in Computer Science or related field

Tools

JTAG
GDB
Trace32
Oscilloscopes
Protocol Analyzers
Traffic Generators

Job description

Hardware Systems Engineers in RTP work closely with Hardware/Software co-design teams, hardware designers, networking teams, system manufacturers, component vendors, capacity engineering, production engineering, production services, and data center operations teams to enable new systems that will be deployed in our production data centers. Ramping to production and solving the datacenter scaling and deployment challenges requires us to take a systems based approach to the new product introduction (NPI) phase.

Hardware Systems Engineer, AI NPI Responsibilities
  1. Drive and execute end-to-end system validation strategy (hardware and software), with a focus on various AI/HPC hardware systems in datacenter applications.
  2. Lead the bring-up, validation, and deployment of cutting-edge hardware systems in large scale deployment with active hands-on participation.
  3. Explore new use cases with customer teams and identify related test methodologies/test cases accordingly.
  4. Investigate and troubleshoot complex failures potentially related to Hardware systems with cross-function teams, which may involve different stacks like silicon, firmware, software, etc.
  5. Triage failures and continue root causing while driving project development work forward.
  6. Identify gaps and opportunities to improve test process and test methodologies across the NPI space.
  7. Guide automation efforts and data analysis for NPI projects through engagement with related cross-function teams.
  8. Communicate project progress and assessments to related internal and external teams.
Minimum Qualifications
  1. Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
  2. 12+ years of experience in hands-on SW, FW or HW engineering to build any of the following products (AI Silicon, GPUs, TPUs, Autonomous cars, AI servers).
  3. 7+ years of work experience in one or more domains such as: ASIC development (Silicon design, bringup, characterization, validation), board level debug, firmware validation, system validation.
  4. 3+ years of experience with leading Silicon or System troubleshooting and debugging.
  5. 3+ years of experience in developing test specifications, procedures, and debug guides for test solutions.
  6. 5+ years of experience with one or more of the following modules/domains: PCIe, NVlink, Networking, Flash, Memory, CPU, GPU, TPU, DRAM (DDR4/5 or HBM), AI silicon/AI accelerators.
  7. 3+ years of experience in Linux environment.
  8. 3+ years of experience in Python, C/C++, Rust and/or similar languages (data structures, algorithms, and OOP).
Preferred Qualifications
  1. Proficiency in High-Performance Computing (HPC) or AI system architecture at rack level and at scale.
  2. 10+ years of hands-on experience in software, firmware, and hardware engineering to develop systems/products for datacenter applications such as video processing, AI/ML, and networking.
  3. 7+ years of experience in ASIC development/validation, including silicon bring-up, emulation, characterization or system-level testing.
  4. 7+ years of Experience in GPU/TPU related system bring-up, testing and debugging.
  5. Proven history to optimize software algorithms for performance and scalability.
  6. Proven history in embedded systems architecture, components, and test development with a focus on automation.
  7. Familiarity with lab debugging tools such as oscilloscopes, protocol analyzers, and traffic generators.
  8. 7+ years of experience with debugging tools for SoCs (e.g., JTAG, GDB, Trace32) and knowledge of common bus protocols such as I2C, SPI, USB, and PCIe.
  9. Proficiency in Linux environment and server system management.
  10. Demonstrated history to explore and author new test plans based on new test methodologies.
  11. 7+ years of experience integrating lab tools for automated workflows and managing large-scale deployments.
  12. 7+ years of experience in using continuous integration and version control tools for system development and testing.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Hardware Systems Engineer, AI Systems Menlo Park, CA • Infrastructure • Hardware Menlo Park, CA[...]

Meta

Menlo Park

On-site

USD 132,000 - 191,000

18 days ago

Senior Applied AI Engineer

Horizon3.ai

San Francisco

Remote

USD 150,000 - 220,000

Yesterday
Be an early applicant

Software Engineer, AI Infrastructure

Figma

San Francisco

Remote

USD 149,000 - 350,000

4 days ago
Be an early applicant

AI engineer

writer.com

San Francisco

Remote

USD 120,000 - 180,000

6 days ago
Be an early applicant

Hardware Systems Engineer, AI Systems

The Rundown AI, Inc.

Menlo Park

On-site

USD 132,000 - 191,000

18 days ago

Applied AI Software Engineer

Canvas Medical

San Francisco

Remote

USD 120,000 - 180,000

6 days ago
Be an early applicant

Lead AI ML Engineer - Remote

Davita Inc.

San Francisco

Remote

USD 106,000 - 195,000

6 days ago
Be an early applicant

Sr Computer Vision Engineer - Remote - Video Processing, Algos

CyberCoders

San Francisco

Remote

USD 120,000 - 200,000

3 days ago
Be an early applicant

Advanced Supplier Quality Engineer

Lensa

San Francisco

Remote

USD 108,000 - 181,000

3 days ago
Be an early applicant