Enable job alerts via email!

Lead GPU Benchmarking Engineer

Silicon Data

United States

Remote

USD 100,000 - 140,000

Full time

Today
Be an early applicant

Job summary

A cutting-edge technology firm is seeking a highly skilled lead GPU benchmarking Engineer to design and execute rigorous testing protocols. The ideal candidate will have extensive experience with GPU hardware, benchmarking tools, and cloud computing, along with strong programming skills. This role offers competitive salary and benefits, along with career growth opportunities in an innovative and collaborative environment.

Benefits

Competitive salary and benefits package
Opportunity to work with cutting-edge technology
Career growth and development opportunities

Qualifications

  • Proven experience in compute benchmarking, stress testing, and performance analysis.
  • Experience with using driver APIs to obtain raw data.
  • Familiarity with statistical analysis tools.

Responsibilities

  • Develop and implement test plans to evaluate GPUs.
  • Monitor key performance metrics during testing.
  • Mentor and lead a team of engineers.

Skills

GPU performance analysis
Benchmarking tools proficiency
Leadership and mentorship
Programming skills (Python, C/C++)
Analytical problem-solving
Cloud computing familiarity

Education

Bachelor's degree in Computer Science or Electrical Engineering

Tools

3DMark
CUDA
OpenCL benchmarks
FurMark
MSI Kombustor
SPECviewperf
Unigine Heaven
Superposition Benchmark
Job description

Location: Remote / NYC / Bay Area / Chicago

About Us: Silicon Data is on a mission to revolutionize compute markets through data transparency by developing cutting-edge data products that provide essential intelligence and efficiency.

Position Summary: We are seeking a highly skilled and motivated lead GPU benchmarking Engineer to join our team. The ideal candidate will have extensive hands-on experience with GPU hardware, benchmarking tools, performance analysis, programming, and automation. This role involves designing and executing rigorous testing protocols to assess the reliability of GPUs, as well as leading the development and implementation of comprehensive GPU benchmarking frameworks. The candidate should also have the potential to lead and operate at a larger scope, with an eye towards leadership roles such as Chief Technology Officer (CTO).

Responsibilities
  • Test Design and Execution: Develop and implement comprehensive test plans to evaluate GPUs under prolonged heavy workloads using stress testing software.
  • Monitor key metrics such as frame rates, temperature, peak and average power consumption, Peak Flops, Sustained Flops, cross-node bandwidth, and stability over time.
  • Benchmark GPUs using industry-standard benchmarking tools to measure and analyze performance.
  • Provide leadership and mentorship to a team of engineers, fostering a culture of innovation and technical excellence.
  • Data Collection and Analysis: Conduct baseline tests on new GPUs to establish initial performance benchmarks.
  • Track performance metrics over time to detect and analyze any degradation.
  • Utilize GPU driver APIs to collect low-level telemetry during various operational conditions.
  • Performance Comparison and Validation: Compare performance metrics across different cluster configurations to identify comparative strengths and weaknesses.
  • Perform statistical analyses to ensure the validity and reliability of the test results.
  • Repeat tests to ensure consistency and accuracy of data.
  • Reporting and Documentation: Prepare detailed reports outlining test setups, methodologies, and data-driven conclusions.
  • Clearly communicate findings, insights, and recommendations to team members and stakeholders.
  • Cloud Computing Integration: Configure, deploy, and maintain cloud infrastructure for automation, orchestration, and integration.
  • Utilize cloud computing resources to create scalable and efficient testing environments.
  • Optimize cloud platform usage for benchmarking and data analysis tasks.
Required Qualifications
  • Bachelor's degree in Computer Science, Electrical Engineering, or a related field.
  • Proven experience in compute benchmarking, stress testing, and performance analysis.
  • Proficiency with benchmarking tools such as 3DMark, CUDA, OpenCL benchmarks, FurMark, MSI Kombustor, SPECviewperf, Unigine Heaven, and Superposition Benchmark.
  • Strong understanding of GPU clusters architectures and relevant performance metrics.
  • Experience with using the driver APIs to get the raw data directly
  • Strong programming and scripting skills, including experience with Python, C/C++, Bash, or PowerShell.
  • Familiarity with cloud computing platforms and environments.
  • Excellent analytical, problem-solving, and communication skills.
Preferred Qualifications
  • Experience with statistical analysis tools and techniques.
  • Familiarity with Tensor, GPU cluster testing methodologies, and large-scale data analysis.
  • Demonstrated leadership experience or potential to grow into a Chief Technology Officer (CTO) role.
What We Offer
  • Competitive salary and benefits package.
  • Opportunity to work with cutting-edge technology and innovative projects.
  • A collaborative and dynamic work environment.
  • Career growth and development opportunities.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.