Enable job alerts via email!
Boost your interview chances
Create a job specific, tailored resume for higher success rate.
An established industry player is seeking a Distinguished System Reliability Engineer to join their innovative team in Canada. This role involves managing system reliability across chips, networks, and hardware, ensuring stability and performance under high traffic conditions. The ideal candidate will possess a Master's or Ph.D. in relevant fields and have over 15 years of experience in system reliability management. You will work closely with cross-functional teams to address system failures and enhance reliability strategies using advanced analysis methods. This position offers a unique opportunity to contribute to cutting-edge semiconductor solutions while collaborating with global experts.
Huawei Canada has an immediate permanent opening for a Distinguished System Reliability Engineer.
About the team:
Initially founded in 1991 as Huawei's ASIC Design Center, the IC Lab is a leading global fabless semiconductor lab. This lab delivers trusted, cutting-edge semiconductor products and services for smart devices, contributing to smart home and mobility solutions. The local team in Canada specializes in semiconductors, and chipset solutions.
About the job:
System Chip Reliability Management and Control: Closely cooperate with chip development and deeply participate in chip reliability design based on application scenario requirements to ensure system reliability from the beginning. Participate in the chip reliability test plan. Analyze chip failure cases, identify potential design defects or process problems, and promote improvement. Establish a chip reliability warning mechanism to detect and resolve chip risks.
Network Reliability Management and Control: Design and optimize network architectures to maintain stability under high traffic. Develop redundancy strategies, monitor performance, and lead fault troubleshooting.
Hardware Reliability Management and Control: Oversee hardware reliability across the product lifecycle, from component selection to post-market analysis. Implement rigorous testing and maintain a fault database.
System Engineering Reliability Management and Control: Develop reliability strategies considering system architecture, software-hardware collaboration, and interface compatibility. Use FTA and FMEA for risk analysis.
System Reliability Problem Definition and Analysis: Rapidly diagnose system failures, coordinate cross-functional teams to resolve issues, and maintain a reliability knowledge base for future improvements.
About the ideal candidate:
Master’s or Ph.D. in Electronic Engineering, Computer Science, or Reliability Engineering.
15+ years in system reliability, with expertise in chip, network, and hardware reliability management.
Strong knowledge of chip reliability testing, network architecture optimization, and hardware fault analysis.
Proficiency in reliability analysis methods (FTA, FMEA) and related software tools.
Excellent communication skills, with experience presenting to executives and global R&D teams.