Enable job alerts via email!

Principal Software Engineer - Cloud AI Infrastructure

Huawei Technologies Canada Co., Ltd.

Markham

On-site

CAD 100,000 - 130,000

Full time

2 days ago
Be an early applicant

Job summary

A leading technology company in York Region, Canada is seeking a Principal Engineer. The successful candidate will integrate AI frameworks with cloud infrastructure, support prototyping efforts, and ensure system performance. Required qualifications include extensive software development experience, proficiency in Golang or Rust, and a strong understanding of AI technologies. This role is essential for enhancing AI services and optimizing cloud solutions.

Qualifications

  • 5 years of software development experience, with 2 years in AI infrastructure R&D.
  • Familiarity with AI workload profiling tools and troubleshooting.
  • Strong understanding of AI technologies and model training.

Responsibilities

  • Integrate AI frameworks with cloud infrastructure to optimize architecture.
  • Collaborate on concept prototypes and validation of optimization strategies.
  • Support the product team in developing prototypes.

Skills

Software development
AI infrastructure
Proficiency in Golang or Rust
Kubernetes or Ray
Cloud services (AWS, Azure)
Analytical skills
Problem-solving skills
Learning ability

Education

Master's or Ph.D. in Computer Science/Engineering

Job description

Huawei Canada has an immediate permanent opening for a Principal Engineer.

About the team:

Established in 2014, the Distributed Scheduling and Data Engine Lab is Huawei Cloud's technical innovation center in Canada. The lab focuses on researching and developing advanced cloud technologies, supporting the productization and iterative optimization of its technical achievements. Current research areas include cloud native databases, infrastructure resource scheduling and prediction, cloud-native middleware, media engines, and user experience studies. The lab fosters a robust technical environment, allowing collaboration with industry experts to create a highly competitive cloud platform. Our team has an immediate permanent opening for a Principal Software Engineer.

About the job:

  • Integrate AI frameworks with cloud infrastructure to optimize end-to-end architecture for AI inference and fine-tuning scenarios. Focus on improving the observability, reliability, and performance of AI services.

  • Collaborate with team members to design and develop concept prototypes. Conduct validation of optimization strategies to ensure effectiveness.

  • Work closely with the product team to support the development of prototypes, taking into account the constraints and requirements of the product's current status.


About the ideal candidate:

  • 5 years of software development experience, with a minimum of 2 years of experience in AI infrastructure-related platform R&D for fine-tuning or inference, including but not limited to AI workload profiling tools development, vLLM or SGLang development, infrastructure level troubleshooting and root cause analysis.

  • Proficiency in Golang or Rust. Must be able to write clean, efficient, and high-quality code from scratch.

  • In-depth understanding of AI technologies and familiarity with the module interactions involved in AI model training, inference framework and storage system.

  • Proficient in Kubernetes or Ray, with practical experience in developing services based on these platforms.

  • Strong understanding of cloud services and platforms such as AWS and Azure.

  • Highly analytical, with strong problem-solving skills and the ability to address complex technical challenges effectively.

  • Self-driven, with a proven ability to learn quickly and take initiative.

  • Master's or Ph.D. degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs