Enable job alerts via email!

Principal Software Engineer - Cloud AI Infrastructure

Futureshaper.com

Markham

On-site

CAD 90,000 - 150,000

Full time

27 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Principal Engineer to join their innovative team in Canada. This role focuses on integrating AI frameworks with cloud infrastructure to enhance performance and reliability of AI services. You will collaborate with experts to develop prototypes and validate optimization strategies, ensuring cutting-edge solutions are delivered. The ideal candidate will possess strong software development skills, particularly in AI infrastructure, and be proficient in programming languages such as Golang or Rust. Join a dynamic environment that fosters growth and innovation in cloud technologies.

Qualifications

  • 5+ years of software development experience with AI infrastructure focus.
  • Proficiency in Golang or Rust with a strong coding ability.
  • In-depth understanding of AI technologies and cloud platforms.

Responsibilities

  • Integrate AI frameworks with cloud infrastructure for optimization.
  • Collaborate on prototypes and validate optimization strategies.
  • Support product development considering current constraints.

Skills

Software Development
AI Infrastructure
Golang
Rust
Kubernetes
Ray
Cloud Services (AWS, Azure)
Problem-Solving

Education

Master's Degree in Computer Science
Ph.D. in Engineering

Tools

AI Workload Profiling Tools

Job description

Huawei Canada has an immediate permanent opening for a Principal Engineer.

About the team:

Established in 2014, the Distributed Scheduling and Data Engine Lab is Huawei Cloud's technical innovation center in Canada. The lab focuses on researching and developing advanced cloud technologies, supporting the productization and iterative optimization of its technical achievements. Current research areas include cloud native databases, infrastructure resource scheduling and prediction, cloud-native middleware, media engines, and user experience studies. The lab fosters a robust technical environment, allowing collaboration with industry experts to create a highly competitive cloud platform. Our team has an immediate permanent opening for a Principal Software Engineer.

About the job:

  • Integrate AI frameworks with cloud infrastructure to optimize end-to-end architecture for AI inference and fine-tuning scenarios. Focus on improving the observability, reliability, and performance of AI services.

  • Collaborate with team members to design and develop concept prototypes. Conduct validation of optimization strategies to ensure effectiveness.

  • Work closely with the product team to support the development of prototypes, taking into account the constraints and requirements of the product's current status.

Requirements

About the ideal candidate:

  • 5 years of software development experience, with a minimum of 2 years of experience in AI infrastructure-related platform R&D for fine-tuning or inference, including but not limited to AI workload profiling tools development, vLLM or SGLang development, infrastructure level troubleshooting and root cause analysis.

  • Proficiency in Golang or Rust. Must be able to write clean, efficient, and high-quality code from scratch.

  • In-depth understanding of AI technologies and familiarity with the module interactions involved in AI model training and inference.

  • Proficient in Kubernetes or Ray, with practical experience in developing services based on these platforms.

  • Strong understanding of cloud services and platforms such as AWS and Azure.

  • Highly analytical, with strong problem-solving skills and the ability to address complex technical challenges effectively.

  • Self-driven, with a proven ability to learn quickly and take initiative.

  • Master's or Ph.D. degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Lead Software Engineer - Distributed Data Systems

Huawei Technologies Canada Co., Ltd.

Markham

On-site

CAD 120,000 - 160,000

Yesterday
Be an early applicant

Lead Software Engineer - Distributed Data Systems

Huawei

Markham

On-site

CAD 90,000 - 150,000

9 days ago

Oracle Fusion Cloud Supply Chain & Manufacturing Associate Director (Remote)

NTT DATA, Inc.

Toronto

Remote

CAD 90,000 - 150,000

15 days ago

Salesforce Product Owner - Order to Cash (O2C) - Remote Canada Position

NTT DATA North America

Toronto

Remote

CAD 80,000 - 120,000

15 days ago

Engineering Manager, Auth0 Tenant Protection Team

Okta, Inc.

Toronto

Remote

CAD 90,000 - 150,000

7 days ago
Be an early applicant

Sr SW Engineer, AI

HRB

Remote

CAD 80,000 - 120,000

4 days ago
Be an early applicant

Principal Software Engineer - Cloud AI Infrastructure

Huawei Canada

Markham

On-site

CAD 80,000 - 140,000

26 days ago

Senior Engineer, DevOps Platform - Cloud Software

NVIDIA Corporation

Remote

CAD 112,000 - 234,000

5 days ago
Be an early applicant

Salesforce Product Owner - Lead to Order - (L2O) - Remote Canada Position

Applicable Limited

Remote

CAD 80,000 - 110,000

7 days ago
Be an early applicant