Site Reliability Engineer

Talentlab

Toronto

On-site

CAD 90,000 - 120,000

Full time

30+ days ago

Job summary

A leading AI start-up in Toronto is seeking a Site Reliability Engineer to join its growing team. This full-time, permanent position supports a company building a new processor optimized for deep learning. Ideal candidates will have a strong background in Linux and DevOps tools and will contribute to the company's infrastructure and operational efficiency.

Qualifications

  • 5 years of experience in Linux administration required.
  • Knowledge of DevOps tools and system/network management.
  • Bachelor's degree in IT or related field.

Responsibilities

  • Design and implement improvements to the network and systems.
  • Conduct Linux server administration and datacenter management.
  • Create architecture and technical documentation.

Skills

Linux administration
DevOps tools
System management
Network management

Education

Bachelor's Degree in Information Technology

Tools

Docker
Kubernetes
CI/CD tools

Job description

Site Reliability Engineer

Location: Toronto, ON

We've partnered with an up-and-coming AI start-up to assist in building their thriving team. Equipped with their latest round of funding, they're building a new processor optimized for deep learning. We're assisting in the search for a Site Reliability Engineer to join their team in a full-time, permanent role. This is a great opportunity to become one of the first members of a growing technical team working on cutting-edge technology.

The Role:
  • Recommend, design, and deliver improvements to the network and systems to meet changing demands and new technology
  • Design and implement architecture and write technical documentation
  • Linux server administration (OS installs, standard OS image creation, backup, user login, hardware malfunctions and upgrades, etc.)
  • IT infrastructure maintenance (network switches, login servers, web servers, VMs), including software updates, hardware malfunctions, and upgrades
  • Datacenter management (expansion planning, power/cooling, ISP deployment, storage, backup)

The Requirements:
  • Bachelor's Degree in Information Technology or a related field
  • Minimum 5 years of experience in Linux administration
  • Knowledge of DevOps tools and practices (Docker, Kubernetes, CI/CD tools, databases, scripting, machine maintenance, and monitoring)
  • System and network management experience

How to apply?

All interested and qualified applicants should apply directly on our website at www.talentlab.com. Although we thank all interested applicants, only those under consideration will be contacted.
