
A technology recruitment firm is seeking a Site Reliability Engineer in Richmond Hill, Ontario. The ideal candidate will have over 5 years of experience with cloud infrastructure and strong skills in Python, Kubernetes, and AWS services. You will lead incident response and develop operational tools to improve system efficiency. This full-time role includes an on-call rotation and offers competitive compensation.
Location: Richmond Hill, Ontario
Our client’s platform engineering group operates with a Site Reliability Engineering (SRE) mindset, committed to delivering highly reliable, scalable, and performant systems across a public cloud infrastructure. The team specializes in enhancing system transparency, enabling deep diagnostics, and ensuring seamless collaboration between development and operations. Shared ownership, proactive problem‑solving, and continuous improvement are at the core of everything they do.
As a Site Reliability Engineer, you will be responsible for the design, development, deployment, and ongoing management and support of public cloud infrastructure. The candidate should have experience designing highly available, fault-tolerant, cloud-native enterprise solutions. The candidate should also have some background in development and familiarity with Kubernetes. The role requires experience working with development teams throughout the full development lifecycle to produce reliable, secure production infrastructure, and operating across the multiple environments of the SDLC.
This is a full-time position. Working hours are Monday through Friday, during normal business hours. The position also participates in an on-call rotation of 2 weeks as primary followed by 2 weeks as secondary, providing 24/7 support for the platform during those rotations. Typically, this amounts to 4 out of every 8 weeks on call.