Senior Site Reliability Engineer

Cerebras

Vancouver

On-site

CAD 120,000 - 160,000

Full time

5 days ago

Job summary

A leading company in AI technology is seeking a Senior Site Reliability Engineer/DevOps focused on building and maintaining robust infrastructure. The ideal candidate will have a proven track record with AWS, automation, and security best practices, ensuring optimal performance and reliability of their SaaS platform. This role offers the opportunity to lead key infrastructure projects in a dynamic, innovative environment.

Qualifications

  • 6+ years in SRE, DevOps, or infrastructure roles.
  • Extensive experience with AWS services.
  • Strong skills in Terraform, Docker, and CI/CD systems.

Responsibilities

  • Design and maintain secure and reliable infrastructure.
  • Architect a monitoring and alerting system.
  • Drive automation and CI/CD improvements.

Skills

AWS
Terraform
Docker
Python
Bash
New Relic
Prometheus
Grafana
PagerDuty
SQL Databases
NoSQL Databases

Job description

Responsibilities

We’re seeking a Senior Site Reliability/DevOps Engineer who is passionate about building the best infrastructure and maintaining the health of our systems.

  • Design and maintain scalable, secure, and reliable infrastructure to support Regie.ai's SaaS platform and AI/data workloads.
  • Architect a unified monitoring and alerting system that lets engineering teams continuously monitor and improve system availability, reliability, and performance.
  • Drive infrastructure automation and CI/CD improvements to reduce operational overhead and deployment risk.
  • Optimize infrastructure costs, support compliance efforts (e.g., SOC 2), and enforce security best practices.

Required Skills & Qualifications

  • 6+ years of experience in SRE, DevOps, or infrastructure engineering roles.
  • Extensive hands-on experience with AWS and its core services.
  • Strong experience with Terraform (or similar IaC tools), Docker and containerization, and modern CI/CD systems.
  • Proficient in scripting or programming languages such as Python and Bash.
  • Deep experience with monitoring and alerting tools (e.g., New Relic, Prometheus, Grafana, PagerDuty).
  • Strong hands-on experience with both SQL and NoSQL databases (e.g., MongoDB, PostgreSQL, MySQL).
  • Proven track record of designing and maintaining production-grade infrastructure with high availability and low latency.
  • Excellent troubleshooting abilities, along with strong communication and collaboration skills.
  • Solid understanding of cloud security and compliance best practices, including SOC 2 readiness and audit support.

Similar jobs

Senior Site Reliability Engineer

Jobber

Remote

CAD 145,000 - 198,000

Full time

10 days ago

Senior Site Reliability Engineer

Ampcus Incorporated

Toronto

Remote

CAD 110,000 - 150,000

Full time

8 days ago

Senior Site Reliability Engineer

Circle

Vancouver

On-site

USD 147,000 - 195,000

Full time

10 days ago

Senior Machine Safety Engineer

Jobot

Toronto

Remote

CAD 110,000 - 140,000

Full time

Today

Senior Site Reliability Engineer

Canonical

Mississauga

Remote

CAD 120,000 - 180,000

Full time

30+ days ago

Senior Site Reliability Engineer

Canonical

Gatineau

Remote

CAD 90,000 - 130,000

Full time

30+ days ago

Senior Site Reliability Engineer

Canonical

Calgary

Remote

CAD 90,000 - 130,000

Full time

30+ days ago

Senior Site Reliability Engineer

Canonical

Winnipeg

Remote

CAD 90,000 - 130,000

Full time

30+ days ago

Senior Site Reliability Engineer

Canonical

Regina

Remote

CAD 90,000 - 130,000

Full time

30+ days ago