Job Search and Career Advice Platform

Enable job alerts via email!

Senior Site Reliability Engineer

Cerebras

Vancouver

On-site

CAD 100,000 - 130,000

Full time

30+ days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A tech company in Metro Vancouver is seeking a Senior Site Reliability Engineer/DevOps to build and maintain resilient infrastructure for SaaS and AI workloads. The ideal candidate has 6+ years of experience, strong AWS skills, and expertise in CI/CD automation and monitoring tools. Excellent collaboration skills are essential. This position offers the chance to work in a dynamic environment focused on scalability and security.

Qualifications

  • 6+ years of experience in SRE, DevOps, or infrastructure engineering roles.
  • Proven track record of designing production-grade infrastructure.
  • Solid understanding of cloud security and compliance best practices.

Responsibilities

  • Design and maintain scalable, secure, and reliable infrastructure.
  • Architect a unified monitoring and alerting system.
  • Drive infrastructure automation and CI/CD improvements.
  • Optimize infrastructure costs and enforce security best practices.

Skills

AWS core services
Terraform
Docker
Python
Bash
Monitoring tools
SQL databases
NoSQL databases
Troubleshooting
Collaboration

Tools

New Relic
Prometheus
Grafana
PagerDuty
Job description
Responsibilities

We’re seeking a senior Site Reliability Engineer/DevOps who is passionate about building the best infrastructure and maintaining the health of the systems.

  • Design and maintain scalable, secure, and reliable infrastructure to support Regie.ai's SaaS platform and AI/data workloads.
  • Architect a unified monitoring and alerting system for engineering teams to continuously monitor and improve system availability, reliability, performance.
  • Drive infrastructure automation and CI/CD improvements to reduce operational overhead and deployment risk.
  • Optimize infrastructure costs, support compliance efforts (e.g., SOC 2), and enforce security best practices.
Required Skills & Qualifications
  • 6+ years of experience in SRE, DevOps, or infrastructure engineering roles.
  • Extensive hands-on experience with AWS and its core services.
  • Strong experience with Terraform (or similar IaC tools), Docker and containerization, and modern CI/CD systems.
  • Proficient in scripting or programming languages such as Python and Bash.
  • Deep experience with monitoring and alerting tools (e.g., New Relic, Prometheus, Grafana, PagerDuty).
  • Strong hands-on experience with both SQL and NoSQL databases (e.g., MongoDB, PostgreSQL, MySQL).
  • Proven track record of designing and maintaining production-grade infrastructure with high availability and low latency.
  • Excellent troubleshooting abilities, along with strong communication and collaboration skills.
  • Solid understanding of cloud security and compliance best practices, including SOC 2 readiness and audit support.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.