Enable job alerts via email!

Staff Site Reliability Engineer

ClickUp

Vancouver

On-site

CAD 90,000 - 120,000

Full time

16 days ago

Job summary

A leading productivity software company in Vancouver is seeking a Site Reliability Engineer (SRE) to design and build highly reliable systems and partner closely with engineering teams. The ideal candidate will have 4–6 years of experience with AWS and Kubernetes, strong problem-solving skills, and excellent communication abilities. The role includes leading monitoring efforts and improving site reliability practices in a dynamic environment.

Qualifications

  • 4–6 years of experience with AWS ecosystem.
  • Experience working with Kubernetes.
  • Strong self-starter with a problem-solving attitude.
  • Excellent written and oral communication skills.

Responsibilities

  • Lead designing and building systems for performance and reliability.
  • Partner with engineering teams on design decisions.
  • Champion monitoring infrastructure and improve site reliability posture.
  • Respond to and troubleshoot downtime events.

Skills

Experience with Amazon Web Services ecosystem (EC2, ECS, VPC, Redis, RDS, ALB, ECR)
Kubernetes
Production-critical infrastructures management
DevOps mindset
SRE best practices
IaC (CDK, Terraform)
CI/CD (GitHub Actions, ArgoCD)
Containerization (Docker)
Network firewall and security best practices
Self-healing automation and monitoring tools (DataDog, CloudWatch)
Relational databases (preferably PostgreSQL)
Linux-based EC2 instances management
Excellent interpersonal and communication skills
Job description
Overview

At ClickUp we’re not just building software – we’re architecting the future of work. By converging tasks, docs, chat, calendar and enterprise search into a single AI‑powered workspace, we’re empowering millions of teams to break free from silos, reclaim their time and unlock new levels of productivity. As a Site Reliability Engineer (SRE) at ClickUp you’ll learn, pioneer and shape AI in ways that influence not only our product but the future of work itself.

Responsibilities
  • Lead designing and building systems for maximum performance, reliability and scalability.
  • Serve as a lead in partnership with engineering teams on product design decisions and troubleshooting.
  • Increase general stability, observability and metrics surrounding uptime and reliability.
  • Champion our monitoring infrastructure.
  • Implement and improve our general site reliability posture (error and downtime budgets, MTTD and MTTR improvements, alerting and notifications, minimizing customer impact from incidents).
  • Respond to and troubleshoot downtime events while actively developing safeguards to prevent them.
  • Participate in brainstorming sessions with the engineering team and contribute ideas to our technology and algorithms.
  • Mentor members of the team to improve overall excellence.
Qualifications
  • 4–6 years of experience with the Amazon Web Services ecosystem (EC2, ECS, VPC, Redis, RDS, ALB, ECR).
  • Experience working with Kubernetes.
  • Experience in managing production‑critical infrastructures and a DevOps mindset.
  • Familiar with SRE best practices and procedures.
  • Experience with IaC (CDK, Terraform) and CI/CD (GitHub Actions, ArgoCD).
  • Familiar with Containerization (Docker).
  • Knowledgeable in network firewall and security best practices.
  • Experience with self‑healing automation and monitoring tools (DataDog, CloudWatch).
  • Knowledge of relational databases, preferably PostgreSQL (not mandatory).
  • A strong self‑starter, operationally focused and a problem‑solver.
  • Excellent interpersonal, written and oral communication skills.
  • Experience with application security testing is a plus (not mandatory).
  • Experience managing Linux‑based EC2 instances.
Benefits

We hire based on ambition, grit and a passion for improving the way people work. If you think ClickUp is the company for you we encourage you to apply!

ClickUp is an Equal Opportunity Employer and qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or national origin.

ClickUp collects and processes personal data in accordance with applicable data protection laws.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.