Enable job alerts via email!

Senior Site Reliability Engineer (Kubernetes)

Supermetrics Oy

Canada

Remote

EUR 80,000 - 110,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm is seeking a Senior Site Reliability Engineer to enhance its Infrastructure team. This fully remote position involves mentoring on Kubernetes expertise, operating a platform for global SaaS products, and driving automation to meet SLAs. You'll write Terraform configurations, maintain tooling with Golang, and support incident responses, all while collaborating with a dynamic team. With a competitive compensation package, including equity and a personal learning budget, this role offers a fantastic opportunity for growth and impact in a forward-thinking environment. If you're passionate about reliability engineering and eager to tackle challenges, this is your chance to shine.

Benefits

Competitive compensation package
Home office allowance
Health care benefits
Leisure time insurance
Annual personal learning budget
Sports and wellbeing allowance

Qualifications

  • 4+ years in Site Reliability or Platform Engineering roles.
  • Strong Kubernetes and database management experience.
  • Proficient in Terraform and automation scripting.

Responsibilities

  • Mentor colleagues in Kubernetes and manage clusters.
  • Write Terraform configurations and maintain tooling.
  • Respond to incidents and support pre-sales inquiries.

Skills

Kubernetes
Terraform
Containers
Golang
Linux Systems
AWS
GCP
Database Management
Automation (Python/Bash)
CI/CD Systems

Education

Bachelor's Degree in Computer Science or related field

Tools

ArgoCD
GitHub
Helm
Cloudflare

Job description

Senior Site Reliability Engineer (Kubernetes)

We're looking for a Senior Site Reliability Engineer (Kubernetes) to join our Infrastructure team in Supermetrics.

Location: Canada (fully remote)

Role: Permanent, full-time. This role consists of on-call rotation.

Onboarding: As part of your onboarding, we expect the candidate to spend 2-3 weeks at our HQ in Helsinki (we organize the travel arrangements).

In this role, you'll:

  • Raise the team's bar in Kubernetes expertise, mentoring, guiding, and supporting colleagues as well as other members of our Engineering organization in working with managed Kubernetes clusters across providers.
  • Operate the platform that enables our SaaS products to be used by thousands of businesses globally, defining SLAs and SLOs and driving the automation that will ensure we meet them.
  • Use your expertise in containers, Kubernetes, databases, and automation to streamline our operations and improve our infrastructure.

Your day-to-day work and responsibilities will include:

  • Writing Terraform configuration and modules that bootstrap a Kubernetes cluster, or reviewing PRs with contributions from other members, ensuring our modules are reusable and well-defined.
  • Writing (using Golang, for example) and maintaining or improving our tooling, facilitating platform utilization by engineering teams.
  • Developing and maintaining Helm charts for internal deployments and third-party software.
  • Responding to incidents within our production environment.
  • Supporting our pre-sales team to answer potential customers' questions on our architecture and data security.
  • Reviewing architecture changes involving new databases and participating in discussions regarding their pros and cons.
  • Rewriting a GitHub Action to improve how we deploy to Kubernetes using GitOps.
  • Troubleshooting and resolving technical issues as they arise.
  • Participating in our on-call rotations to provide support, respond to incidents, or handle internal user questions.

Technologies you'll be working with:

  • ArgoCD, Helmfile, Helm, External Secrets, Cert-manager, Nginx, Contour
  • Terraform
  • Cloudflare (CDN, DNS), Aiven, Redis Co.
  • GitHub Cloud and GitHub Enterprise
  • PHP, Golang

Requirements:

  • 4+ years of experience in Site Reliability Engineering, Platform Engineering, or related roles.
  • Strong understanding of containers and experience operating Kubernetes clusters at scale.
  • Experience operating databases in production.
  • Proficient in database concepts with hands-on experience in both relational and NoSQL databases.
  • In-depth knowledge of Linux systems and Terraform.
  • Experience with AWS and/or GCP.
  • Solid understanding of modern observability practices and tools.
  • Automation mindset with the ability to automate repetitive tasks using scripting languages such as Python or Bash.
  • Team player spirit.
  • Willingness to take on-call rotations during non-business hours.
  • Good communication skills, particularly in writing (documentation and PRs).
  • Strong problem-solving skills with a passion for tools, technologies, and challenges in this space.

Nice to have:

  • A developer background with the ability to write CLIs and other tools in Go, Python, or Rust.
  • Security mindset with experience implementing security best practices.
  • Experience in creating and managing Helm charts.
  • Expert knowledge of CI/CD systems and experience developing and maintaining GitHub Actions.

Recruitment Process:

  • Screening call with the recruiter.
  • Team Interview.
  • Final chat with CIO.

Benefits we offer:

  • Competitive compensation package, including equity.
  • Excellent work equipment and home office allowance for remote workers.
  • Health care benefits and leisure time insurance.
  • Annual 1000 euros personal learning budget.
  • Sports and wellbeing allowance.

Does this sound like your next adventure? Apply now! We'll fill the role as soon as we find the right person.

Join us on our mission to make data a marketing superpower.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer II

Tbwa Chiat / Day Inc

Ontario

Remote

CAD 100,000 - 130,000

Today
Be an early applicant

Site Reliability Engineer

Wave Mobile Money

Ontario

Remote

USD 100,000 - 153,000

Today
Be an early applicant

Site Reliability Engineer (SRE) - Platform Infrastructure team (100% Remote - Canada)

Hopper

Toronto

Remote

CAD 100,000 - 130,000

Yesterday
Be an early applicant

Senior Site Reliability Engineer

VTR Global Com

British Columbia

Remote

CAD 80,000 - 120,000

8 days ago

Intermediate Site Reliability Engineer, Foundations

GitLab

Remote

USD 103,000 - 222,000

17 days ago

Site Reliability Engineer

Dayforce

Remote

CAD 70,000 - 110,000

11 days ago

Senior Site Reliability Engineer, Environment Automation

Tbwa Chiat/Day Inc

Remote

CAD 80,000 - 140,000

30+ days ago

Site Reliability Engineer (SRE) – CVaaS

Arista Networks

Vancouver

On-site

CAD 95,000 - 145,000

Yesterday
Be an early applicant

Senior Site Reliability Engineer, Environment Automation

GitLab

Remote

CAD 100,000 - 125,000

30+ days ago