Site Reliability Engineer, Customer Security

Coalition Inc

Canada

Remote

CAD 80,000 - 100,000

Full time

30+ days ago

Job summary

Join a forward-thinking company as a Site Reliability Engineer, where you'll create and maintain scalable, reliable cloud infrastructure. This role offers the chance to work on impactful projects, automate processes, and improve system observability. Collaborate with cross-functional teams to develop strategies that minimize downtime and enhance operational excellence. If you're passionate about solving complex problems and thrive in a dynamic environment, this opportunity is perfect for you. Be part of a mission-driven team dedicated to protecting organizations from digital risks while fostering an inclusive culture.

Qualifications

  • 3+ years in SRE/DevOps/Cloud or Software Development roles.
  • Strong AWS knowledge and experience with IaC tools.

Responsibilities

  • Design and manage robust cloud solutions for high performance.
  • Automate infrastructure and optimize cloud resources.

Skills

SRE/DevOps/Cloud engineering
AWS services
Terraform
Containerization
CI/CD pipelines
Analytical skills
Problem-solving skills

Education

Bachelor's degree in Computer Science
Master's degree in Computer Science

Tools

Terraform
AWS
Kubernetes
GitHub Actions

Job description

Site Reliability Engineer, Customer Security

Coalition is the world's first Active Insurance provider designed to help prevent digital risk before it strikes. Founded in 2017, Coalition combines broad insurance coverage with a digital risk assessment and continuous security monitoring to help organizations protect themselves in today’s hyper-connected world.

Opportunities to make an impact with bold thinking are real - and happening daily.

About the role

Are you passionate about creating and maintaining scalable, reliable, and secure cloud infrastructure? As a Site Reliability Engineer at Coalition, you'll play a pivotal role in ensuring the performance, availability, and efficiency of our cloud-based systems. Working closely with cross-functional teams, you’ll design, implement, and manage robust cloud solutions that drive our mission to protect the unprotected.

This role offers the opportunity to work on impactful projects such as automating infrastructure, building developer-friendly platforms, optimizing cloud resources, improving system observability, and driving operational excellence across the organization. You will also participate in a low-volume on-call rotation to ensure our systems remain highly reliable and available.

In this role, you'll focus on isolating, trapping, and responding to inevitable system failures, and on developing strategies for continuous monitoring and analysis that minimize downtime and reduce the need for manual intervention. Our core platform is primarily built in Python, with some services written in Java and Go. We take a pragmatic approach to technology, using the right tools for the job and designing systems to scale and evolve as we grow.

This role is a great fit for software engineers looking to transition into SRE or for experienced SREs eager to work in a dynamic environment solving challenging technical problems. If you thrive on solving complex problems, working with cutting-edge technologies, and making systems resilient, we’d love to hear from you!

Qualifications
  • 3+ years of experience in SRE/DevOps/Cloud engineering or Software Development roles in a full stack engineering environment
  • Strong understanding of AWS services (e.g., EC2, S3, RDS, Lambda, VPC) and best practices for building scalable, secure, and cost-effective infrastructure.
  • Hands-on experience with IaC tools like Terraform, CloudFormation, or CDK to automate cloud infrastructure deployment and management.
  • Must have experience working with containerization and orchestration tools such as ECS or Kubernetes
  • Experience working with fault tolerant services and the iterative development of highly-available systems
  • Exposure to full-stack monitoring from system level metrics to SLOs, failure-based testing approaches, and monitoring strategies
  • Understanding of CI/CD pipelines to accelerate deployments and improve both security and auditability (e.g., GitHub Actions, Jenkins, Travis CI, or CircleCI)
  • Experience soliciting systems requirements, designing, and implementing new platform components leveraging infrastructure or SaaS services
  • Some knowledge of software engineering design patterns, agile development, and architecture principles
  • Strong analytical and problem-solving skills, with experience in debugging and resolving infrastructure or application issues.
  • Ability to work closely with cross-functional teams, effectively communicate complex ideas, and advocate for best practices.
  • Bachelor’s or Master’s degree in Computer Science or a related field, or equivalent experience

Bonus Skills
  • Experience working with HashiCorp Nomad and/or Vault
  • Experience in Go or Python, writing libraries and tooling
  • Familiarity with cloud networking concepts (e.g., VPC, DNS, Load Balancers, NAT) and cloud security principles, including IAM, role-based access control, and encryption.
  • Exposure to Kafka or other event streaming platforms

Why Coalition?

We’re a remote-first, mission-driven team committed to building a more inclusive culture with people of all different backgrounds. We trust our team members to take responsibility, share ownership, and put in the work to help us in our pursuit to solve digital risk.

Coalition’s exceptional growth stems from its ability to address real-world problems for organizations of all sizes and remain true to our founding values of character, humility, responsibility, purpose, authenticity, and inclusion.

We’re always looking for collaborative, inquisitive individuals to join #OurCoalition.
