Enable job alerts via email!

Site Reliability Engineer

Hawksman Technology

Vancouver

On-site

CAD 90,000 - 120,000

Full time

Today
Be an early applicant

Job summary

A tech firm specializing in digital asset infrastructure is seeking a Site Reliability Engineer to oversee the health and reliability of production environments. The role requires a minimum of 3 years’ experience managing complex infrastructures, strong DevOps skills, and a customer-focused approach. This is a full-time position based in Vancouver.

Qualifications

  • 3+ years of experience managing complex infrastructure in demanding environments.
  • Deep knowledge of SRE principles and automation.
  • Proven expertise in DevOps workflows and release management.

Responsibilities

  • Oversee the health and reliability of a managed services production environment.
  • Drive operational excellence through software engineering practices.
  • Collaborate with engineering teams to resolve issues quickly.

Skills

Automation
Monitoring
Performance tuning
Incident management
Kubernetes
Docker
Python
SQL

Tools

Linux
Ansible
Terraform
YAML
Shell
Go
Java

Job description

Get AI-powered advice on this job and more exclusive features.

Hawksman Technology is collaborating with a global leader in digital asset infrastructure for banks, public and private companies. Using blockchain technology, its platform supports cryptocurrencies, tokenized securities, tokenized assets (such as NFTs), digital currencies, and stablecoins.

Role : Site Reliability Engineer

  • Oversee the health and reliability of a managed services production environment by developing and deploying effective monitoring tools.
  • Drive operational excellence through software engineering practices.
  • Enhance system reliability and performance by fixing bugs and implementing scalability improvements directly in the product.
  • Support production environments through incident management, upgrades, health checks, and optimization efforts, including assisting with critical escalations when needed.
  • Collaborate with engineering teams to resolve issues quickly and ensure high client satisfaction.

Requirements :

  • 3+ years of experience managing complex infrastructure in demanding environments; experience in financial services is a plus.
  • Deep knowledge of SRE principles, including automation, monitoring, performance tuning, and incident management.
  • Strong skills with modern DevOps tools (Linux, Kubernetes, Docker, Ansible, Terraform, Python, YAML, Shell).
  • Proven expertise in DevOps workflows, build and testing processes, and release management.
  • Strong communication skills and a customer-focused approach.
  • Experience with databases (SQL) and software development (Go, Java, or Python).
  • Seniority level

    Mid-Senior level

    Employment type

    Full-time

    Job function

    Engineering, Information Technology, and Product Management

    Industries

    Financial Services, Banking, and Technology, Information and Media

    Referrals increase your chances of interviewing at Hawksman Technology by 2x

    Sign in to set job alerts for “Site Reliability Engineer” roles.

    J-18808-Ljbffr

    Get your free, confidential resume review.
    or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.