Senior Site Reliability Engineer, Observability

Framework Ventures

Remote

CAD 100,000 - 130,000

Full time

3 days ago

Job summary

A leading blockchain technology company in Toronto is seeking a Senior Site Reliability Engineer (SRE) to build and maintain their observability platform. Ideal candidates will have a strong DevOps mindset, 7+ years of relevant experience, and expertise in programming and real-time systems. The role offers an opportunity to work with cutting-edge technology in a versatile team environment, where collaboration and innovation are key to accelerating engineering efficiency.

Qualifications

  • 7+ years of experience in DevOps, infrastructure, SRE, or platform teams.
  • Experience programming in C, C++, Java, Python, Go, Perl, or Ruby.
  • Expert knowledge in managing large real-time systems.

Responsibilities

  • Build and orchestrate a modern OTEL-based observability platform.
  • Ensure reliability, security, and performance exceed SLAs.
  • Lead design and deployment of monitoring services.

Skills

DevOps mindset
Observability platforms
Monitoring and logging
Communication skills

Tools

AWS
Terraform
Kubernetes
Prometheus
Grafana

Job description

Overview

About Chainlink

Chainlink is the industry-standard oracle platform bringing the capital markets onchain and powering the majority of decentralized finance (DeFi). The Chainlink stack provides essential data, interoperability, compliance, and privacy standards needed to power advanced blockchain use cases for institutional tokenized assets, lending, payments, stablecoins, and more. Chainlink has enabled tens of trillions in transaction value and now secures the vast majority of DeFi. Many of the world’s largest financial services institutions have adopted Chainlink’s standards and infrastructure, including Swift, Euroclear, Mastercard, Fidelity International, UBS, S&P Dow Jones Indices, FTSE Russell, WisdomTree, ANZ, and top protocols such as Aave, Lido, GMX and many others. Chainlink leverages a novel fee model where offchain and onchain revenue from enterprise adoption is converted to LINK tokens and stored in a strategic Chainlink Reserve. Learn more at chain.link.

The Observability Team enables Chainlink development and empowers engineers to continue building and supporting crucial products and services that have a profound impact in the blockchain industry. Reliability is vital to the success of our company. As a Senior SRE, you will help us accelerate and enable other engineering teams by increasing self-service and decreasing cognitive load.

This role is ideal for someone with a strong DevOps mindset, a passion for building and maintaining a mature GitOps environment, and experience focusing on observability. The entire engineering team is expanding, offering opportunities to build, learn, and grow.

We welcome applicants from diverse backgrounds. If you think you would do a great job at Chainlink, we look forward to speaking with you, even if you don't match 100% of the job requirements: those describe people we've usually had a great time working with, but they're not a tick-box exercise.

Your Impact
  • Build and orchestrate a modern OTEL-based observability platform
  • Support multiple telemetry types, such as metrics, logs, and traces
  • Define and support modern observability governance and solve observability problems at scale
  • Ensure reliability, security, and performance exceed our defined SLAs
  • Collaborate with engineers across the company to troubleshoot issues, deploy new products and services, and increase velocity while decreasing cognitive load
  • Lead the design and deployment of monitoring/observability services that detect issues and alert the team when action is needed
  • Ingest, aggregate, transform, and utilize data from multiple sources in our real-time data pipeline
  • Oversee availability, performance, and supportability of our observability infrastructure
  • Create processes around alert response operations and support the team to ensure reliable delivery of oracle data
  • Suggest metrics to enable alerts with every new feature release
  • Champion reliability and security by doing work right the first time

Requirements
  • 7+ years of relevant professional experience in DevOps, infrastructure, SRE, and/or platform teams
  • Ability to develop software beyond typical infrastructure requirements and configurations
  • Experience programming in C, C++, Java, Python, Go, Perl, or Ruby
  • Expert knowledge in designing, developing, and managing large real-time systems
  • Experience with monitoring and logging: exporting metrics with Prometheus, building Grafana dashboards, and using centralized logging solutions such as the ELK Stack, Splunk, or the Grafana Stack
  • Experience with distributed systems and container orchestration: building or maintaining Kubernetes clusters and deploying new services on Kubernetes
  • Strong communication skills and comfort in planning meetings and code reviews

Desired Qualifications
  • Excitement for blockchain, Web 3.0, and decentralized technologies
  • Experience running infrastructure in the blockchain/web3 space
  • Ability to scale systems sustainably through automation, evolving them for reliability and velocity
  • Experience working remotely in a distributed team
  • Desire to grow and automate services to reduce toil

Tools and Services
  • AWS; Terraform/Terragrunt; Kubernetes, Calico and ArgoCD; Prometheus and Grafana; GitHub Actions; Packer
  • We expect proficiency with these tools and related ones

All roles with Chainlink Labs are global and remote-based. Unless otherwise stated, overlap with Eastern Standard Time (EST) is encouraged.

We carefully review all applications and aim to provide a response to every candidate within two weeks after the job posting closes. The closing date is listed on the job advert, so please prepare your application thoughtfully. We will fully consider your experience and skills, and you will hear from us regarding the status of your application shortly after the closing date.

Commitment to Equal Opportunity

Chainlink Labs is an equal opportunity employer. All qualified applicants will receive equal consideration for employment in compliance with applicable laws, regulations, or ordinances. If you need assistance or accommodation due to a disability or special need when applying for a role or in our recruitment process, please contact us via this form.

Global Data Privacy Notice for Job Candidates and Applicants

Information collected and processed as part of your Chainlink Labs Careers profile and any job applications you submit is subject to our Privacy Policy. By submitting your application, you are agreeing to our use and processing of your data as required.
