Site Reliability Developer 1

Vena Solutions

Toronto

Hybrid

CAD 89,250 - 120,750

Full time

8 days ago

Job summary

A dynamic technology firm is seeking a Site Reliability Developer 1 to enhance the observability and stability of its SaaS platform. This role offers flexibility with options to work remotely or from the Toronto office. Responsibilities include supporting ITIL processes, defining runbooks, and maintaining service health. Ideal candidates possess a Bachelor's degree, 2+ years in DevOps or IT operations, and experience in cloud infrastructure management. Competitive compensation ranges from $89,250 to $120,750 CAD annually.

Qualifications

  • 2+ years of experience in IT Operations, DevOps, SRE, or Software Engineering.
  • Experience with cloud computing (AWS and Azure).
  • Ability to write production-level code.

Responsibilities

  • Support ITIL processes like Incident and Change management.
  • Define and document runbooks.
  • Triage operational requests.
  • Maintain services by monitoring availability and health.
  • Participate in on-call rotation.

Skills

Operational Requests Handling
Troubleshooting
Collaboration
Coding
Monitoring

Education

Bachelor’s degree in computer science or equivalent

Tools

AWS
Terraform
Ansible
Docker
Jenkins

Job description

Department: SaaS Operations

Location: Canada - Remote (0002)

Compensation: $89,250 - $120,750 / year

This is a flexible position: you can work in our Toronto office full time, hybrid throughout the week, or entirely remotely. Please note that this role includes participating in an on-call rotation every 2–3 weeks, covering 10am–10pm from Monday through Sunday.

Vena is looking for an SRE to join our SaaS Technology and Operations (STO) team. This role is a match for you if you love building highly scalable, resilient, and automated services. We are an innovative team that aims to provide an exceptional customer experience by leveraging best-in-class automation and orchestration practices for Vena's SaaS platform. As a Site Reliability Developer 1, you will be part of the first level of contact for the STO team, focusing on improving the observability, scalability, stability, and security of Vena’s SaaS platform.

How You'll Make An Impact
  • Support key ITIL processes, including incident management, request management, problem management, and change management.
  • Define and document runbooks and standard operating procedures.
  • Field operational requests from our Application Support team and other internal stakeholders.
  • Triage and solve issues within defined SLAs to ensure an excellent customer experience and to unblock other development and support teams.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Identify and troubleshoot problems, investigate root causes, and champion fixes across the organization.
  • Work with Infrastructure-as-Code (IaC) with a focus on continuous improvement.
  • Collaborate with cross-functional team members on features and implementation within an agile environment.
  • Report on SLAs and performance metrics as part of the Operations function.
  • Participate in on-call rotation.

What We Use
  • A modern AWS cloud infrastructure managed through infrastructure-as-code (Terraform), configuration-as-code (Ansible), and CI/CD (Jenkins)
  • RDS MySQL, Redshift, Redshift Spectrum, MongoDB, and Elasticsearch
  • Kinesis, SQS, and RabbitMQ
  • DevOps tools written in Python
  • Back-end applications written using Java, Dropwizard, Spring Boot, and Hibernate
  • Front-end applications written using TypeScript, JavaScript, React (Context API and Hooks), and Redux
  • Monitoring with Datadog and CloudWatch

We'd Love to See
  • Bachelor’s degree in computer science, software engineering, or equivalent experience
  • 2+ years of experience in an IT Operational, DevOps, SRE, or Software Engineering role.
  • Experience with cloud computing services (AWS and Azure) and a developing level of knowledge of the setup and management of cloud infrastructure.
  • You can write code - in any language. You have implemented your work in a production environment and can back it up with examples.
  • Experience with tools and platforms such as Ansible, build/release pipelines, Docker, GitHub, and Terraform.
  • Developing-level knowledge of distributed systems in the cloud, using observability and telemetry to oversee code deployments and service level objectives (SLOs).
  • Developing experience with the operational aspects of software systems, using telemetry, centralized logging, and alerting with tools such as CloudWatch, Datadog, and Prometheus.

Base salary range 89,250 - 120,750 CAD
  • Our salaries are tailored to roles, levels and locations. Your individual pay within this range is influenced by factors like work location, skills, experience and education. As you progress in your role, your compensation may adapt, offering flexibility for growth beyond initial levels. For specifics, your recruiter will provide details and address any questions during the hiring process.