Enable job alerts via email!

Site Reliability Engineer (SRE)

ALLTECH CONSULTING SVC INC

Quebec

On-site

CAD 80,000 - 120,000

Full time

4 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading consulting firm in Quebec is seeking a Site Reliability Engineer to enhance operational reliability and customer support for their ServiceNow SaaS implementation. The ideal candidate will have strong development skills, experience in system troubleshooting, and a commitment to operational excellence in a collaborative environment.

Qualifications

  • At least 7+ years of experience.
  • Proficient in one or more programming language (e.g., Python).
  • Prioritization skills for technical debt.

Responsibilities

  • Maximizing availability and performance of supported systems.
  • Troubleshooting ServiceNow issues in a Linux environment.
  • Exploring and delivering observability metrics and alerting.

Skills

Software development skills
Communication skills
Teamwork
Troubleshooting

Job description

Job Description:

– The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for the Company’s ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead.
– This role requires delivering a range of SRE practices within a global community of other SREs.
– This means teaming up with colleagues to deliver reliable, resilient systems without wasteful operational effort.
– SRE practices include task optimization and automation, prioritizing technical debt, observability and monitoring dashboards, capacity management, incident response, and problem elimination.
– This position specializes in ServiceNow Software as a Service which provides a suite of IT service management capabilities and is integrated with many products such as chatbot technology, on-call escalation incident management, and a range of other on-premises infrastructure (including SQL databases, APIs, and web infrastructure).
– Despite the focus on value-add development and process delivery, this is also a production-side, operational role requiring participation in an on-call rotation from time to time.
– Successful candidates for SRE roles in Application Infrastructure have so far come from a variety of backgrounds; maybe a developer today looking to evolve site reliability as a practice, or an infrastructure specialist with an interest in reliability and resilience principles, or a strong system admin who enjoys troubleshooting along with some task automation experience.
– Prior experience in the financial services industry is not required, and we welcome candidates from all industries and backgrounds to apply.
Responsibilities include:
– Delivery of improvements that will maximize the availability and performance of supported systems through optimized and automated operational tasks, collaborating on the development of operational tools, ongoing problem management, and architecture reviews with colleagues.
– Troubleshooting ServiceNow issues, and also some on-premise capabilities in a Linux environment from time to time, collaborating with others get to the bottom of issues, and agreeing on lasting improvements that can be made.
– Exploring and delivering observability including metrics, logging, tracing and alerting that can define and measure the target reliability of a product.
– Being dependable and responsive during agreed hours, like when part of the on-call rotation with the rest of the global team (with a time-off in lieu system).
– A commitment to understanding the Firm’s ServiceNow instances and related dependencies, contributing to their documentation.
– Identification and prioritization of technical debt that is can impact client satisfaction or operational efficiency.
– Give feedback on policy and procedures related to the delivery of SRE and operational practices with a view to continually making the Firm safer and more efficient.
Skills required:
– The ideal candidate would have at least one of: Software development skills in one or more programming language, e.g. Python, ServiceNow administration or development experience,
– 7 + years of experience
– Proficient oral and written communication skills
– Establishing warm, effective relationships with colleagues to collaborate on successful delivery
– A dependable team worker with demonstrated commitment to client service
– Ability to respond appropriately during occasional technical emergencies, like outages.
Skills desired:
– ServiceNow administration or development experience, although this can be acquired by the successful candidate via on the job and via training.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Site Reliability Engineer (SRE)

Pragmatike

Quebec

Remote

CAD 90,000 - 130,000

2 days ago
Be an early applicant

Site Reliability Engineer

Upsun

Remote

CAD 80,000 - 120,000

5 days ago
Be an early applicant

Senior Site Reliability Engineer, Environment Automation New Remote, Canada

GitLab Inc.

Remote

CAD 90,000 - 130,000

3 days ago
Be an early applicant

Site Reliability Engineer

Diversis Capital LLC

Toronto

Remote

CAD 90,000 - 130,000

4 days ago
Be an early applicant

Senior Turbine Reliability Engineer

Ctrl

Toronto

Remote

CAD 80,000 - 110,000

6 days ago
Be an early applicant

Site Reliability Engineer

HRB

Remote

CAD 100,000 - 140,000

8 days ago

Site Reliability Engineer (SRE) AWS

Pragmatike

Ottawa

Remote

CAD 100,000 - 130,000

19 days ago

Site Reliability Engineer

Wave Mobile Money

Remote

USD 60,000 - 153,000

22 days ago

Site Reliability Engineer - Data Platform

Kraken Digital Asset Exchange

Remote

CAD 110,000 - 176,000

27 days ago