Cloud Site Reliability Engineer II

Zafin

Ottawa

Hybrid

CAD 110,000 - 140,000

Full time

3 days ago

Job summary

Zafin is seeking a Cloud Site Reliability Engineer II to enhance cloud infrastructure reliability and performance. This role involves leading strategic initiatives, mentoring junior engineers, and managing complex technical issues in a hybrid work environment. The ideal candidate will have extensive experience in cloud technologies, particularly Microsoft Azure, and a strong background in incident management.

Benefits

Competitive Salaries
Annual Bonus Potential
Generous Paid Time Off
Wellness Benefits
Professional Growth Opportunities

Qualifications

  • 12+ years of experience in cloud support or operations.
  • Advanced expertise in Microsoft Azure or equivalent cloud platforms.

Responsibilities

  • Lead resolution of complex technical issues in Azure cloud environment.
  • Conduct Root Cause Analysis for high-severity incidents.
  • Mentor junior engineers and drive strategic initiatives.

Skills

Cloud Support
Leadership
Incident Management
Automation
Scripting

Education

Bachelor’s degree in Computer Science
Master’s degree

Tools

Microsoft Azure
Azure DevOps
PowerShell
Python
Postgres

Job description

The world’s top banks use Zafin’s integrated platform to drive transformative customer value. Built on an innovative AI-powered architecture, Zafin’s platform seamlessly unifies data from across the enterprise to accelerate product and pricing innovation, automate deal management and billing, and create personalized customer offerings that drive expansion and loyalty.

Zafin empowers banks to drive sustainable growth, strengthen their market position, and define the future of banking centered around customer value.

What is the Opportunity?

Zafin is seeking a Cloud Site Reliability Engineer II (CSRE II) to lead strategic initiatives in ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. This advanced role requires mastery of cloud technologies, strategic planning, and incident management to drive innovative solutions and operational excellence.

As a CSRE II, you will influence the direction of cloud reliability strategies, mentor junior engineers, and lead significant projects that have a broad organizational impact. This position reports directly to the VP of Cloud Services and requires a proactive, collaborative mindset to achieve operational and strategic objectives.

Mode of Work: Hybrid

What will you do?

  • Lead and manage the resolution of complex technical issues involving Zafin’s products and Azure cloud environment.
  • Design and implement strategic operational enhancements to improve resiliency and system reliability.
  • Conduct in-depth Root Cause Analysis (RCA) for high-severity incidents and drive initiatives to reduce error recurrence.
  • Represent the organization in external client escalation calls, providing expert guidance and solutions.
  • Architect and optimize cloud infrastructure for high performance, scalability, and cost-effectiveness.
  • Provide thought leadership in managing and scaling container orchestration platforms such as AKS and OpenShift.
  • Oversee the implementation of advanced monitoring solutions and integrate predictive analytics for proactive issue resolution.
  • Develop and execute automation strategies to streamline operational workflows and incident responses.
  • Create and maintain comprehensive documentation of cloud architectures, processes, and incident management strategies.
  • Mentor and coach junior engineers, fostering a culture of continuous learning and innovation.
  • Drive strategic initiatives, collaborating with cross-functional teams to achieve organizational objectives.

What do you need to succeed?

Must Haves:

  • Bachelor’s degree in Computer Science, Engineering, or a related field (Master’s degree preferred).
  • 12+ years of experience in cloud support, operations, or a related role.
  • Advanced expertise in Microsoft Azure (preferred) or equivalent cloud platforms.
  • Demonstrated experience in designing and scaling container orchestration systems like AKS or OpenShift.
  • Proven leadership in managing automated deployment pipelines, including Azure DevOps.
  • Mastery of enterprise monitoring platforms (e.g., Azure Insights, Grafana) and predictive analytics tools.
  • Advanced scripting skills with PowerShell, Python, or similar languages.
  • Extensive experience in incident management and defining SLAs for global production environments.
  • In-depth knowledge of database management, particularly Postgres.

Preferred Qualifications:

  • Advanced certifications in cloud platforms (e.g., Azure Solutions Architect Expert).
  • Experience with ITSM tools and processes (e.g., ServiceNow).
  • Comprehensive understanding of security and compliance in cloud environments.

What’s in it for you

Joining our team means being part of a culture that values diversity, teamwork, and high-quality work. We offer competitive salaries, annual bonus potential, generous paid time off, paid volunteering days, wellness benefits, and robust opportunities for professional growth and career advancement. Want to learn more about what you can look forward to during your career with us? Visit our careers site and browse our openings: zafin.com/careers

Zafin welcomes and encourages applications from people with disabilities. Accommodations are available on request for candidates taking part in all aspects of the selection process.

Zafin is committed to protecting the privacy and security of the personal information collected from all applicants throughout the recruitment process. The methods by which Zafin collects, uses, stores, handles, retains, or discloses applicant information can be accessed by reviewing Zafin’s privacy policy at https://zafin.com/privacy-notice/. By submitting a job application, you confirm that you agree to the processing of your personal data by Zafin as described in the candidate privacy notice.

Apply for this job

Are you legally entitled to work in Canada?

Do you currently reside in Ottawa?

Are you comfortable with a hybrid work environment (3 days in office)?

Do you have 12+ years of experience in cloud support, operations, or a related role?

Do you have advanced expertise in Microsoft Azure (preferred) or equivalent cloud platforms?
