Get AI-powered advice on this job and more exclusive features.
Our Client - one of Canada's fintech leaders is seeking a DevOps Lead. We are seeking an experienced Technical Lead to take ownership of our application operationalization efforts. This role is for a seasoned professional with a deep understanding of cloud platforms, automation, and Infrastructure-as-Code (IaC) within an Agile environment. You will lead in optimizing and scaling applications, driving operational excellence, and ensuring secure, reliable, and efficient application delivery.
The Manager will be responsible for:
- Leading the operationalization and stability of an e-commerce application, ensuring seamless deployment, monitoring, and management. Implement and oversee processes that enhance reliability, scalability, and performance.
- Providing strategic technical direction to the AppOps team and developers, mentoring and guiding team members to foster a culture of continuous improvement, collaboration, and innovation.
- Leading optimization initiatives using comprehensive monitoring, diagnostic tools, and analytics. Use data-driven insights to proactively address potential issues.
- Developing, managing, and provisioning cloud infrastructure using IaC tools (Terraform, Ansible, Chef, etc.) to support application scalability and automation.
- Managing and optimizing the use of cloud-native tools, including serverless architectures, microservices, and managed services, to support scalable application operations.
- Leading the development and implementation of automation solutions that reduce manual interventions, such as self-healing and auto-scaling systems, streamlining cloud infrastructure and application management processes.
- Collaborating with development, operations, and security teams to embed security best practices into all aspects of the application lifecycle, including CI/CD pipelines and IaC.
- Managing application and infrastructure certificates, ensuring they are up to date and secure. Overseeing vulnerability management processes to maintain a robust security posture.
- Working closely with developers to implement deployment strategies that ensure minimal application downtime, including canary deployments, blue-green deployments, and rolling updates. Ensuring robust rollback and failover procedures.
- Designing and implementing automated post-implementation verification (PIV) processes to ensure application stability and functionality after releases. Collaborating with QA and development teams to automate end-to-end testing and validation.
- Fostering a sense of ownership for applications, features, and services across the team, driving accountability throughout the software development lifecycle.
- Establishing and tracking key application (KPIs, SLI, SLO, SLA) and customer experience metrics to ensure operational excellence and continuous improvement.
- Working with technical writers to create and maintain comprehensive documentation for systems, automations, and processes. Promoting knowledge sharing across teams.
- Leading on-call support for critical application operations to ensure 24/7 availability and quick incident resolution.
Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or related field.
- 5-8+ years of experience in AppOps, DevOps, or SRE roles, with at least 3-5+ years in a technical leadership capacity overseeing large-scale, cloud-based applications.
- Experience leading Agile and DevOps transformations, driving CI/CD and IaC practices.
- Proven ability to lead cross-functional teams (developers, SRE, QA, security) to deliver highly available, scalable, and resilient applications.
- Extensive experience with IaC and automation tools (Terraform, Ansible, Chef, Pulumi) and guiding teams in best practices.
- Knowledge of SRE principles, SLAs, SLOs, and error budgets to drive operational excellence.
- Experience designing monitoring, alerting, and observability frameworks using tools like Prometheus, Grafana, ELK, Dynatrace, and Splunk.
- Proficiency in programming languages such as Java, Python, YAML, Go, etc.
- Ability to mentor engineering talent and foster a culture of automation and ownership.
- Must be eligible to work for Interac Corp. in Canada in a full-time capacity.
- Certifications in relevant cloud and automation tools (AWS, Azure, Kubernetes, Terraform) are a plus.
Nice to have:
- SRE Certification
- Terraform Associate Certification
- ITIL Foundation
Seniority level
Employment type
Job function
Industries
- Banking, IT Services and Consulting, Software Development