Senior Manager, Software Engineering (Infrastructure)

Loopio Inc.

Remote

CAD 120,000 - 160,000

Full time

Today

Job summary

A leading tech company in Canada is seeking a senior engineering leader to manage SRE, Infrastructure, and MLOps teams. This role involves ensuring reliability, scalability, and cost efficiency of systems while partnering with various teams to enhance platform performance. Candidates should have extensive experience in cloud engineering, possess an understanding of machine learning infrastructure, and have a knack for strategic communication. Join a remote-first organization that prioritizes health, wellness, and ongoing personal development.

Benefits

Health and wellness benefits
Remote work support
Professional development allowance
Flexible co-working locations

Qualifications

  • At least 8 years of experience in SRE or cloud engineering.
  • Experience leading specialized engineering teams for 3+ years.
  • Understanding of infrastructure needs for machine learning.

Responsibilities

  • Lead and grow teams across SRE, Cloud Infrastructure, and MLOps.
  • Own the operational health of production systems.
  • Evolve Loopio’s cloud architecture.

Skills

Cloud Proficiency
Operational Grit
MLOps Awareness
Systems Scaling & Observability
Strategic Communication

Experience

8+ years in infrastructure or cloud engineering

Tools

AWS
Terraform

Job description

Take your career to new heights with Loopio! 🚀✨

Loopio is looking for a senior engineering leader to own our SRE, Infrastructure, and MLOps teams. In this role, you will be the primary architect of the reliability, scalability, and cost efficiency of the systems that power Loopio’s platform.

You’ll lead teams that design, build, and operate our production infrastructure, ensuring our services are resilient, observable, and ready to scale as we integrate advanced AI and agentic workflows. You’ll partner closely with Product Engineering, Security, and Data teams to enable fast, safe delivery while maintaining operational excellence.

Note: This is an existing vacancy on the team.

🚀 What You’ll Be Doing

Leadership & Team Development

  • Lead and grow multiple teams across SRE, Cloud Infrastructure, and MLOps.
  • Coach and develop engineering managers and senior individual contributors, fostering a culture of ownership and high craft.
  • Build a "Platform-as-a-Product" mindset, ensuring that infrastructure and ML tooling serve as enablers for the rest of the engineering organization.
  • Partner with Recruiting to attract and retain specialized talent in the cloud, reliability, and machine learning infrastructure space.

Reliability & Operational Excellence

  • Own the operational health of production systems, including availability, latency, and durability.
  • Define and evolve SLIs, SLOs, and error budgets, moving the organization toward data-driven reliability decisions.
  • Lead incident response, driving blameless postmortems and systemic improvements to reduce toil and improve on-call sustainability.
  • Support ML-specific reliability, ensuring that model inference pipelines and vector databases meet the same high standards as our core SaaS platform.

Infrastructure & MLOps Strategy

  • Evolve Loopio’s cloud architecture, overseeing capacity planning, disaster recovery, and business continuity.
  • Drive the MLOps roadmap, establishing standards for model deployment, monitoring, and scaling (including LLM orchestration and RAG pipelines).
  • Lead Cloud FinOps, ensuring our infrastructure and AI compute costs are visible, intentional, and optimized.
  • Establish standards for infrastructure automation (IaC), configuration management, and secrets handling.

Security & Cross-Functional Leadership

  • Partner with Security to ensure a secure-by-default infrastructure and robust backup/recovery strategies.
  • Communicate risks and trade-offs clearly to senior leadership, acting as a calm, trusted voice during high-severity events.
  • Collaborate with Product Engineering to support the delivery of high-impact AI features without sacrificing platform stability.

✨ What You’ll Bring to the Team

  • 8+ years of experience in infrastructure, SRE, or cloud engineering roles, with 3+ years leading specialized engineering teams.
  • Deep Cloud Proficiency: Extensive experience with AWS (preferred) and modern infrastructure-as-code (Terraform).
  • Operational Grit: A proven track record of leading teams through production incidents and complex architectural migrations.
  • MLOps Awareness: Understanding of the unique infrastructure needs for machine learning, such as GPU orchestration, model serving, or data pipeline stability.
  • Systems Scaling & Observability: Proven expertise in managing large-scale containerized environments and leveraging observability stacks to ensure platform health.
  • Strategic Communication: Ability to align technical roadmaps with business objectives and advocate for infrastructure investment.

Experience with FinOps or managing significant cloud budgets is a plus.

Background in supporting AI agentic workflows or autonomous orchestration systems is a plus.

Where You’ll Work

  • Loopio is a remote-first workplace because we recognize the advantages of working flexibly. We are HQ’d in Canada, with established hub regions around the world where we hire from.
  • Our employees (or Loopers, as we call ourselves!) live and work in Canada (British Columbia and Ontario), the United Kingdom (London), and India (Gujarat, Maharashtra, and Bengaluru).
  • The majority of our team is based in Ontario and British Columbia, which means these employees live and work remotely within a 300 km radius of Toronto or Vancouver.
  • We offer flexible co-working locations to Loopers in Ontario and British Columbia. Those based in Ontario have the option of working out of a co-working space in downtown Toronto, near Union Station. BC Loopers can work centrally in Vancouver.
  • You’ll collaborate with your teams virtually across the UK, India, and North America with core sync hours and focus time for heads-down work during the workday.
  • We encourage asynchronous collaboration to effectively work as a global #OneTeam.

Why You’ll ♥️ Working at Loopio

  • Your manager supports your development by providing ongoing feedback and regular 1-on-1s. We use Lattice for our 1-on-1s and performance conversations.
  • You will have the opportunity to elevate your craft and explore your creativity, with a dedicated professional mastery allowance for more learning support. We encourage experimentation and innovative thinking to drive business impact.
  • We offer a wide range of health and wellness benefits to support your physical and mental well-being, starting day 1 with Loopio.
  • We’ll set you up to work remotely with a MacBook laptop, a monthly phone and internet subsidy, and a work-from-home budget to help get your home office set up.
  • You’ll be joining a supportive culture that has thoughtfully built out opportunities for connections in a remote-first environment.
  • Participate in town halls, AMA (Ask-Me-Anything) sessions, and quarterly celebrations that mark the big wins and milestones as #OneTeam.
  • Our four active Employee Resource Groups offer opportunities for employees to learn and connect year-round.
  • You’ll be a part of an award-winning workplace with an opportunity to make a big impact on the business.

Questioning your qualifications? Read this — Loopio recognizes that many candidates don’t apply because they don’t hit every box. We still encourage you to apply to ensure your application is reviewed. We understand that a resume can only showcase so much, so we’ve created prompts in the application for you to share more about yourself.

AI in Recruitment — Loopio leverages artificial intelligence (AI) to enhance our recruitment process. These tools assist with tasks such as resume screening and drafting preliminary job descriptions, but AI is not used to make final hiring decisions. Our standardized hiring practices remain focused on reducing biases, with all key hiring decisions made by our team. We continuously review and refine our hiring practices to align with industry best practices and evolving legal guidelines.

Loopio is an equal opportunity employer committed to building equitable workplaces that are diverse and inclusive. We encourage candidates from all backgrounds and lifestyles to consider us as a future employer. Please contact a member of our Talent Experience team at work at loopio.com should you require accommodations at any point during our virtual interview processes.