Position Title
Position Title: Systems Administrator Team Lead
Location: Ottawa, ON (377 Dalhousie Street)
Work Model: Hybrid - 4 days onsite, 1 day work from home
About Rebel
OUR CUSTOMERS BRING A VISION - WE BRING THE PLATFORM TO SHARE IT ONLINE.
We believe that those who contribute make us better. It’s why we create simple, useful tools to empower participation in the world’s bravest communication space: the Internet.
We are experts in domain names and the products that make the most of them. This helps our customers showcase their ideas, stories, services and contributions to the world.
Our manifesto: Be Thoughtful, Be Simple, Be Brave.
Role Overview
We’re hiring a Systems Administrator Team Lead to lead our Systems Team, responsible for reliable, secure, and scalable operations across IT, cloud/server infrastructure, hosting platforms, and Live Production Support (LPS). You’ll combine hands‑on systems administration and platform engineering with people leadership—setting priorities, improving processes, and ensuring strong service delivery and operational excellence. This role includes participation in an LPS on‑call rotation, with after‑hours support when required to protect production systems and services.
What You’ll Do
Lead and develop a small Systems Team
- Coach, mentor, and support team members through regular 1:1s, feedback, and growth plans.
- Set clear ownership, expectations, and escalation paths, including on‑call coverage.
Run the team using agile practices
- Act as the team’s Scrum Master: facilitate planning, standups, retrospectives, and backlog refinement.
- Manage intake and prioritization with stakeholders to balance roadmap work, technical debt, and urgent operational needs.
Own IT Operations
- Oversee user lifecycle and access management, endpoint tooling, and operational standards.
- Improve documentation, runbooks, and repeatable processes for common requests and incidents.
Drive security operations
- Support security controls and operational hygiene (patching, vulnerability management, incident response readiness).
- Promote least‑privilege access, secure configurations, auditing, and continuous improvement.
Manage cloud and server infrastructure
- Administer and improve AWS and other infrastructure platforms (IAM, networking, compute, storage, monitoring).
- Maintain Windows and Linux environments, ensuring stability, patching, hardening, and automation.
Own hosting platforms
- Operate and improve hosting environments including Plesk and WordPress, focusing on uptime, performance, and security.
- Standardize deployments, upgrades, backups, and troubleshooting processes.
Advance platform engineering / DevOps
- Improve delivery and operational workflows using Git and modern DevOps practices.
- Increase automation and repeatability (scripting, infrastructure‑as‑code patterns, CI/CD improvements where applicable).
Ensure high availability and disaster readiness
- Maintain and test backups, DR plans, and recovery procedures.
- Track availability, lead incident reviews/root cause analysis, and implement reliability improvements.
Provide Live Production Support (LPS)
- Participate in LPS to ensure production stability and rapid incident response.
- Join an on‑call rotation and provide after‑hours support when required for production incidents and escalations.
- Improve LPS effectiveness through better monitoring, alerting, runbooks, and incident workflows.
What You Bring
Team leadership & people management
- Experience leading a small technical team with strong coaching, communication, and accountability.
- Ability to balance hands‑on technical work with planning, delegation, and stakeholder alignment.
Scrum Master / agile delivery
- Proven ability to run agile ceremonies, maintain a prioritized backlog, and drive continuous improvement.
Systems administration (Windows & Linux)
- Strong operational experience administering and troubleshooting Windows and Linux systems.
- Familiar with patching, hardening, identity/access controls, and automation best practices.
Cloud administration & cloud‑native infrastructure (AWS)
- Strong experience administering AWS (IAM, VPC/networking, compute, storage, monitoring/observability).
- Understanding of cloud‑native operational patterns (scalable design, resilience, monitoring, automation).
Platform engineering / DevOps & Git
- Experience with DevOps practices and tooling to improve reliability and delivery.
- Comfort with Git workflows and infrastructure/platform change management.
Hosting (Plesk / WordPress)
- Hands‑on experience managing Plesk and WordPress hosting environments, including upgrades, security, backups, and performance troubleshooting.
Disaster recovery & availability management
- Experience designing and operating DR processes, backups, and availability improvements.
- Ability to run incident response, lead post‑incident reviews, and reduce recurrence through RCA and follow‑through.
Live Production Support (LPS) / on‑call readiness
- Comfortable supporting production systems under pressure with clear communication and structured incident management.
- Willingness to participate in an on‑call rotation and provide after‑hours support when required.
What We Offer
The opportunity to work in an atmosphere that truly rewards hard work and creative thinking. We offer a competitive salary, benefits, and opportunities for growth and advancement within our company. As if that wasn’t enough we also offer a smoke‑free environment, a downtown location, a fully stocked fridge free for all staff. If Rebel sounds like the perfect workplace for you, there is only one question- What are you waiting for?
About This Role
This role represents an existing vacancy.
Compensation
CAD $90,000 - $130,000 annually, plus benefits.
How We Hire
As part of this recruitment process, we use automated or artificial intelligence–enabled tools to support the screening and assessment of candidates’ applications. All hiring decisions are made by our team.