Cloud Ops Engineer

Air-tek

Toronto

On-site

CAD 80,000 - 100,000

Full time

Today

Job summary

A Canadian software company in Toronto is looking for a Cloud Operations Engineer to ensure the reliability and performance of hosted applications. Responsibilities include supporting production environments, monitoring system health, and optimizing AWS infrastructure. The ideal candidate has over 3 years of experience in Cloud Operations and a strong grasp of AWS services. This role demands excellent problem-solving skills and a proactive approach to maintaining operational excellence.

Qualifications

  • 3+ years of experience in Cloud Operations, DevOps, or IT Infrastructure Support.
  • Strong hands-on experience with AWS services.
  • Understanding of security best practices and IAM management in AWS.

Responsibilities

  • Provide day-to-day operational support for production and staging environments.
  • Monitor application and infrastructure health.
  • Investigate and resolve incidents, performance issues, and service disruptions.

Skills

AWS services
Incident response
Troubleshooting
Scripting
Linux/Windows systems
Communication
Problem-solving

Tools

CloudWatch
Datadog
Grafana
PagerDuty

Job description

About Us

Air-tek is a Canadian-based software company with a powerful suite of unique products that have already achieved a significant share of a huge global market. The product market fit is excellent, and customers are lining up to buy. Although our global customers know us, we intentionally operate in stealth mode during this growth phase.

Our diverse team shares a passion for solving complex problems, a drive to innovate, and a desire to create a passenger-centric travel industry. Based in Toronto, our inclusive culture is built on trust, collaboration, delivering a great product, and continuous personal development. We love what we do, and we support the team around us.

About the Role

We’re looking for a Cloud Operations Engineer to join our Cloud Operations team and play a critical role in ensuring the reliability, performance, and scalability of our hosted applications. This role is ideal for someone who thrives in a fast-paced environment, enjoys solving complex technical problems, and is passionate about cloud infrastructure and operational excellence. You will be responsible for supporting production and staging environments, monitoring system health, responding to incidents, resolving support tickets, and maintaining cloud infrastructure primarily on AWS.

Key Responsibilities
  • Provide day-to-day operational support for production and staging environments
  • Monitor application and infrastructure health using monitoring and alerting tools
  • Investigate and resolve incidents, performance issues, and service disruptions
  • Handle daily service tickets related to deployments, environment setup, and troubleshooting
  • Manage and optimize AWS infrastructure, including EC2, S3, RDS, ECS, VPC and IAM
  • Deploy application updates and patches following standard change management procedures
  • Automate repetitive tasks and improve operational efficiency through scripting and tools
  • Participate in an on-call rotation to provide 24/7 support coverage for production systems
  • Collaborate closely with Development, Support, Delivery and Security teams to maintain system reliability and compliance
  • Document configurations, runbooks, and operational processes
  • Help establish and follow processes and checklists that ensure consistent results for routine tasks, and automate those tasks where appropriate

Required Skills and Experience
  • 3+ years of experience in Cloud Operations, DevOps, or IT Infrastructure Support
  • Strong hands-on experience with AWS services, including EC2, S3, ECS, RDS, CloudWatch, and networking (VPCs, subnets, routing, security groups)
  • Solid understanding of monitoring, alerting, and incident response (e.g., CloudWatch, Datadog, Grafana, PagerDuty)
  • Ability to troubleshoot application, OS, and network-level issues in production environments
  • Experience managing Linux and/or Windows-based systems
  • Strong knowledge of scripting or automation (e.g., Bash, PowerShell, or Python)
  • Understanding of security best practices and IAM management in AWS
  • Excellent communication and problem-solving skills
  • Willingness to participate in an on-call rotation for production support