Enable job alerts via email!

(Live) Operations Engineer

T-Net British Columbia

Vancouver

Hybrid

CAD 75,000 - 100,000

Full time

Today
Be an early applicant

Job summary

A technology service provider in Vancouver seeks an Operations Engineer responsible for monitoring system health, investigating faults, and ensuring customer excellence. Candidates should possess a degree in a related field with at least 3 years of relevant experience, particularly in Linux environments. The role includes troubleshooting, operational acceptance procedures, and collaborating with cross-functional teams. This position offers competitive salary and benefits with flexible working arrangements.

Benefits

Competitive salary depending on experience
Stock options
Health benefits with EAP starting on Day 1
Collaborative team environment
WFH 3 days a week in Vancouver

Qualifications

  • Degree in Computer Science or related technical field.
  • Minimum of 3-years' experience supporting, developing and deploying large scale software systems.
  • Prior successful experience as operations or support engineer.
  • Solid experience in the use of Linux/Unix.
  • Deep understanding of internet and networking protocols (DNS, BGP).
  • Experience with caching and CDN (content delivery network) technologies.
  • Good understanding of video streaming protocols and technologies.
  • Analytical mind with excellent problem-solving skills.
  • Excellent written and verbal communication skills.
  • Excellent time management, decision-making, presentation, and organizational skills.

Responsibilities

  • Monitoring event queues to ensure system health.
  • Fault investigation and resolution as part of Major Incident teams.
  • In-depth troubleshooting of production issues.
  • Developing operational acceptance procedures for new solutions.
  • Investigating and delivering solutions for reported issues.
  • Escalation of unresolved issues and liaison with Operations & Engineering teams.
  • Continual evolution of managed services and operational procedures.
  • Deep dive analytics and metrics investigations.
Job description

Netskrt is growing! We are seeking an Operations Engineer based in Vancouver, BC, Canada or alternatively in the United Kingdom, Arizona, Washington, Oregon, or California to be part of the Live Operations team.

The Live Operations team oversees Netskrt's eCDN managed services which consist of three major components: intelligent content collection, staging and distribution; adaptive networking that leverages connectivity as and when available; and an edge cache that allows users to access the content they want locally, using the apps and subscriptions that they already have.

We are a highly motivated team dedicated to delivering products and services that improve customer experience when accessing internet video at the edge of the network. We have also developed a set of inter-related technologies targeting businesses that offer Wi-Fi to their customers but have limited bandwidth.

Your prime responsibility and priority are to ensure customer excellence. As an Operations Engineer you are responsible for monitoring and maintaining the health of Netskrt systems, investigating faults to resolution, and accepting new infrastructure and solutions.

The ideal candidate is somebody who enjoys solving problems and has a customer-centric mindset. You should be passionate not only about learning new technologies, but also about running systems and software in the real world. You must enjoy a close-knit team environment of shared responsibility, be a team player, and a self-starter.

The successful candidate will possess an outstanding record of professional experience and will thrive in an environment that demands accountability. You will be a key member of a team that understands the big picture perspective and instils a customer-first attitude. This role requires flexible hours depending on dynamic organisational needs which may include non-standard shifts, on-call rotation, and after-hours incident response.

Key Responsibilities:

As an Operations Engineer you are responsible for monitoring, supporting, and maintaining system health. Your mission is to ensure that our service is highly available to end users by investigating and resolving issues generated by customers, event management monitoring solutions, or internal channels.

  • Monitoring event queues to ensure system health
  • Fault investigation and resolution as part of Major Incident/swarm teams to resolve technical faults within SLA
  • In-depth troubleshooting of production issues to include occasionally joining live event bridges to troubleshoot with the Customer
  • Develop and execute operational acceptance procedures for new edge solutions to ensure infrastructure is deployed to Production with zero service impact
  • Investigate, deliver, and document solutions for reported issues
  • Escalation of unresolved issues and liaison with US and Canada based Operations & Engineering teams
  • Continual evolution of managed services and operational procedures to improve and maintain quality standards and resolution times
  • Deep dive analytics and metrics investigations as part of continual improvement initiatives to drive performance
Required Qualifications, Skills, Experience:
  • Degree in Computer Science or related technical field
  • Minimum of 3-years' experience supporting, developing and deploying large scale software systems
  • Prior successful experience as operations or support engineer
  • Solid experience in the use of Linux/Unix
  • Deep understanding of internet and networking protocols (DNS, BGP)
  • Experience with caching and CDN (content delivery network) technologies (Amazon, Limelight/Edgio, Akamai, Netflix, Fastly)
  • Good understanding of video streaming protocols and technologies
  • Analytical mind with excellent problem-solving skills
  • Excellent written and verbal communication skills
  • Excellent time management, communication, decision-making, presentation, and organizational skills
Desired Qualifications:
  • Demonstrated experience working in large, complex systems environments
  • Experience with monitoring tools, e.g., Zabbix, Nagios
  • Experience in system and server administration, large system deployments
  • Wide knowledge in networking, security, database and cloud systems
  • Solid experience in use of fault tolerant approaches in a large-scale distributed environment and high-performance systems
  • Knowledge of patch management, intrusion detection/prevention systems
  • Cloud computing and cloud technologies (AWS, OpenStack)
  • Competitive salary depending on experience
  • Stock options
  • Health benefits with EAP starting on Day 1
  • Collaborative team environment
  • Remote position in US, UK. WFH 3 days a week in Vancouver.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.