Enable job alerts via email!

Operations Support Engineer

Vodafone Group

Gauteng

On-site

ZAR 700 000 - 900 000

Full time

Today
Be an early applicant

Job summary

A leading telecommunications company in South Africa is seeking an experienced Operations Support Engineer to ensure operational excellence in Big Data and Machine Learning platforms. Candidates should have at least 5 years of experience in support operations and knowledge of cloud technologies like AWS, GCP, and Azure. This role offers a competitive benefits package, including health care and discounts on services.

Benefits

Retirement funds
Medical aid benefits
Cell phone and data benefits
Exclusive staff discounts

Qualifications

  • 5+ years of experience in Big Data and ML operations support.
  • Strong communication and collaboration skills required.
  • Experience with cloud-based data technologies essential.

Responsibilities

  • Ensure operational excellence across production systems.
  • Monitor and maintain health of production data and ML pipelines.
  • Implement and manage ITIL processes effectively.

Skills

Big Data operations support
Machine Learning operations
AWS
GCP
Azure
Python
SQL
Kubernetes
Docker
CI/CD

Education

Bachelor's degree in computer science, engineering, or related field

Tools

Prometheus
Grafana
Postgres
AWS tools
CloudWatch
Kinesis
Job description
Overview

When it comes to putting people first, we're number 1. The number 1 Top Employer in South Africa.

Certified by the Top Employer Institute

  • .Role purpose / Business unit The job role for Operations Support Engineer is: To ensure operational excellence across all production systems supporting Big Data and Machine Learning platforms.
Responsibilities

This role is responsible for maintaining system reliability, managing incidents, changes and delivering ITIL-aligned support services to ensure seamless platform performance and user satisfaction. Production Operations & Support: Monitor and maintain the health of production data and ML pipelines, platforms, and services. Perform root‑cause analysis and resolution of production incidents. Manage and coordinate incident response, escalation, and communication. Ensure timely resolution of support tickets and service requests. ITIL Service Management: Implement and manage ITIL processes including Incident, Problem, Change, and Release Management. Maintain service documentation, runbooks, and operational procedures. Participate in CAB (Change Advisory Board) reviews and ensure compliance with change protocols. Operational Excellence: Drive continuous improvement initiatives to enhance system reliability and performance. Collaborate with DevOps, MLOps, and Platform Engineering teams to automate operational tasks. Track and report on SLAs, SLOs, and KPIs for operational services. Monitoring & Observability: Set up and maintain monitoring, alerting, and logging systems. Ensure visibility into system performance and proactively identify issues. Support observability tooling and dashboards for platform health. Stakeholder Engagement: Act as the first point of contact for production‑related issues. Liaise with internal teams and external vendors to resolve operational challenges. Provide regular updates and reports to leadership on operational status and risks.

Qualifications

A bachelor's degree in computer science, engineering, or related field. 5+ years of experience in Big Data, ML operations support. Experience with cloud‑based data technologies such as AWS, GCP or Azure. 5+ years of overall IT experience with Big Data, advanced analytics, data warehousing and business intelligence. Relevant cloud certification at professional or associate level would be advantageous. Strong communication and collaboration skills. Agile exposure, Kanban, or Scrum. Core competencies, knowledge, and experience: In‑depth knowledge of data as a product & information best practices. Experience in using a wide range of data tools such as AWS services – S3, SFTP, Glue, EMR (Spark), Airflow, Athena, CloudWatch, CloudTrail, KMS, Kinesis, OpenSearch, etc. Strong understanding of ITIL frameworks and service management. Experience in production support for data platforms, ML systems, or cloud infrastructure. Familiarity with monitoring tools (e.g. Prometheus, Grafana, Postgres). Knowledge of incident and change management workflows. Excellent troubleshooting, communication, and documentation skills. Working experience with Cloud platforms such as AWS and GCP. Working experience with Kubernetes and Docker containers. Working experience with CI/CD, IaC and DevOps tools such as CDK, Code Repos, etc. Strong programming skills in Python and SQL.

We make an impact by offering enticing incentive programs and competitive benefit packages: retirement funds, risk benefits, and medical aid benefits; cell phone and data benefits, fiber connection discounts, and exclusive staff discounts offered in collaboration with partner companies.

Closing date for applications: 06 November. The base location for this role is Vodacom, Midrand Campus. The company's approved Employment Equity Plan and Targets will be considered of the recruitment process.

As an Equal Opportunities employer, we actively encourage and welcome people with various disabilities to apply.

Vodacom is committed to an organisational culture that recognises, appreciates, and values diversity & inclusion.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.