Site Reliability Engineer (Datadog)

Data Centrix

Johannesburg

On-site

ZAR 600 000 - 800 000

Full time

10 days ago

Job summary

A leading IT solutions provider in Johannesburg is seeking a Monitoring Engineer to support the design and optimization of Datadog monitoring solutions. The role involves collaborating with DevOps teams and configuring monitoring agents across environments. The ideal candidate has a degree in IT or Computer Science, and relevant experience, with a strong interest in observability practices and cloud-native technologies.

Qualifications

  • Must have experience with modern monitoring tools, ideally Datadog or similar.
  • Experience deploying and configuring monitoring agents is required.
  • 1-3 years of experience in a relevant field is essential.

Responsibilities

  • Support the design and optimization of Datadog solutions.
  • Work alongside teams to ensure observability.
  • Deploy and configure Datadog agents across environments.

Skills

Datadog Certified Fundamentals
Management of operations on virtualized infrastructures
Basic familiarity with cloud platforms
Knowledge of basic scripting
Strong interest in monitoring and DevOps

Education

Degree in Information Technology or Computer Science

Tools

Datadog
AWS
Azure
GCP
Docker
Kubernetes

Job description

Qualifications and Experience

  • Datadog Certified Fundamentals (must have)
  • Degree in Information Technology or Computer Science
  • Management of operations on virtualized and distributed infrastructures
  • Management of operations on environments with clustering, replication, and load balancing
  • ITIL Practitioner (V3) / ITIL Specialist (V4)
  • Windows Server: an advantage
  • 1-3 years of experience working with a modern monitoring / observability tool, ideally Datadog (or alternatives like Prometheus, Grafana, New Relic, or Dynatrace)
  • Experience in:
      • Deploying and configuring monitoring agents
      • Creating dashboards and monitors
      • Parameterizing tags and labels for proper data correlation
  • Basic familiarity with cloud platforms (AWS, Azure or GCP) and container environments (Docker / Kubernetes)
  • Experience working with Centreon: an advantage
  • Strong interest in monitoring, DevOps, SRE, or cloud infrastructure
  • Knowledge of basic scripting (e.g., Bash, Python) is a plus

Duties

  • Support the design, implementation, and optimization of Datadog monitoring solutions across infrastructure, applications, and services.
  • Work alongside DevOps, infrastructure, and application teams to ensure complete observability using custom dashboards, alerts, and tagging strategies.
  • Assist in the deployment and onboarding of new systems into the monitoring ecosystem.
  • Serve as the go-to person for building visualizations, improving signal-to-noise ratios in alerting, and aligning monitoring with business objectives.
  • Ideal for a young and motivated engineer looking to grow within observability and cloud-native monitoring.
  • Deploy and configure Datadog agents across various environments (cloud and on-prem).
  • Create and customize dashboards, monitors, and alerts for systems, services, containers, and applications.
  • Implement tagging strategies to organize, filter, and correlate metrics and logs effectively.
  • Integrate Datadog with various platforms (AWS, Azure, GCP, Kubernetes, Docker, etc.) to collect telemetry data.
  • Collaborate with developers, DevOps, and infrastructure teams to identify key business and system metrics to monitor.
  • Continuously tune and optimize monitors to reduce false positives and improve actionable alerting.
  • Document dashboards, alert logic, best practices, and knowledge for cross-team enablement.
  • Analyze incidents and outages post-mortem to identify monitoring gaps and enhance visibility.
  • Assist in evangelizing observability practices within the organization and contribute to monitoring as code efforts (e.g., Terraform for Datadog resources).
  • Stay up to date with new Datadog features and industry trends in observability and monitoring.