Enable job alerts via email!

Data Network Engineer – SRE, Telemetry, Observability, Monitoring & Performance

La Fosse Associates

City Of London

On-site

GBP 60,000 - 90,000

Full time

23 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A tech recruitment agency is seeking a Data Network Engineer to enhance the observability of a complex network infrastructure. The ideal candidate should have expertise in telemetry tools and cloud-native architectures, along with proficiency in programming languages like Python, Go, or Java. Design and implementation experience in observability and monitoring systems is essential. This role focuses on optimizing network performance within high availability environments.

Qualifications

Experience in Network/Platform Observability or SRE roles.
Strong expertise in telemetry and observability tools.
Proficiency in at least one programming language and infrastructure as code.

Responsibilities

Collaborate to embed observability into the development cycle.
Design and implement telemetry pipelines for metrics.
Integrate and optimize observability tools.

Skills

Telemetry tools

Observability tooling

Programming (Python, Go, Java)

Infrastructure as code tools

Cloud-native architectures

Tools

OpenTelemetry

Prometheus

Grafana

Splunk

Elastic

Data Network Engineer – SRE, Telemetry, Observability, Monitoring & Performance

Seeking a Network Engineer with experience of Telemetry, Observability, Monitoring & Peformance, ideally within a high availability Network Infrastructure Site Reliability Engineering environment. The network strategy is highly focused towards Next-Gen, Software Defined Networking and in this role you you will work at the intersection of software engineering, Networks SRE & platform operations & engineering, with the ulitmate aim of developing actionable insights from telemetry data and enhancing the value of observability tooling.

Previous experience might include:

Collaborate cross-functionally to ensure observability is embedded into the SDLC & CI/CD pipelines.
Designing & implementing telemetry pipelines for metrics, logs, traces, and events.
Developing observability standards, NMS tooling, dashboards, alerting frameworks, and SLOs.
Integrating & optimising observability tools such as OpenTelemetry, Prometheus, Grafana, Splunk & Elastic.

This role will require:

Having previously worked within Network/Platform Observability, Networks SRE, or Platform Engineering roles within complex, distributed environments.
Strong expertise with telemetry tools such as OpenTelemetry, Prometheus, Grafana, Splunk, Elastic, Loki, Jaeger, or similar.
Proficiency in at least one programming language (e.g., Python, Go, Java) and infrastructure-as-codetools (e.g., Terraform, Helm).
Deep understanding of cloud-native architectures (Kubernetes, microservices, service meshes).

Highly desired:

Industry experience such as the following Media/Streaming, High Frequency Trading e.g. Investment Banking, Online Gaming, Hyperscalers, High Availability, Low Latency Network Infrastructure

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.