We are looking for a highly skilled and experienced Senior Observability Engineer to join one of our leading clients across the EU.
The ideal candidate will have a deep understanding of observability tools, automation development, and cloud frameworks, with a specific focus on OpenTelemetry, Dynatrace, Grafana, Honeycomb, Gremlin, and cloud platforms such as AWS and Azure. This role is perfect for someone who thrives in an environment where both on-premises and cloud-based solutions are implemented and optimized.
You will play a key role in ensuring the performance and reliability of complex systems by implementing observability solutions, automating processes, and addressing intricate technical challenges.
Key Responsibilities :
- Lead the design, implementation, and management of observability solutions both on-prem and in the cloud, with a focus on OpenTelemetry frameworks.
- Collaborate with development teams to integrate observability practices into the software development lifecycle.
- Automate manual tasks and processes to optimize workflow and reduce human intervention.
- Develop custom instrumentation, metrics, and logs to enhance the monitoring capabilities of systems and applications.
- Troubleshoot and resolve complex system issues using observability tools to ensure high availability and optimal performance.
- Establish governance and standards for implementing observability across the organization.
- Stay current with the latest developments in observability tools, particularly those compatible with OpenTelemetry.
- Work across cloud platforms (AWS and Azure) to deliver robust observability solutions.
Requirements :
- Minimum of 7 years of experience in cloud, observability, or a related field.
- Proven expertise in implementing and managing OpenTelemetry solutions.
- Strong hands-on experience with cloud-based monitoring, troubleshooting, and observability tools.
- Proficiency in Kubernetes and cloud-based architecture, particularly within AWS and Azure environments.
- Hands-on experience in scripting, automation, and integrating observability tools within cloud and on-prem environments.
- Degree in Computer Science, Information Technology, or a related field.
- Strong communication skills, both written and verbal, with the ability to collaborate effectively in remote and on-prem settings.
Essential Qualifications :
- Expertise in Terraform and experience with infrastructure as code.
- Strong knowledge of DevOps practices and CI / CD pipelines.
- Experience in troubleshooting complex technical issues using observability tools.
- Demonstrated ability to lead and manage a team of developers.
- Ability to think innovatively and provide solutions to complex challenges.
Technologies / Skills :
- OpenTelemetry, Dynatrace, Grafana, Honeycomb, Gremlin
- AWS and Azure cloud platforms
- Terraform
- Scripting and automation languages (e.g., Python, Bash)
- CI / CD and DevOps practices
- Strong problem-solving and troubleshooting skills
J-18808-Ljbffr