Overview
We are seeking a highly skilled Lead OpenTelemetry Developer responsible for developing and maintaining scalable, efficient production management solutions.
This role involves code development, instrumentation of systems using both code-based and zero-code solutions, writing documentation, and promoting best practices around observability tools.
Responsibilities
- Partner with and support application developers by instrumenting systems using code-based or zero-code solutions.
- Design and build OpenTelemetry-based solutions that integrate with various monitoring platforms.
- Create clear documentation to guide developers on instrumenting applications with OpenTelemetry.
- Improve application observability by building dashboards and providing guidance on monitoring technologies.
- Educate teams on best practices related to OpenTelemetry, semantic conventions, and supported frameworks.
Qualifications
- Required Skills and Qualifications
- Programming Expertise: Proficiency in at least 3 of the following languages (and familiarity with others): Python, Java, Go, .NET, PHP, React.
- Technical Skills: Strong experience with Prometheus, Grafana, and observability platforms (e.g., Dynatrace, AppDynamics, Splunk, Amazon CloudWatch, Azure Monitor, Honeycomb).
- Hands-on knowledge of Java instrumentation techniques (e.g., bytecode manipulation, JVM internals, Java agents).
Benefits
- Strong understanding of reliability and production management, ensuring high availability and stability.
- Risk-aware mindset with awareness of key operational risks in financial services or large-scale enterprises.
- Commitment to continuous improvement by enhancing processes and systems proactively.
Others
- Strong background in system and software security (SSO, Kerberos, LDAP, Windows AD).
- Application of engineering principles to support scalable, efficient production management.
- Proven experience in automation, reducing manual work, and improving workflow consistency.