Dubai
On-site
AED 120,000 - 200,000
Full time
30+ days ago
Boost your interview chances
Create a job specific, tailored resume for higher success rate.
Job summary
An established industry player is seeking a skilled observability engineer to design and enhance an observability platform for enterprise IT teams. In this pivotal role, you will develop monitoring frameworks, improve system stability, and lead efforts in observability across engineering teams. With a focus on automation and integration of third-party tools, you will ensure the reliability and performance of IT infrastructure. If you have a passion for cutting-edge technologies and a proven track record in observability solutions, this is an exciting opportunity to make a significant impact in a dynamic environment.
Qualifications
- 7+ years of experience in building enterprise observability solutions.
- Hands-on experience in full-stack monitoring and site reliability.
Responsibilities
- Design and build an observability platform for enterprise IT teams.
- Proactively monitor systems and improve operational efficiencies.
Skills
Enterprise Observability Solutions
Application Performance Monitoring (APM)
Scripting and Automation
Infrastructure Monitoring
Cloud Services (Azure)
CI/CD Concepts
Technical Requirement Gathering
AIOps
Education
Degree in Computer Science
Diploma in related discipline
Tools
Monitoring and Logging Frameworks
3rd Party Vendor Performance Tools
Job Responsibilities:
- Design and build an observability platform for all enterprise IT teams to consume.
- Develop and improve instrumentation for monitoring and logging the health and availability of services.
- Proactively monitor systems, networks, and applications to provide input in improving the stability, security, efficiency, and scalability of systems.
- Develop and maintain Monitoring and Logging Frameworks for the organization.
- Take personal ownership for the quality, reliability, and availability of global AFG IT infrastructure.
- Improve operational efficiencies via scripting, bots, and integrations.
- Design and develop tools for metric collection, analysis, and reporting.
- Educate and lead efforts to improve observability among all engineering teams.
- Liaise with Core Teams (Enterprise Architects, Lead Cloud Architects, other infra teams) to ensure alignment with approved standards and guidelines.
- Be part of a multidisciplinary Cloud Team which takes ownership of both new deployments and operational activities.
- Create documentation, SOPs and knowledge base articles as required.
- Integrate, implement, and utilize any 3rd party vendor performance tools along with participation in the technical assessments of 3rd party tools and vendors.
- Support and deploy observability tools and resolve performance related production problems.
- Socialize with the development team to ensure no monitoring gaps arise.
- Continuously improve and optimize observability platform design.
- Research emerging technologies and trends in Observability domain.
About You:
- A degree or diploma in Computer Science or related discipline.
- Minimum 7 years of experience building and maintaining enterprise observability solutions.
- Minimum 6+ years of hands-on technical working experience in testing, full-stack monitoring, observability, or site reliability.
- Knowledgeable in requirement gathering and rollout enterprise observability solutions. Experienced in identifying requirements, defining monitoring solutions, and implementing the same.
- Experience in Application Performance Monitoring (APM) and Infrastructure Monitoring for Different Hybrid Business Applications and Infrastructure, providing health and performance reports, developing AIOps rules, creating alerts, creating custom dashboards.
- Demonstrated experience in scripting and automation, orchestration tools, Infrastructure-as-Code and CI / CD concepts.
- Proven experience designing, deploying, and managing at least one leading Enterprise Observability solution along with legacy monitoring tools and cloud-native monitoring solutions.
- Proven experience with cloud capabilities and services, particularly for Azure Cloud Services.
- Exposure to Artificial Intelligence concepts, tools, and services, particularly in the field of Observability and AIOps.