Any
Vacancy
1 Vacancy
Job Description
- Operate and monitor the 24/7 Command Centre for IT services.
- Ensure uninterrupted functionality of monitoring and observability tools.
- Perform system inspections, emergency repairs, and incident escalation.
- Hand over and take over shift duties with detailed reporting.
- Provide inputs for incident reports and historical data logs.
- Assess, validate, and escalate major incidents as per process.
- Coordinate recovery actions and provide updates during incidents.
- Participate in Root Cause Analysis (RCA) and track improvement plans.
- Collaborate with IT teams and vendors throughout the incident lifecycle.
- Lead crisis management calls and maintain war-room coordination.
- Maintain knowledge base and monitoring tool libraries.
- Monitor IT services using tools such as Dynatrace and Grafana, and run predefined recovery processes.
- Log issues, manage follow-ups, and ensure SLA compliance.
- Ensure compliance with Risk Management and Audit standards.
Desired Candidate Profile
- Strong understanding of:
  - Performance monitoring and observability tools (e.g., Dynatrace, Riverbed NPM, SolarWinds, Grafana).
  - Programming languages.
  - Database management and SQL.
  - Structured and Object-Oriented system design.
  - SDLC and current IT technologies.
- Excellent communication and interpersonal skills.
- Strong documentation and report/MIS preparation skills.
- Ability to manage and lead crisis situations effectively.
- Readiness to work in shifts and under pressure.
Employment Type
Full Time
Company Industry
Department / Functional Area
- DBA
- Data Warehousing (IT Software)
Keywords
- SRE
- Dynatrace
- SolarWinds
- Grafana
- SQL
- Database Management