Enable job alerts via email!

Senior Incident Commander - Site Reliability Engineering - Remote, US

dynaTrace software GmbH

Waltham (MA)

Remote

USD 107,000 - 161,000

Full time

21 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a leading software company as an Incident Commander, where you will manage incidents and lead a global team to ensure best-in-class reliability. Your role is crucial in shaping incident response, coordinating high-severity incidents, and driving continuous improvement. This position offers a competitive salary and the opportunity to work with cutting-edge technologies in a dynamic environment.

Qualifications

  • Proven experience in incident management and SRE or Security Operations.
  • Strong technical background with the ability to understand complex systems.

Responsibilities

  • Manage high-severity incidents, leading response teams for timely resolution.
  • Train teams on incident response protocols and ensure readiness for critical incidents.

Skills

Incident Management
SRE
Security Operations
Communication
Teamwork

Job description

Your role at Dynatrace

We are strengthening our incident management team. You will be at the helm, managing incidents and leading the way. Your role at Dynatrace is crucial in ensuring best-in-class reliability and shaping incident response for our customers. Your detailed responsibilities in this new team will be:

Prepare for Effective Incident Response:

  • Response Coverage:Join a new global team of Incident Commanders coordinating incidents 24/7 in a follow-the-sun model
  • Training and Preparedness:Train teams on incident response protocols and ensure readiness for critical incidents
  • Process Improvement:Ensure our incident management process fits best-in-class, aligning with industry standards, company, and customer need

Navigate Critical Incidents with Success:

  • Incident Coordination:Manage high-severity incidents, leading temporary response teams to ensure timely resolution and minimal business impact.
  • Analysis and Mitigation:Coordinate the team to understand impacts, perform forensics, categorize and mitigate incidents, ensuring the right experts are engaged.
  • Communications:Ensure all personnel know their roles during incidents. Keep teams aligned and ensure regular updates to customers and internal stakeholders.

Continuously Learn and Improve:

  • Postmortem Management:Lead blameless postmortem sessions, reviewing incident response and resilience, and tracking execution of improvement actions
  • Metrics and KPIs:Define and track key metrics to measure the effectiveness of incident management and leverage them for data-driven improvement planning.
  • Customer Interaction:Prepare detailed postmortem write-ups for customers, providing clear and actionable insights. Monitor and report on SLAs.
  • Stakeholder Communication:Maintain a holistic view of production status and communicate updates to internal stakeholders and customers.
What will help you succeed
  • Proven experience in incident management and SRE or Security Operations, ideally within a SaaS environment.
  • Strong technical background with the ability to understand complex systems and troubleshoot issues.
  • Strong team player who stays calm and keeps the focus for the group in tough situations.
  • Excellent communication skills, both written and verbal, with the ability to convey technical information to non-technical stakeholders.
  • Experience with postmortem processes and continuous improvement methodologies.
  • Ability to work in a fast-paced, dynamic environment and manage multiple priorities.
  • Passionate about pushing the limits to operate a vast SaaS solution reliable and performant at scale!

Minimum Qualifications

  • Must be able to work in the US
Why you will love being a Dynatracer
  • Dynatrace is a leader in unified observability and security.
  • We provide a culture of excellence with competitive compensation packages designed to recognize and reward performance.
  • Our employees work with the largest cloud providers, including AWS, Microsoft, and Google Cloud, and other leading partners worldwide to create strategic alliances.
  • The Dynatrace platform uses cutting-edge technologies, including our own Davis hypermodal AI, to help our customers modernize and automate cloud operations, deliver software faster and more securely, and enable flawless digital experiences.
  • Over 50% of the Fortune 100 companies are current customers of Dynatrace.

The salary range for this role is $107,000 - $161,000. When determining your salary, we consider your experience, skills, education, and work location.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Incident Commander - Site Reliability Engineering - Remote, US

dynaTrace software GmbH

Remote

USD 107,000 - 161,000

21 days ago

Site Reliability Engineer

Leidos

Remote

USD 85,000 - 154,000

5 days ago
Be an early applicant

Senior Engineering Manager, SRE - Remote, US

Paylocity

Remote

USD 143,000 - 265,000

Yesterday
Be an early applicant

Senior SRE & Automation Engineer - Multi-Client Consultant

N8TiVe Consulting

San Francisco

Remote

USD 90,000 - 150,000

5 days ago
Be an early applicant

Quality Engineer

flex

Remote

USD 83,000 - 116,000

Yesterday
Be an early applicant

MQ Middleware Administrator - 1966

Cyrten

Alexandria

Remote

USD 120,000 - 180,000

6 days ago
Be an early applicant

Senior Software Engineer- SRE - (Remote - US)

Jobgether

Remote

USD 100,000 - 150,000

12 days ago

Data Engineer 4

Nike

Beaverton

Remote

USD 80,000 - 120,000

12 days ago

Regional Customer QE

Orange County Comptroller

San Jose

Remote

USD 79,000 - 110,000

12 days ago