greyhaven-ai

grey-haven-incident-response

"Handle production incidents with SRE best practices including detection, investigation, mitigation, recovery, and postmortems. Use when dealing with production outages, SEV1/SEV2 incidents, creating postmortems, or updating runbooks."

greyhaven-ai 28 3 Updated 5mo ago

Resources

3
GitHub

Install

npx skillscat add greyhaven-ai/claude-code-config/grey-haven-incident-response

Install via the SkillsCat registry.

SKILL.md

Incident Response Skill

Handle production incidents with SRE best practices including detection, investigation, mitigation, recovery, and postmortems.

Description

Production incident response following SRE methodologies with incident timeline tracking, RCA documentation, and runbook updates.

What's Included

  • Examples: SEV1 incident handling, postmortem templates
  • Reference: SRE best practices, incident severity levels
  • Templates: Incident reports, RCA documents, runbook updates

Use When

  • Production outages
  • SEV1/SEV2 incidents
  • Postmortem creation
  • Runbook updates

Related Agents

  • incident-responder

Skill Version: 1.0