debugging

"Systematic root cause investigation before proposing fixes. Triggers: bugs, test failures, unexpected behavior, 'why is this failing', 'not working', 'broken'. Do NOT use when: the bug is already identified and the fix is obvious — use /triage instead."

luan 4 3 Updated 5mo ago

Resources

GitHub

Install

npx skillscat add luan/dot-claude/debugging

Install via the SkillsCat registry.

SKILL.md

Systematic Debugging

Random fixes waste time + create new bugs.

Iron Law: NO FIXES WITHOUT ROOT CAUSE INVESTIGATION FIRST.

4 Phases

Phase 1: Root Cause Investigation

Read errors carefully — stack traces, line numbers
Reproduce consistently — exact steps. Intermittent? Add logging at component boundaries to capture state on next occurrence.
Check recent changes — git diff, new deps
Gather evidence at component boundaries
Trace data flow to source

Phase 2: Pattern Analysis

Find working examples in codebase
Read reference implementation COMPLETELY
List ALL differences
Understand dependencies, config, assumptions

Phase 3: Hypothesis + Testing

Form single hypothesis: "X is root cause because Y"
1b. Oracle test (when applicable): Compare current behavior against a known-good baseline to confirm the bug and isolate its scope. Options: available worktree from pool (git worktree list — detached HEAD = available), reference implementation, or reduced test case with known-correct output. Creating a new worktree is expensive — justify only when the bug is non-obvious and affects multiple components. Return worktree to detached HEAD when done.
3+ plausible hypotheses: STOP — spawn parallel subagents (one per hypothesis, each traces one, reports evidence for/against). At 3+ hypotheses, sequential testing wastes time because each test cycle is slow; parallel investigation covers more ground faster.
SMALLEST possible change to test
One variable at a time
Worked → Phase 4. Didn't → NEW hypothesis (if 3rd, parallelize per step 2)

Exit condition: If 3 hypotheses all fail, re-enter Phase 1 with broader scope — re-examine assumptions and widen the investigation boundary. If Phase 1 re-entry also stalls, escalate as architectural (see "3+ Fixes" below).

Phase 4: Implementation

Create failing test that reproduces the bug
Single fix — ONE change
Verify — test passes, no regressions
Fix doesn't work? < 3 attempts → Phase 1. >= 3 → architectural issue

3+ Fixes = Question Architecture

Repeated fixes without root cause understanding signal a design problem, not a surface bug. Continuing tactical patches compounds tech debt and masks the real issue.

![ "$CLAUDE_NON_INTERACTIVE" = "1" ] && echo "Document concern in task description, attempt one structural fix, surface to caller." || echo "STOP. Discuss with human before more fixes."

Red Flags → Return to Phase 1

Listed from most dangerous (zero investigation) to least:

Proposing solutions before tracing data flow — guessing, not debugging
"It's probably X" — hypothesis without evidence
"Add multiple changes, run tests" — shotgun debugging masks root cause
"Quick fix for now" / "Just try changing X" — deferral that compounds
"Skip test" — removing the only verification signal

debugging

Resources

Install

Systematic Debugging

4 Phases

Phase 1: Root Cause Investigation

Phase 2: Pattern Analysis

Phase 3: Hypothesis + Testing

Phase 4: Implementation

3+ Fixes = Question Architecture

Red Flags → Return to Phase 1

Categories

Install

Recommended Skills