audit-agents-md

Audits and refines a CLAUDE.md or AGENTS.md file for instruction density, staleness, and effectiveness. Use when reviewing or improving an agent instruction file, after significant project changes (skills, architecture, or tooling), when agent behavior suggests instructions are ignored or misinterpreted, when the file grows beyond ~30 instruction lines, or when the user says "review my AGENTS.md" or "audit agent instructions".

shuymn 1 Updated 4mo ago

Install

npx skillscat add shuymn/dotfiles/audit-agents-md

Install via the SkillsCat registry.

SKILL.md

Audit Agent Instruction Files

Audit a CLAUDE.md or AGENTS.md file against established best practices for agent instruction files, then propose concrete improvements.

Background

Agent instruction files go by different names depending on the tool:

AGENTS.md — tool-agnostic convention, the canonical source of truth
CLAUDE.md — Claude Code specific (often a symlink to AGENTS.md)
COPILOT.md, CURSOR.md, etc. — other tool-specific variants

A common setup is AGENTS.md as the real file with CLAUDE.md symlinked to it. This skill handles all of these transparently.

Arguments

Default (no args): auto-discovers the instruction file in the current project root. Checks for AGENTS.md, CLAUDE.md, .claude/CLAUDE.md in that order. Follows symlinks to find the canonical file.
Path argument: audits the specified file (e.g., /audit-agents-md ~/.claude/CLAUDE.md)
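The discovery order above can be sketched as a small lookup. This is a minimal illustration, not the skill's actual implementation; the function name `discover_instruction_file` and the `explicit` parameter are hypothetical.

```python
from pathlib import Path
from typing import Optional

# Candidate locations, in the order the skill checks them.
CANDIDATES = ("AGENTS.md", "CLAUDE.md", ".claude/CLAUDE.md")


def discover_instruction_file(
    project_root: Path, explicit: Optional[str] = None
) -> Optional[Path]:
    """Return the instruction file to audit, or None if nothing is found."""
    if explicit is not None:
        # A path argument takes priority over auto-discovery; "~" is expanded.
        path = Path(explicit).expanduser()
        return path if path.exists() else None
    for name in CANDIDATES:
        candidate = project_root / name
        if candidate.exists():
            return candidate
    return None
```

The returned path may still be a symlink; resolving it to the canonical file is a separate step.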

Symlink Awareness

Before auditing, resolve the file's symlink chain:

Check if the target is a symlink (readlink / ls -la)
If it is, identify the canonical (real) file
Audit and edit the canonical file, not the symlink
Report the symlink relationship to the user (e.g., "CLAUDE.md → AGENTS.md; editing AGENTS.md")
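As a rough sketch of the resolution steps above, the chain can be walked one hop at a time so that every intermediate link is available for the report. The helper names `resolve_canonical` and `describe`, and the `max_hops` guard, are assumptions made for illustration.

```python
import os
from pathlib import Path
from typing import List, Tuple


def resolve_canonical(path: Path, max_hops: int = 16) -> Tuple[Path, List[Path]]:
    """Follow a symlink chain; return (canonical file, links traversed)."""
    chain: List[Path] = []
    current = path
    while current.is_symlink():
        if len(chain) >= max_hops:
            raise RuntimeError(f"symlink chain too long or cyclic: {path}")
        chain.append(current)
        target = Path(os.readlink(current))
        # Relative link targets are relative to the link's own directory.
        current = target if target.is_absolute() else current.parent / target
    return current, chain


def describe(path: Path) -> str:
    """Build the one-line symlink report shown to the user."""
    canonical, chain = resolve_canonical(path)
    if not chain:
        return f"{path.name} is a regular file; editing it directly"
    hops = " → ".join(p.name for p in chain + [canonical])
    return f"{hops}; editing {canonical.name}"
```

For a `CLAUDE.md` that links to `AGENTS.md` in the same directory, `describe` produces the message format used in the example above.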

Audit Criteria

Evaluate the file against each criterion below. For each, assign a verdict: PASS, WARN, or FAIL.

Evidence-Based Guardrails

Use this evidence to prioritize recommendations:

LLM-generated context files reduced success rate on average while increasing cost by ~20-23%.
Agents strongly follow tool mentions in context files; naming a tool can materially increase tool usage.
Codebase overview sections were often redundant and did not consistently reduce time-to-relevant-files.

Therefore, prefer minimal, repository-specific requirements over broad guidance.

1. Density (target: 20-30 instruction lines)

Count lines that contain actual instructions (exclude blank lines, comments, headers). Every line competes for the agent's limited instruction-following budget.

PASS: ≤30 instruction lines
WARN: 31-50 instruction lines
FAIL: >50 instruction lines
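One way to make the count reproducible is sketched below. It assumes the file uses Markdown ATX headers (`#`) and HTML comments; a line starting with `#` inside a fenced code block would be miscounted as a header, so treat this as an approximation rather than the skill's definitive counting rule.

```python
import re

# Matches single-line and multi-line HTML comments.
HTML_COMMENT = re.compile(r"<!--.*?-->", re.DOTALL)


def count_instruction_lines(text: str) -> int:
    """Count lines that are not blank, not HTML comments, and not headers."""
    without_comments = HTML_COMMENT.sub("", text)
    count = 0
    for line in without_comments.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue
        count += 1
    return count


def density_verdict(instruction_lines: int) -> str:
    """Map an instruction-line count to the thresholds listed above."""
    if instruction_lines <= 30:
        return "PASS"
    if instruction_lines <= 50:
        return "WARN"
    return "FAIL"
```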

2. Inferability

Flag any instruction that the agent can infer from the codebase itself:

Directory structure descriptions (agent can run rtk ls / rtk find)
Full codebase overviews that mostly restate discoverable paths/files
Code style rules already enforced by linters (eslint, prettier, rustfmt, etc.)
Dependency lists (agent can read package.json, Cargo.toml, etc.)
Build/test commands that are standard for the framework (e.g., npm test for a Node project with no custom config)

Each flagged line is a candidate for removal.

3. Staleness

Check for instructions that reference:

Files, directories, or commands that no longer exist
Tools, libraries, or frameworks not present in the project
Workflows that contradict current project structure
Skills referenced in the file that are not installed

Verify by actually checking the filesystem — do not guess.
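A filesystem check along these lines could look like the sketch below. It assumes that paths appear as inline code (backticks) in the instruction file, and it uses a crude heuristic to decide what counts as a path; both are illustrative assumptions, and the name `missing_paths` is hypothetical. Commands, tools, and skills need their own checks.

```python
import re
from pathlib import Path
from typing import List

INLINE_CODE = re.compile(r"`([^`\n]+)`")


def missing_paths(text: str, project_root: Path) -> List[str]:
    """Return path-like inline-code spans that do not exist under project_root."""
    missing = []
    for span in INLINE_CODE.findall(text):
        # Heuristic: no spaces, and either a "/" or a trailing file extension.
        looks_like_path = " " not in span and (
            "/" in span or re.search(r"\.\w+$", span) is not None
        )
        if looks_like_path and not (project_root / span).exists():
            missing.append(span)
    return missing
```

Each returned span is only a candidate for the staleness report; home-relative paths such as `~/...` are not expanded here and would need separate handling.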

4. Actionability

Every instruction must be something the agent can act on. Flag:

Vague guidance ("be careful with...", "keep in mind...")
Aspirational statements ("we strive to...", "ideally...")
Context without directive ("this project uses X" without "therefore do Y")
Blanket "always do X" directives (full test suite, full lint/format, broad exploration) unless explicitly required by repo policy

5. Redundancy

Flag instructions that duplicate:

What is already in a skill's SKILL.md
What is already in another instruction file in the hierarchy (global vs project)
What is stated multiple times in different words within the same file
What is already captured in README/docs/CI scripts without adding repository-specific constraints

6. Requirement Cost Pressure

Context-file instructions should avoid adding unnecessary execution burden.

Flag lines that force extra work without clear repository-specific benefit:

Generic mandatory tool directives (e.g., "always use uv/pytest/ruff") with no project-specific reason
Long mandatory checklists that add broad exploration/testing unrelated to task scope
Multiple overlapping directives that likely increase steps/tokens without improving correctness

Verdict guideline:

PASS: All mandatory/tool-specific directives are repository-specific and justified
WARN: 1-3 low-value mandatory directives remain
FAIL: 4+ low-value mandatory directives, or a file dominated by mandatory checklists

7. Structure

Check for:

Section protection comments (HTML comments)
Maintenance notes (when to update this file)
Clear section boundaries
Symlink consistency (if CLAUDE.md exists alongside AGENTS.md, is it a symlink or a separate file with divergent content?)
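The symlink-consistency check can be sketched as a four-way classification. The labels (`single`, `linked`, `duplicate`, `divergent`) and the function name are hypothetical, chosen for this illustration.

```python
from pathlib import Path


def compare_instruction_files(agents: Path, claude: Path) -> str:
    """Classify how two instruction files relate to each other."""
    if not (agents.exists() and claude.exists()):
        return "single"  # Only one of the two files is present.
    if agents.samefile(claude):
        return "linked"  # Same underlying file, e.g. via a symlink.
    if agents.read_bytes() == claude.read_bytes():
        return "duplicate"  # Separate files that currently agree.
    return "divergent"  # Separate files with different content.
```

Under this scheme, `divergent` is the case to flag, and `duplicate` is worth a note because the copies can drift apart later.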

Process

Step 1: Read and Analyze

Resolve symlinks and identify the canonical file to edit
Read the target file
If the file is in a project, also read:
- Available skills (rtk ls the skills directory if a .claude-plugin exists)
- Project structure (top-level files and directories)
- README/docs and CI workflows to detect documentation and command redundancy
- Linter configs (.eslintrc*, .prettierrc*, rustfmt.toml, etc.)
- Global instruction file (~/.claude/CLAUDE.md or ~/.claude/AGENTS.md) to check for redundancy across levels
- Other agent instruction files in the same directory (check for divergent copies vs proper symlinks)
Count instruction lines (exclude blanks, comments, section headers)

Step 2: Score Each Criterion

For each of the 7 criteria, provide:

Verdict: PASS / WARN / FAIL
Evidence: Specific lines or findings
Recommendation: What to change (if not PASS)
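The per-criterion output can be held in a small record type, which also makes the "any WARN or FAIL" trigger for a rewrite explicit. The type and field names below are illustrative assumptions; the skill itself does not prescribe a data structure.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class Verdict(Enum):
    PASS = "PASS"
    WARN = "WARN"
    FAIL = "FAIL"


@dataclass
class CriterionResult:
    name: str
    verdict: Verdict
    evidence: List[str] = field(default_factory=list)
    recommendation: str = ""  # Left empty when the verdict is PASS.


def needs_rewrite(results: List[CriterionResult]) -> bool:
    """A rewrite is proposed if any criterion is WARN or FAIL."""
    return any(r.verdict is not Verdict.PASS for r in results)
```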

Step 3: Propose Rewrite

If any criterion is WARN or FAIL:

Draft a revised version of the file
- Keep only minimal requirements that are non-inferable and repository-specific
- Remove generic tool mandates unless backed by repository constraints
- Replace broad codebase overviews with only non-obvious navigation hints
Show a diff summary: what was removed, what was added, what was reworded
Present the rewrite options to the user:
- If AskUserQuestionTool is available, present options for:
  - Apply the full rewrite
  - Apply selectively (user picks which changes)
  - Keep current version
- If AskUserQuestionTool is unavailable, ask in a single message using QID labels and require one of:
  - Q1: APPLY_FULL
  - Q1: APPLY_SELECTIVE(<concise selection>)
  - Q1: KEEP_CURRENT
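For the fallback path, the user's reply can be validated against the three accepted forms. This is a minimal sketch assuming the reply is a single line in exactly the labeled format; the function name `parse_reply` is hypothetical.

```python
import re
from typing import Optional, Tuple

REPLY = re.compile(
    r"^\s*Q1:\s*(APPLY_FULL|KEEP_CURRENT|APPLY_SELECTIVE\((.+)\))\s*$"
)


def parse_reply(message: str) -> Optional[Tuple[str, Optional[str]]]:
    """Return (choice, selection) or None if the reply is not recognized."""
    match = REPLY.match(message)
    if match is None:
        return None
    if match.group(2) is not None:
        # APPLY_SELECTIVE carries the user's selection inside the parentheses.
        return ("APPLY_SELECTIVE", match.group(2).strip())
    return (match.group(1), None)
```

An unrecognized reply returns `None`, in which case asking again is safer than guessing the user's intent.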

If all criteria PASS: report the audit results and confirm no changes needed.

Step 4: Apply Changes

If the user approves changes:

Apply the approved edits to the canonical file (not the symlink)
Verify the updated file still meets all criteria (re-run the count)
Report the final line count and any remaining warnings

Anti-Patterns to Watch For

Do NOT introduce these when rewriting:

Catch-all sections like "Important Context" or "General Notes" — these become dumping grounds
Instructing the agent on how to think — focus on what to do, not how to reason
Over-compression that loses meaning — each line must still be independently clear
Moving essential instructions to comments — HTML comments are for maintenance notes, not instructions
Divergent copies — if both AGENTS.md and CLAUDE.md exist as separate files with different content, flag this and recommend consolidating to one canonical file with symlinks
Tool-list cargo culting — listing many tools/commands as mandatory without repository-specific necessity
Directory tree dumps — long path enumerations that are easily discoverable by the agent
