type: evals_registry entry_prefix: EVAL schema: id: EVAL-XXX date: YYYY-MM-DD output: string (what was produced) method: string (how it was evaluated - manual read, test, benchmark, user feedback) anomalies: list of strings (what was wrong, missing, surprising) action: [keep | correct | deprecate] rules:

Log an eval whenever you validate the quality of something Claude produced (report, audit, plan, generated code).
Action keep - the output is fit for purpose as-is.
Action correct - needs revision; capture what.
Action deprecate - the approach itself is flawed; link to the decision that replaces it. ---

Evals registry (EVAL)

Index

ID	Date	Output	Action
EVAL-001	2026-04-23	`.claude/` restructure plan (ship-feature STEP 2)	keep

EVAL-001 — `.claude/` restructure plan

Date: 2026-04-23
Output: 21-task plan to migrate tasks/ to .claude/tasks/ + create .claude/memory/ + .claude/audits/ + integrate CAPITALIZE across 5 skills + add /close skill.
Method: manual review of the 5 impacted skills/agents; verified rtk is path-agnostic; confirmed ~/.claude/CLAUDE.md symlinks to the project (single file to edit). Radical-honesty check on the session-close ritual: confirmed aspirational without skill integration → scope expanded to Option D.
Anomalies: none blocking. Note: tasks/LESSONS.md was empty (101B, header only) — migration to learnings.md is symbolic.
Action: keep — plan validated, ready for execution.

evals.md 1.5 KB Verlauf Originalformat

Evals registry (EVAL)

Index

EVAL-001 — .claude/ restructure plan

evals.md 1.5 KB

Verlauf Originalformat

EVAL-001 — `.claude/` restructure plan