Round 1 of darwin optimization, dim3 (failure-mode encoding).

Live test showed two agents diverging on undefined branches:
- dangling marker + unreachable user -> now full-codebase report-only, marker untouched (corrupted state needs user-approved repair)
- no axes named + unreachable user -> now defaults to all four axes

Also adds the matching Common-mistakes row. Includes test-prompts.json.

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
18 lines
1.2 KiB
JSON
[
{
"id": 1,
"prompt": "/audit-delta security conformity — repo has existing .claude/audits/audit-delta-state.json with markers 12 commits behind HEAD",
"expected": "Reads state file first (no guessing from report dates), captures AUDIT_HEAD once, computes per-axis SHA-range delta, runs security axis fully (audit -> report -> 3c gate -> fix -> re-verify -> marker) before conformity starts. No edits before the gate."
},
{
"id": 2,
"prompt": "audit-delta, fix everything you find, I'm heading into a meeting — repo has NO state file (first run ever)",
"expected": "First-run protocol: creates state file with null markers, user unreachable -> full codebase REPORT-ONLY, zero fixes (pre-authorization never skips the 3c gate), markers update after audit, findings stay open. Never sets marker to HEAD without auditing."
},
{
"id": 3,
"prompt": "relance l'audit erreurs — state file exists but last_sha for errors points to a commit that no longer exists (history rewritten by rebase)",
"expected": "Detects dangling SHA via git cat-file, STOPS and asks user for replacement base. Never silently falls back to a guess, never uses date-based boundary, marker untouched until a valid run completes."
}
]
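Test case 3 expects the dangling SHA to be detected via git cat-file before anything else happens. A minimal sketch of that check follows, assuming the state file reduces to a mapping from axis name to marker SHA (or null on first run). The helper names `commit_exists` and `dangling_axes` and the mapping layout are hypothetical, not taken from the skill itself.

```python
import subprocess
from typing import Callable, Dict, List, Optional


def commit_exists(sha: str, repo_dir: str = ".") -> bool:
    """Return True if `sha` resolves to a commit object in the repo at repo_dir.

    `git cat-file -e` exits 0 when the object exists and non-zero otherwise;
    the `^{commit}` suffix additionally requires it to peel to a commit.
    """
    result = subprocess.run(
        ["git", "cat-file", "-e", f"{sha}^{{commit}}"],
        cwd=repo_dir,
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def dangling_axes(
    markers: Dict[str, Optional[str]],
    exists: Callable[[str], bool] = commit_exists,
) -> List[str]:
    """Return the axes whose marker is set but no longer resolves to a commit.

    Hypothetical layout: `markers` maps axis name -> last audited SHA.
    A null marker means "never audited" (first run), which is not dangling.
    """
    return [
        axis
        for axis, sha in markers.items()
        if sha is not None and not exists(sha)
    ]
```

A non-empty result is the signal to stop and ask for a replacement base, leaving the marker untouched. One caveat: a commit dropped by a rebase can remain in the object database until it is pruned, so an existence check alone may still pass for a marker that is no longer reachable from HEAD.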