Original Reddit post

"are you sure? " doesn’t work. “cite your sources” gets you fabricated sources. “be honest about uncertainty” gets you vague hedging in confident prose. what works: give the model STRUCTURE for honesty , not pleading. 🟢 [VERIFIED] i checked 🟢 [KNOWN] documented public fact 🟡 [INFERRED ~80%] logical, not checked 🟡 [ASSUMED ~50%] working assumption 🟠 [GUESSED ~30%] pattern-match 🟠 [STALE] was true once 🔴 [UNKNOWN] refused to fabricate, hand off example response with the protocol active: 🟢 [VERIFIED] the dedupe function reads its own output. i traced lines 42-58. 🟡 [INFERRED ~80%] the fix closes the symptom. didn’t run tests. 🟠 [GUESSED ~40%] other functions might have similar bugs. pattern-match only. universal install (auto-detects your agent, works with 51+): npx skills add NagyVikt/agent-liedetector-skill or npx agent-liedetector-skill install Cost: ~200 tokens of system prompt. https://github.com/NagyVikt/agent-liedetector-skill drop your worst agent-fabrication story below 👇 submitted by /u/KennGriffin

Originally posted by u/KennGriffin on r/ClaudeCode