eifachposte

eifachposte

Opus 4.8 launched May 28. One change matters more than the rest for how much you can trust the output: it’s four times less likely to give you a confident answer that’s quietly wrong. In Anthropic’s testing it scored 0% on uncritically reporting flawed results. Previous versions would generate something plausible, present it cleanly, and you’d only find the problem later when you went to use it. This version flags its own uncertainty and pushes back on flawed logic before you’ve invested time in it. This prompt uses that change directly. Run it on anything important before you rely on it: You just produced [the answer / plan / document above]. Before I use this, review it critically. - What are the weakest parts? - Where did you make assumptions that might not hold? - Is there anything here that sounds confident but is actually uncertain? - What should I double-check before I rely on this? Be direct. I’d rather know the problems now than discover them later. On previous versions this produced reassurance with minor caveats. On 4.8 it produces genuine self-critique, because the model is now actually calibrated to flag where it’s uncertain rather than smoothing over it. The broader shift this signals: AI is moving from a tool that produces confident output you have to verify, to a collaborator that tells you what it’s unsure about. That’s a more useful relationship and a more trustworthy one. I wrote up all four changes in the new Claude and 30 specific prompts that take advantage of each, in a doc here if it helps. If you do one thing, run the prompt above on the last important thing Claude produced for you. The difference in what it flags is the clearest way to feel what changed. submitted by /u/Professional-Rest138

Originally posted by u/Professional-Rest138 on r/ArtificialInteligence

The new Claude scored 0% on "confidently reporting wrong answers" in testing. Here's a prompt that takes advantage of it on anything important.

The new Claude scored 0% on "confidently reporting wrong answers" in testing. Here's a prompt that takes advantage of it on anything important.