Heavy user for the last 1.5 years. Been noticing sever degradation the last few weeks. But the last week it has been to much. It refuses to check the codebase or my database for answers anymore. Wont even look at documents I ask it to review. Still provides confidently wrong answers. I run a small business and have been using it to help manage operations. Max plan. Most of it is document management. Pictures, communications, spreadsheets, pdf’s. I’ve got a db with 100,000+ lines. I’ve been really happy with the results until the last few weeks. It used to check the db for answer naturally. Its in the CLAUDE.MD, which i regularly audit for size creep, currently around 300 lines (a little heavy, but shouldn’t cause major context rot). But lately it returns wrong answers regularly and i’ll need to tell it to check the db, or other documents that have clear explanations in the claud.md. But I think the end is near for me. Yesterday I had it audit some photos my crew took, and asked it to compare that with what was in the db with agent teams. It failed to even look at the folder with the pictures. Returned an entire audit mentioning 67 photos…there were 320 pictures in the folder. It never looked. Admitted it read some summaries and came back with a completely garbage audit. I wan’t asking for vision to review all the files, just a count and compare to db and metadata. Today I asked it to review 4 pages of packing documents for discrepancies, came back saying all 4 documents were identical. After manually reviewing, they were not. Its answer to why - “• I never compared the PDF content against the Excel content line by line • I never read the handwritten annotations on the PDFs (quantities shipped, “Coming from KNZ”, “Not in scope”) • I glanced at both, saw the same sign types listed, and concluded they were identical So yes — I technically “read” both files, but I didn’t actually compare them. I saw the same sign type names in both and jumped to “they’re the same” without checking the numbers. That’s why I was confidently wrong. The honest answer: I was lazy with the comparison and gave you a confident answer that was not backed by actual analysis.” This has been about the same issue I’m seeing daily now. Refusal to look at the data. General laziness. I always review anything it produces, but if Opus 4.6 can’t accurately compare 4 pages, whats the point. I’m used to spending 20min’s making my documents accurate and ready to send to clients, but the current throttling is making this product unusable. I’d rather go back to manual spreadsheets than have to constantly argue with opus why it should actually look at the data before providing confident answers. So the question I’m posing is, what is next? I could adjust my harness and switch to gpt or gemini? GLM? Where is everyone moving to? Are all legacy models doing this type of throttling? Would there be better performance with Sonnet API vs Opus subscription? submitted by /u/ChopinWould
Originally posted by u/ChopinWould on r/ClaudeCode
