We should make a tracker to report when claude and other provider models are noticeably degraded

www.reddit.com

eifachposteMB to AI (Reddit RSS) · English · 14 hours ago

Just a quantified, report-based system that records geographic information and subscription tier, so that we can collect statistics on how provider quality varies with time of day and with the point in the release cycle. It would also let us check the current model quality before wasting time (and tokens) trying to corral a quantized model. I can't find this information anywhere apart from random complaints surfacing occasionally on Reddit, which for some reason doesn't seem like a reliable indicator.

Running a few obfuscated benchmarks at random times throughout the week would also give us metrics on how the provider models are actually doing, in a way that providers can't just dismiss as subjective. (It isn't subjective: I use the same set of prompts to open a development cycle, and some days the model just has no idea what to do with them and ultimately fails to do anything useful.)
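To make the idea concrete, here is a minimal sketch of what a single report and a time-of-day summary could look like. Everything in it is an assumption for illustration: the `QualityReport` fields, the 1 to 5 score scale, and the `mean_score_by_hour` helper are hypothetical names, not part of any existing tracker or provider API.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime, timezone
from statistics import mean


@dataclass(frozen=True)
class QualityReport:
    """One user-submitted report (hypothetical schema)."""
    model: str              # free-form model name as the reporter sees it
    region: str             # coarse geographic info, e.g. a country code
    tier: str               # subscription tier as reported by the user
    submitted_at: datetime  # timezone-aware timestamp
    score: int              # assumed scale: 1 (unusable) to 5 (normal)


def mean_score_by_hour(reports, model):
    """Average score per UTC hour of day for one model."""
    buckets = defaultdict(list)
    for report in reports:
        if report.model == model:
            hour = report.submitted_at.astimezone(timezone.utc).hour
            buckets[hour].append(report.score)
    # Sort by hour so the summary reads chronologically.
    return {hour: mean(scores) for hour, scores in sorted(buckets.items())}
```

The same grouping could be keyed on `region` or `tier` instead of the hour. Scores from scheduled benchmark runs could be stored in the same shape, which would let the subjective reports and the benchmark results be compared side by side.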

Originally posted by u/probablyblocked on r/ClaudeCode
