How do I know? Because Claude is dumb AF today. I feel like an old man with arthritic knees that can predict the rain. Every time they’re gearing up for a new model release, Claude gets lobotomized. If you’re trying to build a business that has workflows that rely on these services, it’s a massive entry for your SOC-2 risk assessment that it’s virtually impossible to expect a baseline consistency from your model selection. Not only are the platforms themselves deeply unreliable from an availability perspective, but when the model performance itself changes on the whim of some knob tuning by the service provider, that’s a serious issue for companies trying to build on top of these services. My results can be pitch-perfect one day, and a total disaster the next, without having changed a thing. submitted by /u/duerra
Originally posted by u/duerra on r/ClaudeCode
