tldr; opus 4.8 was able to catch the mistakes 5.5-xhigh has been doing for past few days and one shotted everything I asked. It caught 5.5-xhigh was not actually doing meaningful work and instead was putting on a performance. (this is the best I can do to describe the “vibe” of the issue I’ve been having with codex past few weeks). an example are the tests written by gpt-5.5-xhigh in that a large bulk of it was just doing text based search on the result rather than executing the actual components. I’m also impressed that I have used very little weekly usage. 5.5-xhigh is not cheap either and that its been running past few days and opus 4.8 one shotted it in a few hours is noticeable. I don’t know if this is because there is some promotion going on (im not aware as i’ve not been on this sub for a while) or some optimizations due to the model. All I can say is bravo Anthropic, this makes me rethink using claude more and I can always use chatgpt pro and gpt image from it anyways now so first time I am thinking of downgrading codex and upgrading claude. submitted by /u/Just_Lingonberry_352
Originally posted by u/Just_Lingonberry_352 on r/ClaudeCode
