This is the DeepSWE benchmark release for Opus 4.8, and the xhigh setting seems to have reached parity with GPT 5.5. The cost is also not bad; finally a good, capable model from Anthropic. GPT 5.5 is still much more cost-effective and much more intelligent, though. Still kinda waiting for Mythos to eventually be released.
Originally posted by u/DepartmentOk9720 on r/ArtificialInteligence
