Opus 4.6 dropped and it’s noticeably more expensive. So I took Cursor (to provide same conditions to all models) and ran same prompt through 7 models - Gemini 3 Flash, Gemini 3 Pro, GPT 5.2, GPT 5.2 Thinking Extra High, Sonnet, Opus 4.5 and Opus 4.6. I simply applied auto-accept mode and waited for the model to finish the task First prompt was to exactly replicate the website by provided link GPT5.2 was the only one who matched the style, others implemented their own versions (completely different colors, fonts, style). Gemini did very light job and replicated only main page, others tried to replicate referenced pages. Reddit scraper to find business ideas I asked to build a website which scrapes reddit API to find buisness ideas for specified subreddits. For ideas analyses I told to use OpenAI api. Actually every model delivered something workable, GPT and both Opus were the best imo, they produced interesting clustering graph visualisation. Desktop app for video dubbing, only local LLMs allowed Gemini completely failed, nothing worked. Others delivered half workable results, but for GPT and Opus at least it looked like a solid desktop app. Final observations: Surprisingly, I didn’t notice any difference between Gemini 3Flash and 3Pro, they both delivered simple low quality results, but for cheap. GPT: took 30-60 min for every task to finish, always one of the highest quality, moderately expensive. Opus: 4.6 tends to do less mistakes than 4.5, but overall produces very similar results. Both Opus are the most expensive from the list. For some exercises it was worth it, for some dont Sonnet: Tends to do smth simple, but workable The conclusions I made for myself: if you know what you want to build exactly and can give the model good precise instructions - use Sonnet, it is capable of delivering what you ask. If you need research, analyses capabilities - use Opus, GPT If anyone’s interested, I recorded a video with full side-by-side comparison with all outputs. submitted by /u/ConsiderationOld9893
Originally posted by u/ConsiderationOld9893 on r/ClaudeCode
