Original Reddit post

https://preview.redd.it/w9vsvcvbwf2h1.png?width=640&format=png&auto=webp&s=0794afc6154be4b284ce85e686674349c64f2dbc So Google announced Gemini 3.5 Flash this week. I was looking over the Artificial Analysis numbers and the cost jump is pretty crazy. It’s basically 5.5 times more expensive to run than the older 3.0 Flash model. They tripled the input token price to $1.50 per million, and output tokens are sitting at $9.00 now. The weirdest part is that 3.5 Flash takes a lot more steps to handle complex tasks. It averages around 49 steps compared to just 23 for 3.1 Pro, so in practical terms it actually ends up being about 75% more expensive to run than the heavier Pro model. It is really fast though, pumping out 280 tokens a second which is a 70% speed bump. On the benchmark side it scored a 55 on the IQ index, beating out Grok 4.3 and Claude Sonnet 4.6, but its coding is still kind of weak at a 45. At least hallucinations dropped by 31 points down to 61%. Honestly this seems to be a trend everywhere right now. OpenAI’s GPT-5.5 is 50 to 90% more expensive than their last one, and Claude Opus 4.7 is up by 30 to 40% too. Basically the whole market is shifting towards these autonomous multi-step systems and they just eat up massive amounts of compute. Definitely going to force everyone to rethink their API budgets and how they handle AI spending going forward. submitted by /u/andrewaltair

Originally posted by u/andrewaltair on r/ArtificialInteligence