Original Reddit post

Is it just me (please give me your opinion), or are the people who always cry about token consumption (not the limit) acting like Haiku 3.5? 🥀 I don't see how people burn 1M+ tokens in minutes and still get junk results. I have never EVER run into such issues UNLESS I chat for a long time and keep jumping between topics, etc., which is kinda valid, since the model then has so much RANDOM context that gets in the way of the result.

This is how I talk with Claude so that we are clear (assuming this is something new that I have no clue about):

1. Describe an idea I have and ask him to improve it or take care of the features.
2. Based on his output, we have a small discussion (usually 2 long prompts).
3. Talk about the technical implementation (which shit to use to develop this); I usually recommend things I already have on my PC.
4. Ask him to go one step at a time or to go all out, depending on- I don't even know 💀
5. If the problems are small hot fixes, I continue in the same chat; otherwise I start a new chat with clean context and test, maybe even continue there.

This is not how I do it 100% of the time, but you get the picture.

For the people who have actually faced this problem a lot: share your thoughts, process, workflow, etc., so we can stop running into this.

P.S.: I am not a vibe coder, BUT I do use AI for assistance.

Originally posted by u/FaintShadow_ on r/ClaudeCode