I know that using /caveman mode can reduce output tokens by around 75%, which is great for keeping usage down. But what about input tokens ? I know the obvious answer is “write shorter prompts” or “be more concise,” and yeah, this is probably a dumb question. 😅 What I’m wondering is whether there are any other techniques people use to reduce input token usage in Claude Code: Settings or modes that help? Ways to reduce context size without hurting performance too much? Better workflow patterns? Anything that automatically compresses or summarizes context? Basically, I’m trying to understand whether there’s anything beyond simply typing less. Curious what experienced users are doing to keep token consumption under control. submitted by /u/Mission-Dentist-5971
Originally posted by u/Mission-Dentist-5971 on r/ClaudeCode
