Original Reddit post

I have personally never installed this much shit to save tokens, but the theory seems sound. NOT MINE: https://github.com/sgaabdu4/claude-code-tips/blob/main/claude-code-tips.md

TL;DR: Five layers, each ripping tokens out at a different point in the pipeline.

1. Codebase Memory MCP trades file reads for a knowledge graph (99% off).
2. context-mode sandboxes large outputs and hands back a summary (98% off).
3. RTK compresses CLI output in place (60-90% off).
4. Headroom is an API proxy that compresses everything before it leaves your machine (47-92% off).
5. Caveman makes Claude itself talk less (50-75% off).

Hooks enforce the stack so Claude can’t slip back to the lazy path.

The source looks reasonable to me; it's just a stack of hacks scripted together for convenience. A lot of people here will already have one or more of these installed, but some will have none, so this is good for them. The repo only has 10 stars right now.
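To put the TL;DR percentages in perspective, here is a rough Python sketch of what each claimed range would leave behind from a 10,000-token payload. The percentages are the linked guide's claims, not something I measured, and the 10,000 figure and the function names are made up for illustration. Each layer works on a different kind of content (file reads, tool output, CLI output, API traffic, Claude's replies), so the numbers are shown per layer and are not multiplied together.

```python
# Claimed savings from the TL;DR, as (low, high) fractions of tokens removed.
# These are the guide's own numbers, not independent measurements.
CLAIMED_SAVINGS = {
    "Codebase Memory MCP": (0.99, 0.99),
    "context-mode": (0.98, 0.98),
    "RTK": (0.60, 0.90),
    "Headroom": (0.47, 0.92),
    "Caveman": (0.50, 0.75),
}


def remaining_tokens(original: int, saving: float) -> int:
    """Tokens left after removing the fraction `saving` (0..1) of `original`."""
    return round(original * (1 - saving))


def remaining_range(original: int, layer: str) -> tuple[int, int]:
    """Return (best case, worst case) tokens left for one layer."""
    low, high = CLAIMED_SAVINGS[layer]
    # A higher saving leaves fewer tokens, so the best case comes from `high`.
    return remaining_tokens(original, high), remaining_tokens(original, low)


if __name__ == "__main__":
    payload = 10_000  # hypothetical payload size, for illustration only
    for layer in CLAIMED_SAVINGS:
        best, worst = remaining_range(payload, layer)
        print(f"{layer}: {best}-{worst} tokens left of {payload}")
```

The wide ranges on RTK and Headroom are the thing to notice: at the low end of the claim, roughly half the payload still gets through.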

Originally posted by u/9gxa05s8fa8sh on r/ClaudeCode