Original Reddit post

I’ve been experimenting with different approaches because I keep noticing what feels like context rot during long Claude Code sessions. Even before hitting the hard limit, I found mistake rates seemed to increase as conversations pushed past the 200k window. The default auto-compaction helps, but I’ve also found it can sometimes happen at awkward moments where important working context gets compressed away mid-task. I also did some calculations and noticed around the 180k tokens mark the 90% cache savings start to lose their benefit, and compact would be more effective for token efficiency. Because of that, I started experimenting with a different approach. I built a small open-source tool that connects to Claude Code through tmux and proactively manages compaction before sessions become excessively large. It sends helpful notes on what to preserve in the compact message automatically, and a useful continuation message. What is everyone else doing? submitted by /u/Sea_Cardiologist_212

Originally posted by u/Sea_Cardiologist_212 on r/ClaudeCode