UPDATE (April 11, 11pm): Independent replication with 11,505 API calls over 7 days confirms fallback-percentage: 0.5 is completely fixed — zero variance, not time-based, not peak/off-peak, not load-based. Fixed per-account parameter. New finding: 14% of calls had the weekly quota as binding constraint, not the 5h window. Also: some Max 5x accounts have overage-status: allowed while mine has overage: rejected + org_level_disabled. Same plan, different treatment, zero transparency. CORRECTION: overage: rejected is a user-controlled billing setting, not account-level targeting. My mistake — I had overage disabled myself. The fallback-percentage: 0.5 finding stands independently. Original post: Frustrated with hitting limits on my Max 5x plan (€100/month), I set up a transparent API proxy using claude-usage-dashboard to intercept all requests between Claude Code and Anthropic’s servers. Every single request — on both my Max 5x account AND a brand new Pro free trial account — contains this hidden header: anthropic-ratelimit-unified-fallback-percentage: 0.5 Additionally found a Thinking Gap of 384x — effortLevel: “high” in settings.json causes thinking tokens to consume 384x more quota than visible output, completely invisible to users. Full proxy data: github.com/anthropics/claude-code/issues/41930#issuecomment-4229683982 EU users: this likely violates consumer protection law. submitted by /u/Major_Sense_9181
Originally posted by u/Major_Sense_9181 on r/ClaudeCode
