Original Reddit post

After getting fed up with recent changes, I finally pulled the trigger on moving away from Claude’s native backend. I spent time bouncing between local models, OpenCode, OpenHands, Aider, and various Claude Code forks before landing on something I’m actually happy with. Here’s the thing though — Claude Code is still the best agentic IDE harness out there, bar none. So I didn’t replace it. The setup: Claude Code Router + DeepInfra I’m using CCR (Claude Code Router) — open source on GitHub — to intercept and reroute API calls away from Anthropic’s servers to my own backend. Nothing hits Anthropic. They can’t detect it, and even if they could, there’s nothing to act on. After a couple days of patching to make sure it survives Claude Code updates, I’ve got DeepInfra serving models directly into Claude Code. Here’s how I slotted the models:

  • Opus slot → DeepSeek V3 Pro
  • Sonnet slot → DeepSeek V3 (fast/flash variant)
  • Haiku slot → GLM Honest take after 2 weeks of daily use: The flash variant of DeepSeek V3 is quietly the best open-weight model available right now. It’s cheap to run, fast enough that it doesn’t break your flow, and in real coding work the gap vs. Claude Opus is smaller than you’d expect. If you’ve been hunting for a way to keep Claude Code without the Anthropic dependency, this is the move. Happy to answer questions. submitted by /u/Red0Adrenaline

Originally posted by u/Red0Adrenaline on r/ClaudeCode