Sharing a model router specifically built for Claude Code to let users configure which models power its main agent and subagents. Problems it solves: Claude Code’s API rates are significantly more expensive than subscription rates (perhaps 8-10x more). Opus is worth that money for hard tasks. But Sonnet and Haiku are overpriced when compared to open source models that are much better quality per dollar. Outages are common for Anthropic models. You can’t use OpenAI models inside of Claude Code. What it does: Rayline.ai lets you override Claude Code’s internal subagent model routing and route subtasks to open source and on-device models. You can configure your own routing rules, or use our ML to handle routing dynamically. We have a native Mac app that lives in your menu bar and lets you download on-device models like Qwen 3.6 and run subagents on-device via an MLX backend. Because Opus is “overseeing” the work of the subagents, the quality feels on par or better than using Claude Code with Sonnet as the main model while being much cheaper. My favorite way to use Rayline: I set Opus as the main agent, and I configure subagents to run on-device (I have an M4 Max 128gb so works very well). If there’s an Opus outage, I switch the main agent to use to OpenAI. Who it benefits: Any Claude Code user who is paying Claude Code’s API rates (e.g. enterprise plan or if you exceed your subscription limits). It makes costs more inline with the subscription rates. Costs: Our business model is the same as Open Router’s. You pay the inference providers’ API costs, and we charge a 7.5% mark-up on the API costs. In the early beta testing we’ve had, cost savings from Rayline vastly outweigh our markup. Our difference vs other routers (e.g. Open Router) is: We are built specifically for Claude Code model routing. We route at a subagent/subtask level. We support on-device routing. We have a built-in ML router trained specifically to route Claude Code subagent tasks. Its use is optional. Disclosure: My team and I built Rayline.ai We’ve been in private beta. We just released the public beta yesterday, so it’s hot off the press. We’d love feedback on it! submitted by /u/Turbulent-Key-348
Originally posted by u/Turbulent-Key-348 on r/ClaudeCode
