Original Reddit post

I got tired of paying for Claude twice. I use Claude Code for all my coding needs. I also have a bunch of side stuff - Open claw agents, batch scripts, an eval harness, and continue in my editor that all wanted to hit Sonnet. Every one of those was burning API credits in parallel with my subscription. Same model, two bills, felt dumb. Meanwhile, my local LLM rigs were sitting idle because, honestly, Sonnet smokes anything I can run on 24GB of VRAM for the kind of work I do. I didn’t want a local model. I wanted local plumbing to a remote model I was already paying for. So I built ottelry https://www.npmjs.com/package/otterly What’s janky (because you’ll find out anyway) It piggybacks on your authenticated Claude Code session. That means it shares Claude Code’s rate limits. If you hit the 5-hour cap in Claude Code, Otterly’s down until it resets. Some weird OpenAI params (logit_bias, n>1, seed) are stubbed. First request after a long idle can be slow while the session re-auths. Subsequent requests are ~3–7ms overhead over raw API. Single user, your machine, your subscription. Don’t expose it publicly. Don’t resell. Don’t be that person. submitted by /u/Electrical-Ad-9808

Originally posted by u/Electrical-Ad-9808 on r/ClaudeCode