I run a small Indian B2B SaaS called Trakkar. Year 4. ~10K hours/month tracked across customers. 4 in-house engineers including me. Last December I moved from Copilot-only to a parallel stack: Cursor for in-IDE edits, Claude Code in the terminal for repo-wide refactors and CI scripts. 5 months in. Sharing the honest cut because the founder posts I see on this sub are usually one extreme or the other. The numbers I actually measured (not vibes)
- PRs merged per engineer-week: up 31% (from 6.4 to 8.4 averaged across the team across 5 sprints).
- Median PR review time: down from 47min to 28min. (Most of the cut is AI generating the description + the diff summary, not the code itself.)
- Bug count in production for AI-touched code: roughly the same as hand-written. We tag commits, so this is checkable.
- Flaky test count: up. From 3 in Dec to 11 in April. We added a quarantine job because of it. Where I will keep using it forever
- Scaffolding a new module from a spec. Saves me ~2 hours every time.
- Writing the unit tests for code I already wrote by hand. The reverse direction (AI writes code, I write tests) does not work as well. Tests just reflect the AI’s wrong assumptions.
- Translating SQL to ORM calls. Sequelize syntax is awful and I refuse to memorise it in 2026.
- Summarising long PRs from teammates. The tl;dr block at the top of every PR description is now Claude Code output. Where I ripped it out
- Anything touching the payment flow. We use Razorpay. One AI-generated webhook handler silently swallowed a refund event last quarter. Not its fault, the spec was ambiguous, but the cost of that bug was 4 hours of customer support and a refund-of-refund. Hand-written from now on.
- Audit log writes. Same reason. The Indian Labour Codes are now operational. I am not going to let an AI write the immutable check-in record.
- Choosing what to build. AI is great at “write this.” It is terrible at “should we write this.” Backlog grooming stays human. The METR finding rings true Their controlled study said experienced OSS devs got 19% slower with AI even though they felt 20% faster. I see this on our team for the senior engineers on familiar code. They are faster without AI on stuff they have written before. Junior engineers on unfamiliar parts of the repo are the ones who actually compound. If you are a solo founder reading this: AI is a 10x multiplier on the work you already know how to do. It is a 0x multiplier on the work you do not. Spend the saved hours learning the unfamiliar parts of your own stack, not shipping more of the same. What is the one task you have removed from your AI workflow in the last 90 days, and why? submitted by /u/jainikpatel1001
Originally posted by u/jainikpatel1001 on r/ClaudeCode
You must log in or # to comment.
