I had 93% of my max 20x usage left yesterday and my plan refreshes on Thursday, so I thought why not spin up agent teams again for a a major refactor. My set up was sonnet worker and opus code review per worktree, job board, and a spec reviewer. Detailed spec, a lot of work some of it none trivial but not a lot. This is the second time I’ve tried this with a big task and tbh it tends to not produce a working implementation. Opus just isn’t a very good gatekeeper. For this run I also had a 3 strikes and you’re out implementation, if sonnet failed on either code or spec review 3 times, fresh 4.7 “senior” gets spawned briefed by spec reviewer and finishes the task. This never happened. Spec wasn’t followed, tests made up to make it look like it was in the most blatant way, code review and spec review didn’t catch. Set the task off before I went to bed still going when I woke up. Absolute carnage. Just slop. Don’t get me wrong it was a big refactor, and after opus audited the actual work against the spec called out all the things that were missing, at which point I just set codex 5.5 xhigh in fast mode and got him to build it out. Not sure if this is even viable with current state. Whats been your experience submitted by /u/muad_dib_the_maker
Originally posted by u/muad_dib_the_maker on r/ClaudeCode
