Original Reddit post

A couple of months ago I did some bulk data enrichment - taking non structured website copy for some facilities / organisations and saving the data into searchable, filterable format. Things like get the min age, services provided, outdoor area description - with source quotes for human in the loop review Today I had to add a new facility, and I’m on the Max plan, so I’m using Opus by default. It was incredibly infuriating at this task. It kept trying to assume certain things, tried to extrapolate new data out of thin air, kept trying make moves to expand the scope into other directions. Keeping it on task / tamed it was more work than the actual work I needed to do. All of this in-spite of a very strict starting prompt that was honed over months. Opus is amazing for some things. But when it came time to shut up and do fairly simple work Sonnet won. I really get the feeling with all the reinforcement learning Anthropic has been doing, they really have no idea how it will impact what comes out on the other side and their just rolling the dice. submitted by /u/junlim

Originally posted by u/junlim on r/ClaudeCode