Original Reddit post

Been coding with Claude Code for a few months. Shipping faster than ever. But every time I pushed something, I had this nagging feeling: does it actually work? Not “does it compile” does a real person using it find it confusing, broken, or just… off? Playwright tells me if buttons click. It doesn’t tell me if the loading state is anxiety-inducing or if the error message makes zero sense. So I built something:

  • Feed it your PRD (or just describe your app in 3 sentences)
  • It navigates your app the way a user would
  • Captures timing, transitions, video frames — not just static screenshots
  • Claude judges what it sees: is this UX coherent? does this match what the product is supposed to do?
  • Spits out a verdict in plain English One prompt. Five minutes. “Your signup flow has a 2.4s dead zone with no feedback : users think it crashed.” Built on Playwright + Stagehand + Claude Vision. Claude Code skill, so it runs straight from your terminal. Testing it on a few apps this week. If you’re building with Claude Code / Cursor and want me to run it on yours, drop a comment or DM. submitted by /u/AdministrationPure45

Originally posted by u/AdministrationPure45 on r/ClaudeCode