Original Reddit post

I built a Claude Code plugin that lets Claude see your screen, click, type, scroll, and interact with any desktop application. It’s called HandsOn . The problem it solves Claude Code can write your frontend, generate CSS, build entire UIs — but it has no idea what any of it actually looks like . It writes code blind and hopes for the best. If a button is misaligned or a modal is rendering wrong, you have to describe the problem in words and go back and forth. HandsOn closes that loop. Claude can look at what it built, spot visual bugs, and fix them — all in one workflow. What it can do Visual verification — Claude writes code, opens the app, screenshots it, sees what’s broken, fixes it. No more “the button is 2px off” conversations. GUI testing — Click through your app, fill forms, verify behavior end-to-end. Desktop automation — Automate any Windows application, even legacy apps with no API. Uses accessibility tree + OCR for precise targeting. Self-correcting clicks — If a click doesn’t produce a visual change, it automatically retries with offset positions. No more “click missed” dead ends. Window-scoped OCR — Target text within a specific window, not the whole screen. Coordinates are automatically corrected for high-DPI displays. Smart element targeting — Tries accessibility tree first, falls back to OCR automatically. Works across Qt, WPF, Electron, WinForms, and more. Install claude mcp add handson – npx -y @anthropic/claude-code-mcp-plugin-handson@latest Try it “Open my app in the browser, screenshot it, and tell me if anything looks off” “Fill out the contact form on localhost:3000 and submit it” “Open Notepad, type a test document, save it to Desktop” Status Alpha — Windows-first (macOS/Linux coming). Built and tested with Claude Code. Feedback welcome. Fun fact: HandsOn was used to post this very message. Claude navigated to Reddit, filled in the form, selected the flair, and submitted it. GitHub submitted by /u/3spky5u-oss

Originally posted by u/3spky5u-oss on r/ClaudeCode