Original Reddit post

I work a lot with browsers when it comes to giving visual context to LLMs. The usual workflow was: take a screenshot → upload it to my IDE → prompt the context. That works fine , until you’re clicking 1,000 screenshots a day. Eventually they pile up in storage, and ironically, storage costs keep skyrocketing. So I decided to make my life easier. I built agent-vision Github Repo -> https://github.com/kedarvartak/agent-vision NpmJS Package -> https://www.npmjs.com/package/agent-vision-mcp agent-vision is a vision layer between your development environment and your browser. It gives LLMs live browser context - not just screenshots or layouts, but: URL DOM Element attributes Network events Viewport size Console logs Tab title No more constantly switching tabs for browser tasks. submitted by /u/Desperate_Title1595

Originally posted by u/Desperate_Title1595 on r/ClaudeCode