Original Reddit post

I’ve been exploring the space of embodied AI agents: ones that go beyond text/voice to include visual 3D representations with real-time animations and expressions. Most agents remain disembodied, but a few platforms are experimenting with making them more “present” on the web.

Here are a few examples I’ve come across:

• three.ws: Browser-native approach using Three.js/WebGL. Quick 3D avatar creation from selfie or GLB, simple embedding as a web component, LLM integration, persistent memory, and some on-chain identity features on Solana. They recently announced an IBM Partner Plus collaboration for enterprise exploration.
• NVIDIA Digital Human AI Blueprint: Framework for creating photorealistic 3D avatars with support for Omniverse or Unreal Engine rendering. Integrates with NVIDIA ACE, Audio2Face for animations, and targets use cases like customer service and interactive experiences.
• RAVATAR: Platform for real-time interactive 3D AI avatars/digital humans. Focuses on lifelike expressions, multi-platform deployment (web, apps, holographic), and conversational capabilities for enterprise/workforce applications.

I’m interested in the community’s perspective on this direction, especially the technical side:

• How practical is running capable 3D + LLM agents client-side in browsers at scale? What are the main performance bottlenecks?
• Which enterprise use cases seem most viable (e.g., customer support, training/education, digital twins)?
• How much does visual embodiment actually improve user engagement or outcomes versus focusing purely on better reasoning, memory, and tool use?

Would love to hear about other projects, frameworks, or research in embodied/web-based agents. Is this area seeing meaningful technical progress, or is it still mostly frontend polish on top of existing LLMs?

submitted by /u/amu4biz

Originally posted by u/amu4biz on r/ArtificialInteligence