saw this chinese team (InSpatio) just open sourced a realtime 3D world model similar to what world labs is building. got me thinking about why spatial intelligence matters most world models right now (genie 3, cosmos, runway) are basically fancy video generators. they predict the next 2D frame. but the physical world is 3D the problem: if a robot turns around and “forgets” the spatial layout behind it, it cant do complex navigation or planning. its like having amnesia every time you look away InSpatio’s approach is interesting. instead of predicting pixels, they build an actual 3D scene that persists. you can move around in it and the geometry stays consistent. no weird morphing or objects disappearing like in 2D video models runs on a single RTX 4090 which is wild. most world models need massive compute the tech uses “explicit anchors + implicit memory” basically gives the AI a coordinate system so it remembers where things are spatially. sounds simple but apparently this is hard to do what this enables: robots that can navigate complex spaces without getting lost consistent scene editing (change lighting and it updates everywhere not just one frame) unlimited generation time without degradation training data from 2D videos converted to 3D (solves the “not enough 3D data” problem) been testing some of these concepts in coding work actually. some tools maintain spatial context of your codebase so when they make changes in one file they know what breaks elsewhere. not the same as physical 3D but the principle of persistent spatial memory applies. tried this with verdent and it does help catch cascading bugs the bigger picture: if we want AI that actually understands and interacts with the physical world (not just generates videos of it) we probably need this kind of 3D native approach world labs raised at $5B valuation for basically this. now theres an open source version. could accelerate embodied AI development significantly project page: https://inspatio.github.io/worldfm/ github: https://github.com/inspatio/worldfm submitted by /u/Much-Movie-695
Originally posted by u/Much-Movie-695 on r/ArtificialInteligence
