Comment ‘3d’ to get the research. This is some quietly impressive work on making video world models actually controllab…
By Bilawal Sidhu · AI
Comment ‘3d’ to get the research. This is some quietly impressive work on making video world models actually controllable in 4D space. VerseCrafter lets you take an input image, use something like Blender to animate the 3D camera path and object trajectories, then uses that to condition generation. Scribbling in 2D feels so crude in comparison. The authors represent everything in a shared 4D world state - static background as a point cloud, moving objects as 3D gaussian trajectories. The…