Multimodal AI models have a real-world feel to them. It's amazing that I can take a screenshot of a 3D reconstruction a…

By Bilawal Sidhu · AI

Multimodal AI models have a real-world feel to them. It's amazing that I can take a screenshot of a 3D reconstruction and pass the 3D camera trajectory to a multimodal model like Google's Omni. It is able to then synthesize a video, well, I guess in this case generate something that looks so photorealistic compared to the actual physical location, which is the Lodi Garden in New Delhi

Reel

View original

HomeResourceLoading…