
Want to combine SOTA 3D models like VGGT with video LDMs for 3D generation? Now you can! 🚀
Introducing VIST3A — we stitch pretrained video generators to 3D foundation models and align them via reward finetuning.
📄 arxiv.org/abs/2510.13454
🌐 gohyojun15.github.io/VIST3A






