Slava Elizarov
362 posts

@DoctorDukeGonzo
Staff Research Scientist @canva, ex-Unity | Generative models, Computer Graphics

Introducing VGGT (CVPR'25), a feedforward Transformer that directly infers all key 3D attributes from one, a few, or hundreds of images, in seconds! No expensive optimization needed, yet delivers SOTA results for: ✅ Camera Pose Estimation ✅ Multi-view Depth Estimation ✅ Dense Point Cloud Reconstruction ✅ Point Tracking Project Page: vgg-t.github.io Code & Weights: github.com/facebookresear…

Hugging Face Space demo: huggingface.co/spaces/cortwav… Source code: github.com/cortwave/Occlu…

Does 3D generation always have to be either slow or complex and data-hungry?🤔 We don’t think so! With Geometry Image Diffusion, we’re all about reusing (and recycling ♻️) what already works — making it faster and easier by reducing complexity and data needs 🚀(1/10)







