
To bring generalist intelligent robots to the real world, we have to overcome the data scarcity problem. At Rhoda, we are solving it by reformulating robot policies as video generation. Today, we introduce the Direct Video-Action Model (DVA)
Noah Snavely

@Jimantha
3D vision fanatic. Professor @cornell_tech & Researcher @GoogleDeepmind. He or they. https://t.co/m7Rs5xUFfG

Spatial reconstruction is a long-context problem: real scenes come with hundreds of images. But O(N²) transformer-based models don’t scale efficiently. Introducing: 🤐ZipMap (CVPR ’26): Linear-Time, Stateful 3D Reconstruction via Test-Time Training (TTT). ZipMap “zips” a large image collection into an implicit TTT scene state in a single linear-time operation. The state is then decoded into spatial outputs, and can be queried efficiently for novel-view geometry and appearance (~100 FPS). ZipMap is not only much faster (>20× faster than VGGT), but also matches or surpasses the accuracy of all SOTA models.






1/5 Humans are able to look at their surroundings and pinpoint their location on a map, even for totally new buildings. Can computer vision systems do the same? 🤖🗺️ We explore this in our #NeurIPS2025 paper - C3Po: Cross-View Cross Modality Correspondence by Pointmap Prediction.

.@Cornell is recruiting for multiple postdoctoral positions in AI as part of two programs: Empire AI Fellows and Foundational AI Fellows. Positions are available in NYC and Ithaca. Deadline for full consideration is Nov 20, 2025! academicjobsonline.org/ajo/jobs/30971



🌺 Join us in Hawaii for Wild3D! We're hosting our 2nd Workshop on 3D Modeling, Reconstruction & Generation in the Wild! Dive into 3D + 4D topics, from real-world reconstruction to video generative models & dynamic scene modeling 🌋 #Wild3D #ICCV2025









We are thrilled to share the appointment of @QianqianWang5 as a #KempnerInstitute Investigator! She will bring her expertise in computer vision to @Harvard. Read the announcement: bit.ly/4mIghHy @hseas #AI #ComputerVision