Princeton Vision & Learning Lab

53 posts

@PrincetonVL

https://t.co/Bd2gisj8hY

Princeton, NJ · Joined March 2023
2 Following · 1.7K Followers
Princeton Vision & Learning Lab
Stereo depth is highly useful for robots. Meet WAFT-Stereo: #1 on ETH3D (BP-0.5), Middlebury (RMSE), and KITTI (all metrics); 61% lower zero-shot ETH3D BP-0.5 error; 1.8-6.7x faster than prior SOTA. Key idea: classify disparity into bins, then refine with iterative high-resolution warping. 🧵1/2
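To make the key idea concrete, here is a minimal PyTorch sketch of the two stages: a soft-argmax over disparity-bin logits, then warping-based refinement at high resolution. All names, shapes, and the `update_net` refiner are illustrative assumptions, not the released WAFT-Stereo code.

```python
# Sketch of bin classification + iterative warping refinement (assumed, not
# the official implementation).
import torch
import torch.nn.functional as F

def disparity_from_bins(logits, bin_centers):
    # logits: (B, K, H, W) classification scores over K candidate disparity bins.
    probs = logits.softmax(dim=1)
    # Soft-argmax: expected disparity under the bin distribution.
    return (probs * bin_centers.view(1, -1, 1, 1)).sum(dim=1, keepdim=True)

def warp_right_to_left(feat_right, disparity):
    # Sample right-view features at x - d to align them with the left view.
    B, C, H, W = feat_right.shape
    ys, xs = torch.meshgrid(
        torch.arange(H, device=feat_right.device, dtype=feat_right.dtype),
        torch.arange(W, device=feat_right.device, dtype=feat_right.dtype),
        indexing="ij",
    )
    x_src = xs.unsqueeze(0) - disparity.squeeze(1)  # (B, H, W)
    grid = torch.stack(
        (2 * x_src / (W - 1) - 1,
         2 * ys.unsqueeze(0).expand_as(x_src) / (H - 1) - 1),
        dim=-1,
    )  # (B, H, W, 2), normalized to [-1, 1] for grid_sample
    return F.grid_sample(feat_right, grid, align_corners=True)

def refine(disparity, feat_left, feat_right, update_net, n_iters=4):
    # Iteratively warp right features with the current estimate and let a
    # small network (hypothetical `update_net`) predict a residual correction.
    for _ in range(n_iters):
        warped = warp_right_to_left(feat_right, disparity)
        disparity = disparity + update_net(torch.cat([feat_left, warped], dim=1))
    return disparity
```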
Princeton Vision & Learning Lab
Meet WAFT (Warping-Alone Field Transforms), our new optical-flow estimator. #1 on public benchmarks (Sintel & Spring), 1.3-4.1x faster than leading methods, and 2x lower memory. Key idea: replace cost volumes with high-res feature-space warping. Code and paper:👇
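A minimal sketch of what feature-space warping with a current flow estimate looks like in PyTorch; this illustrates the general technique, not the released WAFT implementation, and all names are assumptions. The point is that no all-pairs cost volume is ever materialized.

```python
# Warp frame-2 features by the current flow estimate (assumed sketch).
import torch
import torch.nn.functional as F

def warp_by_flow(feat2, flow):
    # feat2: (B, C, H, W) features of frame 2; flow: (B, 2, H, W) in pixels.
    B, _, H, W = feat2.shape
    ys, xs = torch.meshgrid(
        torch.arange(H, device=feat2.device, dtype=feat2.dtype),
        torch.arange(W, device=feat2.device, dtype=feat2.dtype),
        indexing="ij",
    )
    x_src = xs + flow[:, 0]  # where each frame-1 pixel lands in frame 2
    y_src = ys + flow[:, 1]
    grid = torch.stack(
        (2 * x_src / (W - 1) - 1, 2 * y_src / (H - 1) - 1), dim=-1
    )
    # Memory stays O(C*H*W); no (H*W) x (H*W) cost volume is built.
    return F.grid_sample(feat2, grid, align_corners=True)
```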
Princeton Vision & Learning Lab
To construct this mapping, we use Hollywood-grade lenses that provide reliable lens metadata. We calibrate the lenses at many settings using multi-scale board and drone targets, and extract intrinsics via Kalibr, a toolkit we modified for improved robustness and accuracy. 🧵4/5
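As a hedged illustration of how a metadata-to-intrinsics mapping could be built, the sketch below interpolates a calibrated focal-length table over (zoom, focus) lens settings. The table values are made up, and the real pipeline derives intrinsics with the modified Kalibr toolkit described above.

```python
# Illustrative-only mapping from lens metadata to intrinsics via interpolation.
import numpy as np
from scipy.interpolate import LinearNDInterpolator

# (zoom_mm, focus_m) -> focal length in pixels, from calibration sessions
# (numbers below are placeholders, not real calibration data).
settings = np.array([[24, 1.0], [24, 5.0], [70, 1.0], [70, 5.0]], dtype=float)
fx_pix = np.array([1800.0, 1820.0, 5200.0, 5260.0])

fx_of = LinearNDInterpolator(settings, fx_pix)
print(fx_of(50.0, 2.0))  # focal length at an unseen zoom/focus setting
```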
Princeton Vision & Learning Lab
Estimating camera intrinsics from video is key to 3D reconstruction, but most methods assume they’re fixed per video. What if the camera keeps zooming and refocusing? Meet InFlux, the first benchmark with per-frame ground truth for videos with dynamic intrinsics. 🧵1/5
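As a sketch of why per-frame ground truth matters, the snippet below scores predicted intrinsics frame by frame rather than once per video; the array shapes and relative focal-length error are illustrative assumptions, not the benchmark's official protocol.

```python
# Assumed per-frame evaluation sketch, not the InFlux protocol.
import numpy as np

def per_frame_focal_error(K_pred, K_gt):
    # K_pred, K_gt: (T, 3, 3) intrinsics for each of T frames.
    fx_pred, fx_gt = K_pred[:, 0, 0], K_gt[:, 0, 0]
    return np.abs(fx_pred - fx_gt) / fx_gt  # relative error per frame

# A fixed-intrinsics method repeats one K for all T frames, so it is
# penalized whenever the true focal length changes mid-video (zooming).
```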
Princeton Vision & Learning Lab
Major update to Infinigen Articulated (formerly Infinigen-Sim)! You can now generate articulated 3D objects in 18 categories, simulation-ready with physics parameters and improved efficiency. Also available: 20k pre-generated objects. Download links below👇
Princeton Vision & Learning Lab
Princeton365 opens new frontiers for Novel View Synthesis, providing 360° scans of reflective and transparent scenes with precise camera poses—data that were previously unavailable, as conventional tools like COLMAP fail to recover accurate poses in such conditions. (3/4)
Princeton Vision & Learning Lab
🧵 Working on SLAM or Novel View Synthesis but need a new challenge? Try Princeton365, our new video benchmark built to push your model to the limit. It features reflective and transparent scenes, wild camera motion, night-time shots, flashing lights, video within video, and more, all with millimeter-accurate ground-truth camera poses! (1/4)
Princeton Vision & Learning Lab
Our synthetic data can be used to train models to predict multi-layer depth. It also improves first-layer depth prediction. (4/5)
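For intuition, here is a minimal sketch of a multi-layer depth head, assuming the model predicts D depth maps per pixel (layer 0 = first visible surface such as the glass, layer 1 = the surface behind it). The layer count and module are illustrative, not the paper's architecture.

```python
# Assumed multi-layer depth head sketch, not the paper's model.
import torch
import torch.nn as nn

class MultiLayerDepthHead(nn.Module):
    def __init__(self, in_channels=256, num_layers=2):
        super().__init__()
        # One output channel per depth layer (e.g., glass, then behind-glass).
        self.head = nn.Conv2d(in_channels, num_layers, kernel_size=1)

    def forward(self, features):
        # features: (B, C, H, W) -> depths: (B, D, H, W), kept positive.
        return nn.functional.softplus(self.head(features))
```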
Princeton Vision & Learning Lab
Depth models struggle with transparent surfaces. They may see a glass window, or what is behind it, but not both. Worse, they are often confused and inconsistent. How do we make them see the glass and see through it? Check out our ICCV 2025 paper “Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation”. (1/5)