Qi Wu

63 posts

Qi Wu banner
Qi Wu

Qi Wu

@wilson_over

Applied research scientists at NVIDIA. Views and opinions are my own and do not represent those of my employer, NVIDIA.

Katılım Eylül 2012
346 Takip Edilen180 Takipçiler
Sabitlenmiş Tweet
Qi Wu
Qi Wu@wilson_over·
Say goodbye to perfect pinhole assumptions Excited to introduce 3DGUT—a Gaussian Splatting formulation that unlocks support for distorted cameras, including time dependent effects like rolling shutter, while maintaining the benefits of rasterization, rendering at >250 FPS. 🧵
English
5
41
301
54.9K
Qi Wu retweetledi
Michał Tyszkiewicz
Michał Tyszkiewicz@jatentaki·
Feed-forward 3D reconstruction should not be limited to predicting one Gaussian per pixel. We introduce TokenGS, which uses learnable tokens to decouple the 3D Gaussian prediction from the image resolution and the number of input views. #CVPR2026Highlight [1/6]
English
6
44
249
44.8K
Qi Wu retweetledi
Janick Martinez Esturo
Janick Martinez Esturo@jmartinezesturo·
A canonical open-source data platform for multi-sensor neural 3D reconstruction is here. Introducing NVIDIA NCore — a unified data format and APIs for cameras, lidars, radars, poses, calibrations, and labels, built for AV, robotics, and physical AI. 🔗research.nvidia.com/labs/sil/proje…
English
6
81
508
44.7K
Qi Wu retweetledi
Qi Wu retweetledi
Jorge Condor
Jorge Condor@Arcanous98·
Introducing Neural Harmonic Textures: our new method for real-time novel view synthesis that outperforms all 3DGS and NeRF derivatives including (finally) ZipNeRF in terms of quality across all benchmarks. The code is released (Apache 2.0): (research.nvidia.com/labs/sil/proje…) 🧵
English
12
103
609
43.3K
Qi Wu retweetledi
Michał Tyszkiewicz
Michał Tyszkiewicz@jatentaki·
This week at @NVIDIAGTC we're presenting AlpaDreams: a generative world model for driving simulation. Compared to standard video models, AlpaDreams is autoregressive, enabling updating the conditioning (simple bounding box world) in closed loop, and multiview-consistent.
English
2
21
75
6K
Qi Wu retweetledi
Ruilong Li
Ruilong Li@ruilong_li·
Such a lovely team to work with—so many talented and devoted people. At NVIDIA, we work hard not out of fear on being fired, but because we truly enjoy the team and the project. Drop me or anyone on the team a message if you’re interested in joining. research.nvidia.com/labs/sil/membe…
Ruilong Li@ruilong_li

Special moment to see something I’ve worked on so closely come to life! Today we announce Alpadreams — a world model that lets you explore ♾endlessly♾️in ⚡real time⚡. Video: me (left) and Alpamayo policy (right) driving in Alpadreams at #GTC26. research.nvidia.com/labs/sil/proje…

English
0
6
46
4.1K
Qi Wu retweetledi
Ruilong Li
Ruilong Li@ruilong_li·
Special moment to see something I’ve worked on so closely come to life! Today we announce Alpadreams — a world model that lets you explore ♾endlessly♾️in ⚡real time⚡. Video: me (left) and Alpamayo policy (right) driving in Alpadreams at #GTC26. research.nvidia.com/labs/sil/proje…
English
2
18
97
10.1K
Qi Wu retweetledi
Zan Gojcic
Zan Gojcic@ZGojcic·
A new generation in AV simulation is here! We are announcing AlpaDreams, a real time interactive generative world model for AV simualtion! Just a year ago it took minutes to generate a few seconds of video, today it is real time and interactive! research.nvidia.com/labs/sil/proje…
English
5
26
106
18.5K
Qi Wu retweetledi
ESPN F1
ESPN F1@ESPNF1·
One for the tinfoil hats. If this ends Verstappen, Piastri, Norris, then the Monza swap WON Norris the title over Verstappen ✍️ @natesaundersF1
ESPN F1 tweet media
English
100
152
3.6K
237K
Qi Wu retweetledi
Zan Gojcic
Zan Gojcic@ZGojcic·
Our team at Nvidia Spatial Intelligence Lab is hiring PhD research interns for 2026! research.nvidia.com/labs/sil/ If you’re excited about fast video models, generative world simulators, or 3D foundation models, please reach out by email or apply directly lnkd.in/gGKU_sUr
English
5
31
214
58.4K
Qi Wu retweetledi
AK
AK@_akhaliq·
Nvidia just released Lyra on Hugging Face Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation TL;DR: Feed-forward 3D and 4D scene generation from a single image/video trained with synthetic data generated by a camera-controlled video diffusion model
English
17
94
546
55.8K
Qi Wu retweetledi
Jiahui Huang
Jiahui Huang@huangjh_hjh·
[1/N] 🎥 We've made available a powerful spatial AI tool named ViPE: Video Pose Engine, to recover camera motion, intrinsics, and dense metric depth from casual videos! Running at 3–5 FPS, ViPE handles cinematic shots, dashcams, and even 360° panoramas. 🔗 research.nvidia.com/labs/toronto-a…
English
13
104
450
63K
Qi Wu retweetledi
MrNeRF
MrNeRF@janusch_patas·
ViPE: Video Pose Engine for 3D Geometric Perception Contributions: • A robust and efficient framework, ViPE, for estimating camera parameters and dense depth from diverse, in-the-wild videos. • A system design that integrates the strengths of classical SLAM (efficiency, scalability) and learned models (robustness), with key improvements in efficiency, dynamic object handling, and depth quality over prior work. • A large-scale dataset of annotated videos, created using ViPE, to facilitate future research in 3D computer vision.
English
4
41
273
42.5K
Qi Wu retweetledi
MrNeRF
MrNeRF@janusch_patas·
GSCache: Real-Time Radiance Caching for Volume Path Tracing using 3D Gaussian Splatting Contributions: • We introduce a novel radiance cache optimized for volume rendering that caches path-space radiance using multiple levels of Gaussian splats. • The cache works in real time on complex datasets and in a wide variety of use cases. It adapts quickly to changes in the transfer function and lighting parameters, improving overall image quality and rendering times. • Optimizing the cache is possible not only with clean samples but also with noisy data, as is commonly found in Monte-Carlo-based renderers. • The path-space nature of the cache and its non-invasive design make it easy to use and integrate into existing rendering solutions.
MrNeRF tweet mediaMrNeRF tweet mediaMrNeRF tweet mediaMrNeRF tweet media
English
1
11
62
3.5K
Qi Wu retweetledi
Ruilong Li
Ruilong Li@ruilong_li·
For everyone interested in precise 📷camera control 📷 in transformers [e.g., video / world model etc] Stop settling for Plücker raymaps -- use camera-aware relative PE in your attention layers, like RoPE (for LLMs) but for cameras! Paper & code: liruilong.cn/prope/
Ruilong Li tweet media
English
10
93
530
108.7K
Qi Wu retweetledi
Tri Dao
Tri Dao@tri_dao·
Getting mem-bound kernels to speed-of-light isn't a dark art, it's just about getting the a couple of details right. We wrote a tutorial on how to do this, with code you can directly use. Thanks to the new CuTe-DSL, we can hit speed-of-light without a single line of CUDA C++.
Wentao Guo@WentaoGuo7

🦆🚀QuACK🦆🚀: new SOL mem-bound kernel library without a single line of CUDA C++ all straight in Python thanks to CuTe-DSL. On H100 with 3TB/s, it performs 33%-50% faster than highly optimized libraries like PyTorch's torch.compile and Liger. 🤯 With @tedzadouri and @tri_dao

English
7
54
522
48.8K
Qi Wu retweetledi
Huan Ling
Huan Ling@HuanLing6·
We are excited to share Cosmos-Drive-Dreams 🚀 A bold new synthetic data generation (SDG) pipeline powered by world foundation models—designed to synthesize rich, challenging driving scenarios at scale. Models, Code, Dataset, Tookit are released. Website: research.nvidia.com/labs/toronto-a…
English
11
42
118
24.2K