Donny Y. Chen (@donydchen) - Twitter Profili | Zamantika Mersobahis Locabet

Donny Y. Chen retweetledi

Bingyi Kang@bingyikang·14 Kas

After a year of team work, we're thrilled to introduce Depth Anything 3 (DA3)! 🚀 Aiming for human-like spatial perception, DA3 extends monocular depth estimation to any-view scenarios, including single images, multi-view images, and video. In pursuit of minimal modeling, DA3 reveals two key insights: 💎 A plain transformer (e.g., vanilla DINO) is enough. No specialized architecture. ✨ A single depth-ray representation is enough. No complex 3D tasks. Three series of models have been released: the main DA3 series, a monocular metric estimation series, and a monocular depth estimation series. The core team members, aside from me: @HaotongLin, Sili Chen, Jun Hao Liew, @donydchen. 👇(1/n) #DepthAnything3

English

80

495

3.6K

510.4K

Donny Y. Chen retweetledi

AK@_akhaliq·14 Kas

Depth Anything 3 Recovering the Visual Space from Any Views

English

7

111

752

47.6K

Donny Y. Chen retweetledi

Bingyi Kang@bingyikang·16 Eki

How can an AI model learn the underlying dynamics of a visual scene? We're introducing Trajectory Fields, a new way to represent video in 4D! It models the path of each pixel as a continuous 3D trajectory, which is parameterized by a B-spline function of time. This unlocks powerful physical AI tasks: ✨ Video Motion estimation and understanding. 🔮 Scene Motion forecasting. 🎯 Goal-conditioned trajectory generation. 🚀 Dynamic fusion: recovering the complete object while observing it is moving. Find more details in our work "Trace Anything: Representing Any Video in 4D via Trajectory Fields" This work was led by our amazing intern @xinhangliu123 (xinhangliu.com).

Xinhang Liu@xinhangliu123

Excited to share our latest work from the ByteDance Seed Depth Anything team — Trace Anything: Representing Any Video in 4D via Trajectory Fields 💻 Project Page: trace-anything.github.io 📄 Paper: huggingface.co/papers/2510.13… 📦 Code: github.com/ByteDance-Seed… 🤖 Model: huggingface.co/depth-anything… 🖐️ Interactive Results: trace-anything.github.io/viser-client/i…

English

1

17

83

14.4K

Donny Y. Chen retweetledi

Xinhang Liu@xinhangliu123·16 Eki

Excited to share our latest work from the ByteDance Seed Depth Anything team — Trace Anything: Representing Any Video in 4D via Trajectory Fields 💻 Project Page: trace-anything.github.io 📄 Paper: huggingface.co/papers/2510.13… 📦 Code: github.com/ByteDance-Seed… 🤖 Model: huggingface.co/depth-anything… 🖐️ Interactive Results: trace-anything.github.io/viser-client/i…

English

4

7

69

20.8K

Donny Y. Chen retweetledi

Nikhil Keetha@Nik__V__·10 Eki

Interesting ICLR submissions 🤩 Depth Anything 3 - My TLDR: Init multi view transformer of VGGT with later layer DINO weights and use teacher model trained on synthetic data only for pseudo labelling real world datasets openreview.net/forum?id=yirun… Trace Anything - My TLDR: VGGT like model predicting N view geometry and motion as a trajectory field represented using splines and control points openreview.net/forum?id=BqaCh… The field is evolving very fast!

English

5

41

389

25K

Donny Y. Chen retweetledi

Weijie Wang@wjwang2003·29 Eyl

🎉ZPressor has been accepted to @NeurIPSConf 2025. 🎊The code and model checkpoints have also been open sourced on Github and Huugging Face. Welcome to star and use it! Code: github.com/ziplab/ZPressor Project Pages: lhmd.top/zpressor Models: huggingface.co/lhmd/ZPressor

Weijie Wang@wjwang2003

🚀 We're excited to introduce ZPressor, a bottleneck-aware compression module for scalable feed-forward 3DGS. Existing feed-forward 3DGS models struggle with dense views, facing performance drops & massive redundancy. ZPressor leverages Information Bottleneck Theory to compress multi-view features, significantly boosting scalability and reconstruction quality for robust dense-view synthesis. Plug & play, lightweight, and powerful. Read more: lhmd.top/zpressor Paper: arxiv.org/abs/2505.23734 Code: github.com/ziplab/ZPressor @donydchen @SteveZeyuZhang @supremeZhuang

English

0

2

6

412

Donny Y. Chen retweetledi

AK@_akhaliq·24 Eyl

VolSplat Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction

English

1

23

184

17.4K

Donny Y. Chen@donydchen·24 Haz

Had a fantastic experience working with Chuanxia as his co-supervised PhD student. I highly recommend joining his Physical Vision Group (physicalvision.github.io) at NTU Singapore, and I believe it will be an inspiring place to grow as a researcher!

Chuanxia Zheng@ChuanxiaZ

After two amazing years with @Oxford_VGG, I will be joining @NTUsg as a Nanyang Assistant Professor in Fall 2025! I’ll be leading the Physical Vision Group (physicalvision.github.io) — and we're hiring for next year!🚀 If you're passionate about vision or AI, get in touch!

English

0

8

331

Donny Y. Chen@donydchen·9 Haz

Thanks Zhenjun for sharing. Our PM-Loss tackles low geometry quality in feed-forward 3DGS caused by "discontinuity" in predicted depth, and features a plug-and-play training loss that leverages geometry priors from pointmap-based models like VGGT. More at aim-uofa.github.io/PMLoss

Zhenjun Zhao@zhenjun_zhao

Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting @Duochao_Shi, @wjwang2003, @donydchen, Zeyu Zhang, Jia-Wang Bian, @supremeZhuang, @chunhua_shen tl;dr: pre-trained 3D reconstruction models->pointmaps->geometry prior->loss arxiv.org/abs/2506.05327

English

0

4

58

4.2K

Donny Y. Chen@donydchen·2 Haz

ZPressor, an architecture-agnostic module that enables existing feed-forward #3DGS models to effectively handle dense input views (up to 100 on an 80G GPU). More at: lhmd.top/zpressor/

Weijie Wang@wjwang2003

🚀 We're excited to introduce ZPressor, a bottleneck-aware compression module for scalable feed-forward 3DGS. Existing feed-forward 3DGS models struggle with dense views, facing performance drops & massive redundancy. ZPressor leverages Information Bottleneck Theory to compress multi-view features, significantly boosting scalability and reconstruction quality for robust dense-view synthesis. Plug & play, lightweight, and powerful. Read more: lhmd.top/zpressor Paper: arxiv.org/abs/2505.23734 Code: github.com/ziplab/ZPressor @donydchen @SteveZeyuZhang @supremeZhuang

English

0

3

41

3.7K

Donny Y. Chen retweetledi

Zhenjun Zhao@zhenjun_zhao·8 Kas

MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views Yuedong Chen, @ChuanxiaZ, @haofeixu, @supremeZhuang, Andrea Vedaldi, Tat-Jen Cham, Jianfei Cai tl;dr: extend MVSplat to 360° NVS arxiv.org/pdf/2411.04924

English

1

12

49

4K

Donny Y. Chen@donydchen·12 Ara

Fantastic dinner @theworldlabs, glad to meet so many nice people around.

Rajko Radovanović@rajko_rad

❤️ @theworldlabs cc @chlassner @drfeifei @donydchen @venturetwins

English

0

5

219

Donny Y. Chen retweetledi

Haofei Xu@haofeixu·22 Mar

MVSplat enables super fast scene reconstructions from as few as two images in a single forward pass! Code & models: github.com/donydchen/mvsp… Joint work with amazing collaborators @donydchen @ChuanxiaZ @supremeZhuang @mapo1 @AutoVisionGroup Tat-Jen Cham and Jianfei Cai!

MrNeRF@janusch_patas

MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images Paper: donydchen.github.io/mvsplat/static… Project: donydchen.github.io/mvsplat/ Code: github.com/donydchen/mvsp… (1/2)

English

0

8

49

6.5K

Donny Y. Chen@donydchen·3 Eki

Well summarised, thanks for the hard work! Glad to share that me and @QianyiWu7 have managed to make our contributions to such a fast developing community. More details please refer to Sem2NeRF(donydchen.github.io/sem2nerf/) and ObjectSDF(wuqianyi.top/objectsdf/).

Mark Boss@markb_boss

As #ECCV2022 is approaching rapidly, I wanted to find all @neural_fields related papers. I've compiled my gatherings in another post of (hopefully) all NeRFy papers. Feel free to message me about anything I have missed :) markboss.me/post/nerf_at_e…

English

0

3

0

Donny Y. Chen@donydchen·30 Oca

McDonald’s inside a park. Haven’t ate hamburgers for a long time, nice food and beautiful views make a enjoyable Saturday dinner.

English

0

Donny Y. Chen@donydchen·28 Oca

After shipping a table from Taobao, my housemates start playing Mahjong every night. Interesting to find that I’m the only one in the house who don’t know how to play🧐

English

0

Donny Y. Chen retweetledi

Andrej Karpathy@karpathy·3 Oca

Finding it increasingly hard to keep up with all of the activity in deep learning right now, as # tabs -> \infty and tab width -> 0. It hasn't even been a decade since AlexNet (~Sep 2012 => ~8.5yrs) and a lot happened. Another 8.5 will be ~2030, I wonder what that's like.

English

52

91

1.3K

0

Donny Y. Chen@donydchen·8 Haz

#NowPlaying 500 Miles, by Peter, Paul and Mary open.spotify.com/track/6oMZhY0f…

English

0

Donny Y. Chen@donydchen·22 Nis

@david2881234 What is this?

English

1

0

David C@david2881234·22 Nis

share from my ps4 pro #PS4share

English

1

0

2

0

Donny Y. Chen@donydchen·8 Şub

@leehsienloong 7 confirmed cases today, 2 of which are taxi drivers, considering the highly contagious of the novel coronavirus, I don’t understand why govt still keeps saying the risk of infection is low. Also, 4 out of 40 cases in SG are in critical condition, looks lethal enough already.

English

0

leehsienloong@leehsienloong·8 Şub

We have faced the 2019-nCoV situation for about 2 weeks now. People are understandably anxious & fearful, but there is no need to panic — Singapore has ample supplies. Instead, let us remain united & resolute, stay calm & carry on with our lives. – LHL youtu.be/oNw1pyksKHo

YouTube

English

78

377

995

0

Donny Y. Chen

Keşfet