Donny Y. Chen

29 posts

Donny Y. Chen banner
Donny Y. Chen

Donny Y. Chen

@donydchen

Researcher @BytedanceTalk Seed (Singapore) I Previously PhD @MonashUni | Working on 3D Vision | ALL VIEWS ARE SOLELY HIS OWN.

Singapore Katılım Mayıs 2013
128 Takip Edilen287 Takipçiler
Donny Y. Chen retweetledi
Bingyi Kang
Bingyi Kang@bingyikang·
After a year of team work, we're thrilled to introduce Depth Anything 3 (DA3)! 🚀 Aiming for human-like spatial perception, DA3 extends monocular depth estimation to any-view scenarios, including single images, multi-view images, and video. In pursuit of minimal modeling, DA3 reveals two key insights: 💎 A plain transformer (e.g., vanilla DINO) is enough. No specialized architecture. ✨ A single depth-ray representation is enough. No complex 3D tasks. Three series of models have been released: the main DA3 series, a monocular metric estimation series, and a monocular depth estimation series. The core team members, aside from me: @HaotongLin, Sili Chen, Jun Hao Liew, @donydchen. 👇(1/n) #DepthAnything3
English
80
495
3.6K
510.4K
Donny Y. Chen retweetledi
AK
AK@_akhaliq·
Depth Anything 3 Recovering the Visual Space from Any Views
English
7
111
752
47.6K
Donny Y. Chen retweetledi
Bingyi Kang
Bingyi Kang@bingyikang·
How can an AI model learn the underlying dynamics of a visual scene? We're introducing Trajectory Fields, a new way to represent video in 4D! It models the path of each pixel as a continuous 3D trajectory, which is parameterized by a B-spline function of time. This unlocks powerful physical AI tasks: ✨ Video Motion estimation and understanding. 🔮 Scene Motion forecasting. 🎯 Goal-conditioned trajectory generation. 🚀 Dynamic fusion: recovering the complete object while observing it is moving. Find more details in our work "Trace Anything: Representing Any Video in 4D via Trajectory Fields" This work was led by our amazing intern @xinhangliu123 (xinhangliu.com).
Xinhang Liu@xinhangliu123

Excited to share our latest work from the ByteDance Seed Depth Anything team — Trace Anything: Representing Any Video in 4D via Trajectory Fields 💻 Project Page: trace-anything.github.io 📄 Paper: huggingface.co/papers/2510.13… 📦 Code: github.com/ByteDance-Seed… 🤖 Model: huggingface.co/depth-anything… 🖐️ Interactive Results: trace-anything.github.io/viser-client/i…

English
1
17
83
14.4K
Donny Y. Chen retweetledi
Nikhil Keetha
Nikhil Keetha@Nik__V__·
Interesting ICLR submissions 🤩 Depth Anything 3 - My TLDR: Init multi view transformer of VGGT with later layer DINO weights and use teacher model trained on synthetic data only for pseudo labelling real world datasets openreview.net/forum?id=yirun… Trace Anything - My TLDR: VGGT like model predicting N view geometry and motion as a trajectory field represented using splines and control points openreview.net/forum?id=BqaCh… The field is evolving very fast!
Nikhil Keetha tweet mediaNikhil Keetha tweet media
English
5
41
389
25K
Donny Y. Chen retweetledi
Donny Y. Chen retweetledi
AK
AK@_akhaliq·
VolSplat Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction
English
1
23
184
17.4K
Donny Y. Chen
Donny Y. Chen@donydchen·
Had a fantastic experience working with Chuanxia as his co-supervised PhD student. I highly recommend joining his Physical Vision Group (physicalvision.github.io) at NTU Singapore, and I believe it will be an inspiring place to grow as a researcher!
Chuanxia Zheng@ChuanxiaZ

After two amazing years with @Oxford_VGG, I will be joining @NTUsg as a Nanyang Assistant Professor in Fall 2025! I’ll be leading the Physical Vision Group (physicalvision.github.io) — and we're hiring for next year!🚀 If you're passionate about vision or AI, get in touch!

English
0
0
8
331
Donny Y. Chen
Donny Y. Chen@donydchen·
Thanks Zhenjun for sharing. Our PM-Loss tackles low geometry quality in feed-forward 3DGS caused by "discontinuity" in predicted depth, and features a plug-and-play training loss that leverages geometry priors from pointmap-based models like VGGT. More at aim-uofa.github.io/PMLoss
Zhenjun Zhao@zhenjun_zhao

Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting @Duochao_Shi, @wjwang2003, @donydchen, Zeyu Zhang, Jia-Wang Bian, @supremeZhuang, @chunhua_shen tl;dr: pre-trained 3D reconstruction models->pointmaps->geometry prior->loss arxiv.org/abs/2506.05327

English
0
4
58
4.2K
Donny Y. Chen retweetledi
Haofei Xu
Haofei Xu@haofeixu·
MVSplat enables super fast scene reconstructions from as few as two images in a single forward pass! Code & models: github.com/donydchen/mvsp… Joint work with amazing collaborators @donydchen @ChuanxiaZ @supremeZhuang @mapo1 @AutoVisionGroup Tat-Jen Cham and Jianfei Cai!
MrNeRF@janusch_patas

MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images Paper: donydchen.github.io/mvsplat/static… Project: donydchen.github.io/mvsplat/ Code: github.com/donydchen/mvsp… (1/2)

English
0
8
49
6.5K
Donny Y. Chen
Donny Y. Chen@donydchen·
Well summarised, thanks for the hard work! Glad to share that me and @QianyiWu7 have managed to make our contributions to such a fast developing community. More details please refer to Sem2NeRF(donydchen.github.io/sem2nerf/) and ObjectSDF(wuqianyi.top/objectsdf/).
Mark Boss@markb_boss

As #ECCV2022 is approaching rapidly, I wanted to find all @neural_fields related papers. I've compiled my gatherings in another post of (hopefully) all NeRFy papers. Feel free to message me about anything I have missed :) markboss.me/post/nerf_at_e…

English
0
0
3
0
Donny Y. Chen
Donny Y. Chen@donydchen·
McDonald’s inside a park. Haven’t ate hamburgers for a long time, nice food and beautiful views make a enjoyable Saturday dinner.
Donny Y. Chen tweet mediaDonny Y. Chen tweet media
English
0
0
0
0
Donny Y. Chen
Donny Y. Chen@donydchen·
After shipping a table from Taobao, my housemates start playing Mahjong every night. Interesting to find that I’m the only one in the house who don’t know how to play🧐
Donny Y. Chen tweet media
English
0
0
0
0
Donny Y. Chen retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
Finding it increasingly hard to keep up with all of the activity in deep learning right now, as # tabs -> \infty and tab width -> 0. It hasn't even been a decade since AlexNet (~Sep 2012 => ~8.5yrs) and a lot happened. Another 8.5 will be ~2030, I wonder what that's like.
Andrej Karpathy tweet media
English
52
91
1.3K
0
Donny Y. Chen
Donny Y. Chen@donydchen·
@leehsienloong 7 confirmed cases today, 2 of which are taxi drivers, considering the highly contagious of the novel coronavirus, I don’t understand why govt still keeps saying the risk of infection is low. Also, 4 out of 40 cases in SG are in critical condition, looks lethal enough already.
English
0
0
0
0
leehsienloong
leehsienloong@leehsienloong·
We have faced the 2019-nCoV situation for about 2 weeks now. People are understandably anxious & fearful, but there is no need to panic — Singapore has ample supplies. Instead, let us remain united & resolute, stay calm & carry on with our lives. – LHL youtu.be/oNw1pyksKHo
YouTube video
YouTube
English
78
377
995
0