Shunsuke Saito

941 posts

Shunsuke Saito banner
Shunsuke Saito

Shunsuke Saito

@psyth91

Director, Research Scientist @AIatMeta Digital Human/Vision/Graphics デジタルヒューマンの研究してます。 #pifuhd #sapiens All opinions are my own.

Pittsburgh Katılım Nisan 2020
522 Takip Edilen4.5K Takipçiler
Sabitlenmiş Tweet
Shunsuke Saito
Shunsuke Saito@psyth91·
Relightable Codec Avatars is now extended to full-body! At #SIGGRAPH2025, we will present Relightable Full-body Gaussian Codec Avatars. Key contributions include learnable Zonal Harmonics and deferred learnable radiance transfer for specular! Check it out! neuralbodies.github.io/RFGCA/
English
2
30
178
13.4K
Shunsuke Saito retweetledi
Pablo Vela
Pablo Vela@pablovelagomez1·
I've been working a lot with SAM3 and the Momentum Human Rig (MHR). I finally integrated it into the data I'm working with @rerundotio. The progression I've taken looks as follows SAM3 + SAM3D-body on 1. a single image 2. a set of multiple images 3. a single video 4. A multiview video capture I took inspiration from the SAM3D-body paper and built a multiview fitting optimization pipeline. This pipeline involves using the 2D keypoints from the single-view pipeline, triangulating them, and employing an L1 loss between the 2D/3D keypoints. The temporal stability isn't great, so that's the next portion I'm going to focus on. One really frustrating thing about SAM3D-body is the lack of per-joint confidence values. It makes it harder to deal with occlusions. I'm probably going to need to use a separate model, or maybe add a confidence head.
Pablo Vela@pablovelagomez1

Back to working on exo + ego in @rerundotio. This is a big jump in progress! One of the main painpoints I've had is getting the ego and exo views aligned in the same coordinate system, but I finally managed to get it all working. This means that now I have 1. Slam working for ego 2. Calibrated exo views 3. 3D keypoints for the full human body 4. 6DoF wrist poses 5. Temporally aligned videos 6. Spatially aligned multi cameras Now it's time to scale it up 🙂

English
9
43
448
42.3K
Shunsuke Saito retweetledi
ゴメパパ
ゴメパパ@gomessdegomess·
毎年お馴染みlevelsfyiの年度末レポートがやってきたので気になるところだけまとめてく メリカのトップ給与動向のまとめ
ゴメパパ tweet media
日本語
1
6
40
28.7K
Shunsuke Saito retweetledi
AI at Meta
AI at Meta@AIatMeta·
SAM 3D is helping advance the future of rehabilitation. See how researchers at @CarnegieMellon are using SAM 3D to capture and analyze human movement in clinical settings, opening the doors to personalized, data-driven insights in the recovery process. 🔗 Learn more about SAM 3D: go.meta.me/305985
English
34
89
491
64.8K
Shunsuke Saito retweetledi
AI at Meta
AI at Meta@AIatMeta·
Introducing SAM 3D, the newest addition to the SAM collection, bringing common sense 3D understanding of everyday images. SAM 3D includes two models: 🛋️ SAM 3D Objects for object and scene reconstruction 🧑‍🤝‍🧑 SAM 3D Body for human pose and shape estimation Both models achieve state-of-the-art performance transforming static 2D images into vivid, accurate reconstructions. 🔗 Learn more: go.meta.me/305985
English
130
1.1K
6.5K
851.8K
Shunsuke Saito retweetledi
Shunsuke Saito retweetledi
Jihyun Lee
Jihyun Lee@jyun_leee·
I have two exciting career updates to share! 😃 1️⃣ After memorable years at KAIST, I recently joined Meta as a Postdoctoral AI Research Scientist! I’m thrilled to be part of the Codec Avatars Lab, working with Shunsuke Saito (@psyth91) — one of the few researchers I admired most during my PhD years — and his amazing team. I’m genuinely super excited about the next-generation avatar project we’re pushing forward! 2️⃣ I’m currently attending ICCV 🏖 and will be giving a keynote talk at the HANDS workshop this afternoon. If you’re interested, please join the talk at 13:40 in room 305B. If you’d like to connect or chat outside of the talk, also feel free to drop me a message!
Jihyun Lee tweet media
English
18
15
466
41.1K
Shunsuke Saito retweetledi
David Park
David Park@park_jinhyung1·
Introducing ATLAS: A high-fidelity, parametric human body model enabling precise, independent control of surface and skeletal attributes for character creation. To be presented at #ICCV2025! Learn more about ATLAS here: jindapark.github.io/projects/atlas/
David Park tweet media
English
6
33
187
25.6K
Shunsuke Saito
Shunsuke Saito@psyth91·
Want Gaussian Avatar on mobile? Turns out the bottleneck is decoding of pose correctives. At #SIGGRAPH2025, we present a simple yet highly effective solution. We make *any* Gaussian avatars mobile-ready via linear distillation and corrective sharing. 👉forresti.github.io/squeezeme/
English
2
12
104
14.7K
Shunsuke Saito retweetledi
Kuroko
Kuroko@_c_he_·
📢 #SIGGRAPH2025 I'll be presenting our paper "3DGH: 3D Head Generation with Composable Hair and Face". Swing by and let's talk about hair and head generation! #Meta #Yale ⏰ Monday, Aug 11 | 2:00pm - 3:30pm PDT 📍 West Building, Rooms 301-305 🔗 c-he.github.io/projects/3dgh/
English
1
4
17
1.1K
Shunsuke Saito
Shunsuke Saito@psyth91·
Relightable Codec Avatars is now extended to full-body! At #SIGGRAPH2025, we will present Relightable Full-body Gaussian Codec Avatars. Key contributions include learnable Zonal Harmonics and deferred learnable radiance transfer for specular! Check it out! neuralbodies.github.io/RFGCA/
English
2
30
178
13.4K
Shunsuke Saito retweetledi
Kashu Yamazaki
Kashu Yamazaki@kashu_yamazaki·
AIロボット分野で米国大学院進学希望していて今年出願する人がいるなら気軽に声かけてください。できる範囲で手伝います。 もっとこの分野に日本人プレーヤー増やさないと!! トップ大学のめぼしい研究室に最低1人はいるような状況にしないと、ダメ。
日本語
1
83
432
182.1K
Shunsuke Saito retweetledi
Tobias Kirschstein
Tobias Kirschstein@TobiasKirschst1·
We will present Avat3r at #ICCV2025! 🥳 Avat3r brings animation to Large Reconstruction Models. One surprising finding was that we can get rid of any template-based deformation modeling and simply use cross-attention to an abstract facial expression code. tobias-kirschstein.github.io/avat3r/
Matthias Niessner@MattNiessner

📢📢 𝐀𝐯𝐚𝐭𝟑𝐫 📢📢 Avat3r creates high-quality 3D head avatars from just a few input images in a single forward pass with a new dynamic 3DGS reconstruction model. Video: youtu.be/P3zNVx15gYs Project: tobias-kirschstein.github.io/avat3r Our core idea is to make Gaussian Reconstruction Models animatable. We find that a simple cross-attention to an expression code sequence is already sufficient to model complex facial expressions. We then incorporate position maps from DUSt3R and feature maps from Sapiens to facilitate the prediction task. While DUSt3R's position maps act as a pixel-aligned initialization for the Gaussians' positions, the Sapiens feature maps help the cross-view transformer to match corresponding image tokens in the 4 input images. One major challenge in creating a 3D head avatar from smartphone images comes from inconsistent facial expressions when the subject could not remain perfectly static during the capture. We eliminate this static requirement by simply showing our model input images with different facial expressions during training. This technique makes our model robust to inconsistent input images later on. Finally, we show that despite the model has been trained with 4 input images, one can even create a 3D head avatar when only a single image is available. To achieve this, we employ a pre-trained 3D GAN to lift the single image to 3D and then render the 4 input images for our model. This allows us to create 3D head avatars from single images and even highly out-of-distribution examples like AI generated faces, paintings or statues. Great work by @TobiasKirschst1 from his internship at Meta with Javier Romero, @ASevastopolsky, and @psyth91

English
1
29
151
13K
T.Hayase
T.Hayase@ThayaFluss·
もしかして、現Xだと論文宣伝しても届きづらい? オンラインで宣伝、どこですればいいんだ
日本語
1
0
1
529
Shunsuke Saito retweetledi
Matthias Niessner
Matthias Niessner@MattNiessner·
Seven papers accepted at #ICCV2025! Exciting topics: lots of generative AI using transformers, diffusion, 3DGS, etc. focusing on image synthesis, geometry generation, avatars, and much more - check it out! So proud of everyone involved - let's go🚀🚀🚀 niessnerlab.org/publications.h…
Matthias Niessner tweet media
English
3
27
200
14.6K
Shunsuke Saito retweetledi
Jack Saunders
Jack Saunders@jack_r_saunders·
3DGH: 3D Head Generation with Composable Hair and Face TLDR: A Generative model of (static) Gaussian Avatars using a GAN. Here the head and hair are modelled separately with templates, and data is generated synthetically. 📽️ Project Page: c-he.github.io/projects/3dgh/ 📜 Paper: arxiv.org/pdf/2506.20875
Jack Saunders tweet media
English
1
19
70
4.2K
Take Ohkawa
Take Ohkawa@tkhkaeio·
My first-author paper has been accepted to #ICCV2025! Excited to share with you soon!
English
5
0
19
967