Liam Schoneveld

118 posts

Liam Schoneveld

@liamschoneveld

Computer vision researcher @ Woven by Toyota.

Tokyo Katılım Aralık 2011

377 Takip Edilen83 Takipçiler

Liam Schoneveld retweetledi

Matthias Niessner@MattNiessner·23 Ara

📢Pix2NPHM: Learning to Regress NPHM Reconstructions From a Single Image📢 We directly regress neural parametric head models (NPHMs) from a single image — fast, stable, and significantly more expressive than classical 3DMMs such as FLAME. Face tracking & 3D reconstruction are often limited by the representational capacity of PCA-based face models. By lifting NPHMs to a first-class reconstruction primitive, we enable more accurate geometry, richer expressions, and finer animation control. Pix2NPHM obtains fast and reliable NPHM reconstructions on real-world data. Inference-time optimization against surface normals and canonical point maps can further increase fidelity. Key to successful and generalized training of our ViT-based network are: (1) large-scale registration of existing 3D head datasets, and (2) self-supervised training on vast in-the-wild 2D video datasets using pseudo ground-truth surface normals. Finally, we show that geometry-aware pretraining on pixel-aligned reconstruction tasks significantly outperforms generic visual pretraining (e.g., DINO-style features) in terms of generalization. 🌍simongiebenhain.github.io/Pix2NPHM 🎥youtu.be/MgpEJC5p1Ts Great work by @SGiebenhain, @TobiasKirschst1, @liamschoneveld, Davide Davoli, Zhe Chen

YouTube

English

542

37.6K

Liam Schoneveld@liamschoneveld·9 Ara

🐑🐑 SHeaP inference code is out ! 🐑🐑For all your real-time head pose and expression tracking desires! Check it out at: 🤗 HuggingFace spaces: huggingface.co/spaces/nlml/sh… 📟 Github: github.com/nlml/sheap

English

Liam Schoneveld@liamschoneveld·25 Nis

@Michael_J_Black @MattNiessner 😍 This approach can still be improved a lot, I think

English

Michael Black@Michael_J_Black·18 Nis

@MattNiessner Nice. I’ve been wanting to replace the old photometric loss with splatting. Results look great.

English

1.5K

Matthias Niessner@MattNiessner·17 Nis

📢 SHeaP: Self-Supervised Head Predictor Learned via 2D Gaussians 📢 Given a single input image, we predict accurate 3D head geometry, pose, and expression. Previous works (e.g. DECA, EMOCA) use differentiable mesh rasterization to learn a self-supervised head geometry predictor via a photometric reconstruction loss. We borrow these ideas, but our key insight is to replace the mesh rendering with 2D Gaussian Splatting. This leads to much higher accuracy of the underlying predicted geometry and thus more gradient signal during training. 🌍 nlml.github.io/sheap/ 🎥 youtu.be/vhXsZJWCBMA/ Great work by @liamschoneveld @_davidedavoli_ @jiapeng_tang

YouTube

English

341

28.5K

Liam Schoneveld@liamschoneveld·18 Nis

📢 Our new paper - SHeaP - is out! 📢 TLDR: self-supervised head tracking and geometry (FLAME) prediction, learned via photometric loss with a 2D gaussian splatting renderer. See more: 🌍 nlml.github.io/sheap/ 🎥 youtu.be/vhXsZJWCBMA/

YouTube

English

Liam Schoneveld@liamschoneveld·31 Ara

@minchoi Are its predictions in a local or world coordinate system?

English

Min Choi@minchoi·30 Ara

Read more at their project page: m-usamasaleem.github.io/publication/Ge…

English

6.6K

Min Choi@minchoi·30 Ara

This is GenHMR. New AI research for 3D human modeling. Turn a single image into a lifelike 3D human model. Handles tricky poses, occlusions, & depth issues with ease. 10 examples: 1. Chasing

English

134

1.1K

176.7K

Liam Schoneveld@liamschoneveld·31 Ara

Great work by @jiapeng_tang and co ! Gaussian Avatars from just a handful of input images, by leveraging a multi-view diffusion prior !

Matthias Niessner@MattNiessner

📢📢𝐆𝐀𝐅: 𝐆𝐚𝐮𝐬𝐬𝐢𝐚𝐧 𝐀𝐯𝐚𝐭𝐚𝐫 𝐑𝐞𝐜𝐨𝐧𝐬𝐭𝐫𝐮𝐜𝐭𝐢𝐨𝐧 𝐟𝐫𝐨𝐦 𝐌𝐨𝐧𝐨𝐜𝐮𝐥𝐚𝐫 𝐕𝐢𝐝𝐞𝐨𝐬 𝐯𝐢𝐚 𝐌𝐮𝐥𝐭𝐢-𝐯𝐢𝐞𝐰 𝐃𝐢𝐟𝐟𝐮𝐬𝐢𝐨𝐧📢📢 We reconstruct animatable Gaussian head avatars from monocular videos captured by commodity devices such as smartphones. Key idea: distill reconstruction constraints from a multi-view head diffusion model to complete unobserved regions. tangjiapeng.github.io/projects/GAF/ youtu.be/QuIYTljvhyg Great work by @jiapeng_tang @davidedavoli @TobiasKirschst1 @liamschoneveld

English

Liam Schoneveld@liamschoneveld·11 Kas

@dome_271 Classifier-free guidance always seemed like some weird hack to me. There must be a more mathematically elegant solution out there, waiting to be found.

English

104

dome | Outlier@dome_271·11 Kas

Do you all think there is something still fundamentally suboptimal in diffusion models? Just the reason that we need to rely on cfg-style sampling so much seems weird. Not using cfg just still looks very bad.

English

5.6K

Liam Schoneveld@liamschoneveld·6 Kas

@camo2572 @LabAgainstWar @AlboMP @RichardMarlesMP @SenatorWong I don’t think it would be that hard to come up with a better plan than spending $360b for offensive nuclear subs we are not even contractually guaranteed to receive? I feel like literally any plan is better than that one.

English

Here4CarltonMeltdowns⚫️⚪️⚫️🇦🇺✊🏾🌊🏄‍♂️@camo2572·6 Kas

@LabAgainstWar @AlboMP @RichardMarlesMP @SenatorWong What’s your plan then to defend our island nation. Do you honestly think we abandon AUKUS we will be left alone because we are a nice country . I’m really curious when I hear this demand to scrap AUKUS but no plan forward

English

879

Labor Against War@LabAgainstWar·6 Kas

Australia must withdraw from AUKUS. Labor Against War calls on @AlboMP @RichardMarlesMP @SenatorWong to make Australia safe again.

English

148

608

49.2K

Liam Schoneveld@liamschoneveld·6 Kas

@1111nonbeliever @janusch_patas They could release the code but you wouldn’t get very far without their data and compute 😅

English

Non Believer@1111nonbeliever·4 Kas

@janusch_patas No code released?

English

641

MrNeRF@janusch_patas·4 Kas

URAvatar: Universal Relightable Gaussian Codec Avatars Contributions (cited): (1) We introduce a universal relightable avatar prior model learned from hundreds of dynamic performance captures with a multi-view and multi-light system. (2) We build a drivable head avatar from a phone scan that can be rendered and relit with global light transport in real-time. (3) A capture system and evaluation protocol to measure the accuracy of relighting under continuous illuminations.

English

680

50.1K

Liam Schoneveld@liamschoneveld·2 Kas

@MartinGTobias Most of these measures make sense to me? As someone working in tech, I think it’s well worth spending a little money to encourage more women into the field.

English

Martin Tobias (Pre-Seed VC)@MartinGTobias·2 Kas

Your government at work with your money.

Matt Walsh@MattWalshBlog

1/ Kamala became VP in 2020 because of DEI. She became the nominee in 2024 because of DEI. Now, @DoNoHarm has compiled a list of all the ways the Biden-Harris Admin spent your money on DEI. As a certified DEI expert myself, here are my favorites:

English

1.5K

Liam Schoneveld@liamschoneveld·30 Eki

@finbarrtimbers Perhaps limiting the representation space via the limited size of the codebook forces the network to better compress what's really important in the images.

English

finbarr@finbarrtimbers·29 Eki

I don’t understand quantized image tokens (VQ-VAE style). Why would we ever want to use them vs continuous visual features?

English

335

85K

Liam Schoneveld@liamschoneveld·28 Eki

@lvminzhang Is there a paper accompanying this code somewhere?

English

205

lllyasviel@lvminzhang·27 Eki

github.com/lllyasviel/IC-…

ZXX

339

49.1K

Liam Schoneveld@liamschoneveld·18 Eki

@janusch_patas Thanks!

English

MrNeRF@janusch_patas·17 Eki

@liamschoneveld Ups, did not remove the expiration date. You are welcome :) discord.gg/NqwTqVYVmj

English

MrNeRF@janusch_patas·9 Eki

I've started a Discord server for hacking on Gaussian Splatting, discussing radiance field papers, the latest view-independent episode, or just hanging out. If you're interested, feel free to join (invitation link in comments)!

English

27.5K

Liam Schoneveld@liamschoneveld·11 Eyl

@AlboMP Wow that was easy! Now could you quickly do gambling ads and mining companies paying no royalties!?

English

Anthony Albanese@AlboMP·11 Eyl

Our plan to legislate a minimum age for social media will support parents and protect children.

English

2.1K

124

904

361.1K

Liam Schoneveld@liamschoneveld·21 Tem

@senbmckenzie Are you suggesting we should raise teacher salaries? Good on you!

English

Senator The Hon. Bridget McKenzie@senbmckenzie·19 Tem

Big build projects blow out by $10.1 billion in the recent budget driven by CFMEU corruption that has been a major contributor to rising inflation. This Labor Government rewards their mates at the expense of everyday Australians. Study and work hard for years to earn a decent wage? Take a 3-day course, enter the CFMEU and become a stop-go worker and make $200k? That $4.2 million that the CFMEU chucked in Labor coffers at the last election sure paid off! Australia cannot take another 3 years of Labor and their mates draining tax-payer money that could be funding more infrastructure, more hospitals, and more services for all of the citizenry.

Senator The Hon. Bridget McKenzie tweet media

English

180

370

27K

Liam Schoneveld@liamschoneveld·4 Tem

@levelsio @whittomd Wow LEDs! What futuristic technology

English

227

@levelsio@levelsio·4 Tem

@whittomd But the same happened in Korea and they keep updating, same with China x.com/Rainmaker1973/…

Massimo@Rainmaker1973

The beautiful Tianfu IFC twin towers, Chengdu, Sichuan Province, China. [📹 lalinx0x] twitter.com/i/status/18019…

English

106

55.2K

@levelsio@levelsio·4 Tem

Japan is not futuristic Japan is a fossil stuck in 1990 99% of Westerners still don't get this Futuristic Asia is China, Korea, Vietnam, etc No Asian rates Japan for modernity at all

yifei e/λ (meetmeinshibuya april 26)@yifever

Japan is still on web 0.5, no one uses any apis, even the "tech" orgs are just dumping csv files back and forth. all arrangements are bespoke, and technical challenges solved 10 years ago are still blockers for huge conglomerates

English

903

2.3K

21.3K

3.3M

Liam Schoneveld@liamschoneveld·3 Tem

@techchildrights @laion_ai Coming from a human rights organization, I am sure you appreciate the importance of transparency. Without @laion_ai's ongoing AI transparency efforts, we would know very little about the data going into these models.

English

Hye Jung Han@techchildrights·3 Tem

🇦🇺NEW: The personal photos of Australian children are being secretly used to build powerful AI tools. Others are then using these tools to create malicious deepfakes, putting even more children at risk of serious harm. hrw.org/news/2024/07/0…

English

19.7K

Liam Schoneveld@liamschoneveld·26 Haz

@Saboo_Shubham_ @laion_ai This definitely wouldn’t work as well on papers that haven’t had 1000s of blog and Reddit etc posts written about it though

English

199

Shubham Saboo@Saboo_Shubham_·26 Haz

Claude 3.5 Sonnet transformed a research paper into an interactive learning dashboard in just 30 seconds. It goes beyond the capabilities of GPT-4o, Gemini Pro, Llama and other existing LLMs. Education will never be the same again with AI.

English

123

714

4.8K

678.7K

Liam Schoneveld@liamschoneveld·30 Nis

@dome_271 I actually had this problem a long time ago when trying to use ConvNets to generate audio. Perhaps looking at audio generative model literature may help as high frequency details are perhaps even more important in that domain.

English

dome | Outlier@dome_271·29 Nis

Does anyone else feel like diffusion models have a hard time generating high frequency details? Any experience / thoughts / ideas / pointers on that manner? We have observed often that our models don't generate high frequencies in images and have found it hard improving it.

English

19.9K

Keşfet

@SGiebenhain @TobiasKirschst1 @Michael_J_Black @MattNiessner @_davidedavoli_ @jiapeng_tang @minchoi @dome_271