Liam Schoneveld

118 posts

Liam Schoneveld banner
Liam Schoneveld

Liam Schoneveld

@liamschoneveld

Computer vision researcher @ Woven by Toyota.

Tokyo Katılım Aralık 2011
377 Takip Edilen83 Takipçiler
Liam Schoneveld retweetledi
Matthias Niessner
Matthias Niessner@MattNiessner·
📢Pix2NPHM: Learning to Regress NPHM Reconstructions From a Single Image📢 We directly regress neural parametric head models (NPHMs) from a single image — fast, stable, and significantly more expressive than classical 3DMMs such as FLAME. Face tracking & 3D reconstruction are often limited by the representational capacity of PCA-based face models. By lifting NPHMs to a first-class reconstruction primitive, we enable more accurate geometry, richer expressions, and finer animation control. Pix2NPHM obtains fast and reliable NPHM reconstructions on real-world data. Inference-time optimization against surface normals and canonical point maps can further increase fidelity. Key to successful and generalized training of our ViT-based network are: (1) large-scale registration of existing 3D head datasets, and (2) self-supervised training on vast in-the-wild 2D video datasets using pseudo ground-truth surface normals. Finally, we show that geometry-aware pretraining on pixel-aligned reconstruction tasks significantly outperforms generic visual pretraining (e.g., DINO-style features) in terms of generalization. 🌍simongiebenhain.github.io/Pix2NPHM 🎥youtu.be/MgpEJC5p1Ts Great work by @SGiebenhain, @TobiasKirschst1, @liamschoneveld, Davide Davoli, Zhe Chen
YouTube video
YouTube
English
15
80
542
37.6K
Michael Black
Michael Black@Michael_J_Black·
@MattNiessner Nice. I’ve been wanting to replace the old photometric loss with splatting. Results look great.
English
1
0
9
1.5K
Matthias Niessner
Matthias Niessner@MattNiessner·
📢 SHeaP: Self-Supervised Head Predictor Learned via 2D Gaussians 📢 Given a single input image, we predict accurate 3D head geometry, pose, and expression. Previous works (e.g. DECA, EMOCA) use differentiable mesh rasterization to learn a self-supervised head geometry predictor via a photometric reconstruction loss. We borrow these ideas, but our key insight is to replace the mesh rendering with 2D Gaussian Splatting. This leads to much higher accuracy of the underlying predicted geometry and thus more gradient signal during training. 🌍 nlml.github.io/sheap/ 🎥 youtu.be/vhXsZJWCBMA/ Great work by @liamschoneveld @_davidedavoli_ @jiapeng_tang
YouTube video
YouTube
English
3
60
341
28.5K
Liam Schoneveld
Liam Schoneveld@liamschoneveld·
📢 Our new paper - SHeaP - is out! 📢 TLDR: self-supervised head tracking and geometry (FLAME) prediction, learned via photometric loss with a 2D gaussian splatting renderer. See more: 🌍 nlml.github.io/sheap/ 🎥 youtu.be/vhXsZJWCBMA/
YouTube video
YouTube
English
0
0
1
92
Liam Schoneveld
Liam Schoneveld@liamschoneveld·
@minchoi Are its predictions in a local or world coordinate system?
English
0
0
0
27
Min Choi
Min Choi@minchoi·
This is GenHMR. New AI research for 3D human modeling. Turn a single image into a lifelike 3D human model. Handles tricky poses, occlusions, & depth issues with ease. 10 examples: 1. Chasing
English
35
134
1.1K
176.7K
Liam Schoneveld
Liam Schoneveld@liamschoneveld·
@dome_271 Classifier-free guidance always seemed like some weird hack to me. There must be a more mathematically elegant solution out there, waiting to be found.
English
0
0
0
104
dome | Outlier
dome | Outlier@dome_271·
Do you all think there is something still fundamentally suboptimal in diffusion models? Just the reason that we need to rely on cfg-style sampling so much seems weird. Not using cfg just still looks very bad.
English
10
1
38
5.6K
MrNeRF
MrNeRF@janusch_patas·
URAvatar: Universal Relightable Gaussian Codec Avatars Contributions (cited): (1) We introduce a universal relightable avatar prior model learned from hundreds of dynamic performance captures with a multi-view and multi-light system. (2) We build a drivable head avatar from a phone scan that can be rendered and relit with global light transport in real-time. (3) A capture system and evaluation protocol to measure the accuracy of relighting under continuous illuminations.
English
6
80
680
50.1K
Liam Schoneveld
Liam Schoneveld@liamschoneveld·
@MartinGTobias Most of these measures make sense to me? As someone working in tech, I think it’s well worth spending a little money to encourage more women into the field.
English
0
0
0
19
Liam Schoneveld
Liam Schoneveld@liamschoneveld·
@finbarrtimbers Perhaps limiting the representation space via the limited size of the codebook forces the network to better compress what's really important in the images.
English
0
0
1
61
finbarr
finbarr@finbarrtimbers·
I don’t understand quantized image tokens (VQ-VAE style). Why would we ever want to use them vs continuous visual features?
English
25
11
335
85K
MrNeRF
MrNeRF@janusch_patas·
I've started a Discord server for hacking on Gaussian Splatting, discussing radiance field papers, the latest view-independent episode, or just hanging out. If you're interested, feel free to join (invitation link in comments)!
English
3
5
90
27.5K
Liam Schoneveld
Liam Schoneveld@liamschoneveld·
@AlboMP Wow that was easy! Now could you quickly do gambling ads and mining companies paying no royalties!?
English
0
0
0
10
Anthony Albanese
Anthony Albanese@AlboMP·
Our plan to legislate a minimum age for social media will support parents and protect children.
Anthony Albanese tweet media
English
2.1K
124
904
361.1K
Senator The Hon. Bridget McKenzie
Big build projects blow out by $10.1 billion in the recent budget driven by CFMEU corruption that has been a major contributor to rising inflation. This Labor Government rewards their mates at the expense of everyday Australians. Study and work hard for years to earn a decent wage? Take a 3-day course, enter the CFMEU and become a stop-go worker and make $200k? That $4.2 million that the CFMEU chucked in Labor coffers at the last election sure paid off! Australia cannot take another 3 years of Labor and their mates draining tax-payer money that could be funding more infrastructure, more hospitals, and more services for all of the citizenry.
Senator The Hon. Bridget McKenzie tweet media
English
180
68
370
27K
@levelsio
@levelsio@levelsio·
Japan is not futuristic Japan is a fossil stuck in 1990 99% of Westerners still don't get this Futuristic Asia is China, Korea, Vietnam, etc No Asian rates Japan for modernity at all
yifei e/λ (meetmeinshibuya april 26)@yifever

Japan is still on web 0.5, no one uses any apis, even the "tech" orgs are just dumping csv files back and forth. all arrangements are bespoke, and technical challenges solved 10 years ago are still blockers for huge conglomerates

English
903
2.3K
21.3K
3.3M
Liam Schoneveld
Liam Schoneveld@liamschoneveld·
@techchildrights @laion_ai Coming from a human rights organization, I am sure you appreciate the importance of transparency. Without @laion_ai's ongoing AI transparency efforts, we would know very little about the data going into these models.
English
0
0
2
10
Hye Jung Han
Hye Jung Han@techchildrights·
🇦🇺NEW: The personal photos of Australian children are being secretly used to build powerful AI tools. Others are then using these tools to create malicious deepfakes, putting even more children at risk of serious harm. hrw.org/news/2024/07/0…
English
4
22
20
19.7K
Liam Schoneveld
Liam Schoneveld@liamschoneveld·
@Saboo_Shubham_ @laion_ai This definitely wouldn’t work as well on papers that haven’t had 1000s of blog and Reddit etc posts written about it though
English
0
0
0
199
Shubham Saboo
Shubham Saboo@Saboo_Shubham_·
Claude 3.5 Sonnet transformed a research paper into an interactive learning dashboard in just 30 seconds. It goes beyond the capabilities of GPT-4o, Gemini Pro, Llama and other existing LLMs. Education will never be the same again with AI.
English
123
714
4.8K
678.7K
Liam Schoneveld
Liam Schoneveld@liamschoneveld·
@dome_271 I actually had this problem a long time ago when trying to use ConvNets to generate audio. Perhaps looking at audio generative model literature may help as high frequency details are perhaps even more important in that domain.
English
0
0
3
50
dome | Outlier
dome | Outlier@dome_271·
Does anyone else feel like diffusion models have a hard time generating high frequency details? Any experience / thoughts / ideas / pointers on that manner? We have observed often that our models don't generate high frequencies in images and have found it hard improving it.
English
19
4
53
19.9K