Alexander Richard

46 posts

Alexander Richard

Alexander Richard

@AlexRichardCS

I'm a research scientist @Meta in Pittsburgh where I work on audio-visual modeling for photorealistic avatars.

Bergabung Haziran 2020
172 Mengikuti739 Pengikut
Alexander Richard
Alexander Richard@AlexRichardCS·
‼️500h of 3D motion data released‼️ Our team at the Codec Avatars Lab just released a large scale dataset of 3D tracked human motion, including audio and text annotations. Check it out here: meta.com/emerging-tech/…
English
6
37
173
31.6K
Alexander Richard
Alexander Richard@AlexRichardCS·
Working on your next big speech paper but still looking for a suitable dataset? Check out EARS: 100h of full-band expressive, anechoic recordings of speech from 107 speakers with 22 different emotions, 7 different reading styles, and more. sp-uhh.github.io/ears_dataset/
Alexander Richard tweet media
English
0
15
56
3.7K
Alexander Richard me-retweet
Changan Chen
Changan Chen@changanvr·
📢Curious about the future of 3D scene understanding? Join us at the 1st Workshop on Multimodalities for 3D Scenes @CVPR! Learn about the latest research on using vision, audio, touch, and language to understand 3D scenes around us. 📅June 17, 1:30-5:20PM multimodalitiesfor3dscenes.github.io
Changan Chen tweet media
English
2
7
42
4.4K
Alexander Richard me-retweet
Boz
Boz@boztank·
Always good to see the progress we’re making in our labs in NYC and (in the case of these photos) Pittsburgh...
Boz tweet mediaBoz tweet mediaBoz tweet media
English
29
19
339
63K
Miguel | AP
Miguel | AP@angrypenguinPNG·
Quick test of Meta's Photoreal Embodiment 🤖 Audio to Synthesized Human Movement in under 10 mins ⚡️ Weights, Code, and Dataset: github.com/facebookresear…
English
10
25
118
18.1K
Alexander Richard
Alexander Richard@AlexRichardCS·
@danveloper The restriction comes from our data collection: we only train on dyadic conversations, so the model has never seen 3+ people in the same environment.
English
0
0
1
324
Dan Woods
Dan Woods@danveloper·
@AlexRichardCS This is really cool, Alex! Is the reason it’s constrained to dyadic conversation because of speaker diarization problems across a long running conversation?
English
1
0
0
440
Alexander Richard me-retweet
Yossi Adi
Yossi Adi@adiyossLC·
A new generalized and universal audio-visual speech enhancement model, powered by SSL. A single model for denoising, source sep, inpainting and lip-reading! Checkout the paper and demo video!! 🗣️🤖🔊 More details below with @mhnt1580
Wei-Ning Hsu@mhnt1580

📢New paper We are announcing ReVISE, the first universal audio-visual speech enhancement model powered by SSL. paper: arxiv.org/pdf/2212.11377… demo: wnhsu.github.io/ReVISE w/ @adiyossLC @TalRemez @BowenSh08204149 @_JacobDonley

English
0
1
10
1.2K
Alexander Richard me-retweet
Emre Aksan
Emre Aksan@emreaksan·
📢Excited to share our recent #ECCV2022 paper: "LiP-Flow: Learning Inference-Time Priors for Codec Avatars via Normalizing Flows in Latent Space" with Shugao Ma, @akcalakcal, Stanislav Pidhorskyi, @AlexRichardCS, Shih-En Wei, Jason Saragih and @OHilliges.
GIF
English
1
6
14
0
Alexander Richard
Alexander Richard@AlexRichardCS·
**Dataset Release!** We released high-quality 3D face data of 13 persons captured with up to 150 cameras while performing 100+ facial expressions! github.com/facebookresear…
GIF
English
1
9
50
0