Michael I Mandel

190 posts

@asterix77

Brooklyn, NY · Joined September 2007
296 Following · 198 Followers
Michael I Mandel retweeted
AI at Meta @AIatMeta
We’re thrilled to see our advanced ML models and EMG hardware — that transform neural signals controlling muscles at the wrist into commands that seamlessly drive computer interactions — appearing in the latest edition of @Nature. Read the story: nature.com/articles/s4158… Find more details on this work and the models on @github: github.com/facebookresear…
49
271
1.3K
168.3K
Michael I Mandel retweeted
Vish Sivakumar @_vishar0
1/ We just open-sourced two large wrist electromyography (EMG) datasets - one towards typing without a keyboard and the other for predicting hand poses - with baselines. We believe these will help advance research into making high bandwidth non-invasive neuromotor interfaces a reality!
1
21
50
5.6K
Michael I Mandel retweeted
Alon Vinnikov @AlonVin
You might think speech recognition is "solved" with models such as @OpenAI’s Whisper, but it's not. Natural conversations with distant microphones still lack effective solutions.

To illustrate, on our newly released NOTSOFAR meeting benchmark, Whisper large-v3 with head-mounted mics achieves 9.3% WER (word-error-rate), yet on audio from a distant mic it climbs to 37.4% WER. The culprits are reverberation, noise, and overlapping speech, which interfere with the source signal.

What's the missing ingredient? We believe it's datasets. The problem is not amenable to web scraping. Benchmarking datasets are scarce given their complex collection process. Microphone arrays, useful for speech separation, are rarely featured in labeled datasets, necessitating simulation to teach neural networks to utilize such arrays.

To bridge the gap our team at @Microsoft has released a benchmarking dataset of 280 recorded meetings, and a 1000-hour simulated training set synthesized for real-world generalization.

Join our challenge "NOTSOFAR: Distant Meeting Transcription with a Single Device", part of CHiME-8, to explore these resources and advance the field.

Details and registration: aka.ms/chime8
Code and datasets: aka.ms/notsofar
1
14
38
9K
Michael I Mandel retweeted
David Sussillo @SussilloDavid
1/7 For the past decade, our team at Meta Reality Labs (previously CTRL-labs) has been dedicated to developing a neuromotor interface. Our goal is to address the Human Computer Interaction challenge of providing effortless, intuitive, and efficient input to computers.
56
390
1.8K
641.6K
Michael I Mandel @asterix77
@mclduk @sarabssethi We have lots of recordings from Alaska's north slope with some anthrophony. I think it's mostly cars and stationary oil machinery, but let me check.
2
0
1
0
Sarab Sethi @sarabssethi
Does anyone have eco-acoustic data from cold regions with snowmobiles / snow-scooters in the background? If so please reach out!
2
12
7
0
Pete Skomoroch @peteskomoroch
What are the most popular machine learning podcasts right now?
35
48
438
0
Michael I Mandel @asterix77
Announcing the CHiME-6 Speech Separation and Recognition Challenge: spandh.dcs.shef.ac.uk/chime_challeng…
Track 1: (repeat of CHiME-5) multichannel, multi-device speech recognition at dinner parties
Track 2: same, but with multichannel, multi-device speaker diarization first
1
3
8
0
Michael I Mandel @asterix77
Keynote speakers announced for #chimeworkshop 2020: Paola García (Johns Hopkins University) and Dong Yu (Tencent AI Lab)
0
0
1
0