WAVLab | @CarnegieMellon

320 posts

WAVLab | @CarnegieMellon banner
WAVLab | @CarnegieMellon

WAVLab | @CarnegieMellon

@WavLab

Shinji Watanabe's Audio and Voice Lab | WAVLab @LTIatCMU @SCSatCMU | Speech Recognition, Speech Enhancement, Spoken Language Understanding, and more.

Katılım Ağustos 2021
147 Takip Edilen2.4K Takipçiler
WAVLab | @CarnegieMellon retweetledi
Shinji Watanabe
Shinji Watanabe@shinjiw_at_cmu·
We are looking for a postdoctoral researcher in speech and audio processing, with a possible start in the Fall 2026 semester. If you are interested in working with us, please apply through the following form: forms.gle/gfENMMrRf1nmnT…
English
1
24
57
7.9K
WAVLab | @CarnegieMellon
WAVLab @ #ICASSP2026 We will present 8 papers at ICASSP in Barcelona. If you are attending, please stop by the talks/posters and chat with the authors. arXiv links and presentation info below. 1/5
English
4
3
22
1.7K
WAVLab | @CarnegieMellon
Congrats to Brian @brianyan918 on finishing his PhD defense today! It was great to see so many people show up for this big event and celebrate such an important milestone. Wishing you all the best in what comes next!
WAVLab | @CarnegieMellon tweet media
English
0
1
18
921
WAVLab | @CarnegieMellon retweetledi
Shinji Watanabe
Shinji Watanabe@shinjiw_at_cmu·
6 papers (4 main and 2 findings) were accepted at #ACL2026! All are speech papers :)
Shinji Watanabe tweet media
English
1
10
97
4.8K
WAVLab | @CarnegieMellon retweetledi
arXiv Sound
arXiv Sound@ArxivSound·
Shikhar Bharadwaj, Chin-Jou Li, Kwanghee Choi, Eunjung Yeo, William Chen, Shinji Watanabe, David R. Mortensen, "An Empirical Recipe for Universal Phone Recognition," arxiv.org/abs/2603.29042
English
0
6
14
3K
WAVLab | @CarnegieMellon
Congratulations to Li-Wei @liweiche77 on successfully defending his PhD today! 🎉 Wishing him all the best in his next chapter!
WAVLab | @CarnegieMellon tweet media
English
0
4
20
1.4K
WAVLab | @CarnegieMellon
Congratulations to Siddhant @Sid_Arora_18 on a successful PhD defense today! It was wonderful to celebrate this big milestone together. Wishing him all the best for the exciting journey ahead.
WAVLab | @CarnegieMellon tweet mediaWAVLab | @CarnegieMellon tweet media
English
4
5
54
3.7K
WAVLab | @CarnegieMellon retweetledi
Natural Language Processing Papers
PRiSM: Benchmarking Phone Realization in Speech Models Shikhar Bharadwaj, Chin-Jou Li, Yoonjae Kim, Kwanghee Choi, Eunjung Yeo, Ryan Soh-Eun Shim, Hanyu Zhou, Brendon Boldt, Karen Rosero Jacome, Kalvin Chang, Darsh Agrawal, … arxiv.org/abs/2601.14046 [𝚌𝚜.𝙲𝙻 𝚌𝚜.𝚂𝙳]
Natural Language Processing Papers tweet media
Indonesia
0
4
6
449
WAVLab | @CarnegieMellon retweetledi
arXiv Sound
arXiv Sound@ArxivSound·
Chenda Li, Wei Wang, Marvin Sach, Wangyou Zhang, Kohei Saijo, Samuele Cornell, Yihui Fu, Zhaoheng Ni, Tim Fingscheidt, Shinji Watanabe, Yanmin Qian, "ICASSP 2026 URGENT Speech Enhancement Challenge," arxiv.org/abs/2601.13531
Deutsch
0
3
12
836
WAVLab | @CarnegieMellon retweetledi
arXiv Sound
arXiv Sound@ArxivSound·
Pu Wang, Shinji Watanabe, Hugo Van hamme, "SSVD-O: Parameter-Efficient Fine-Tuning with Structured SVD for Speech Recognition," arxiv.org/abs/2601.12600
English
0
2
5
402
WAVLab | @CarnegieMellon retweetledi
arXiv Sound
arXiv Sound@ArxivSound·
Shih-Heng Wang, Jiatong Shi, Jinchuan Tian, Haibin Wu, Shinji Watanabe, "Do Neural Codecs Generalize? A Controlled Study Across Unseen Languages and Non-Speech Tasks," arxiv.org/abs/2601.12205
English
0
3
16
846
WAVLab | @CarnegieMellon retweetledi
jiatongshi
jiatongshi@jiatongshi·
Heading to NeurIPS 2025 in San Diego! I’ll present our spotlight poster, ARECHO, focusing on speech multi-metric estimation. 📍 Exhibit Hall C,D,E #2000 🗓️ Thu Dec 4, 11 a.m.–2 p.m. PST If you’re around, let’s say hi or grab a coffee!
English
1
3
24
1.4K
WAVLab | @CarnegieMellon retweetledi
jiatongshi
jiatongshi@jiatongshi·
This is exactly the reason we worked for ESPnet-Codec, but being really hard to keep tracking as people are fast nowadays. The similar issue happens at most speech tasks from ASR, TTS, to general speech LLM. It's a bit sad time for driving scientific findings 🥲
🐿️🐒🗻📚🐹🦈@SythonUK

ヌラールオヂオーコデクの論文、全く違うデータで学習されたモデルを比較して「ワイらのモデル最強や!!😤😤😤」と主張しているものばかりで😩😩😩😩😩😩😩😩😩😩😩に関するMOS値が1000000になった

English
4
4
29
4.5K
WAVLab | @CarnegieMellon retweetledi
jiatongshi
jiatongshi@jiatongshi·
Speech isn’t just sound -> it’s how we turn thought into expression. Our new work, Speech-DRAME, measures how well speech AI can act, aligning evaluation with human perception. Paper: arxiv.org/abs/2511.01261 Code: github.com/Anuttacon/spee…
English
1
5
25
4K
WAVLab | @CarnegieMellon retweetledi
jiatongshi
jiatongshi@jiatongshi·
🚀 I’m open to new opportunities in industry! Ph.D. candidate @CMU (advisor: @shinjiw_at_cmu ). Research: speech/audio AI, speech LLMs, evaluation frameworks. Ex-Meta AI, Tencent, IBM Research. DMs open — let’s connect!
English
1
16
55
5.8K