Bin Yang

21 posts

Bin Yang banner
Bin Yang

Bin Yang

@binyangderek

Founder & CEO @breezeblueX, building next-gen realtime interaction layer.

Toronto, Canada เข้าร่วม Temmuz 2014
598 กำลังติดตาม102 ผู้ติดตาม
Jordan Dearsley
Jordan Dearsley@jordan_dearsley·
You can feel whether the voice on a call is a human or a machine before you can explain why. Today, @Vapi_AI is launching the Humanness Index™, a crowdsourced leaderboard for model humanness. You are the benchmark. Cast your first vote today: humannessindex.vapi.ai
English
5
1
26
7.4K
Bin Yang
Bin Yang@binyangderek·
@unilightwf fwiw, it's essentially a half-duplex model (speech in, text response out) with VAD integrated into the backbone as special tokens. They also defined some interesting interaction-related tasks (e.g., proactively respond to sounds).
English
0
0
1
128
Bin Yang
Bin Yang@binyangderek·
We are brewing☕️ a turbo version⚡️ of Bluebell tts model for realtime applications. Turns out, properly benchmarking end-to-end TTFA (time-to-first-audio, from client text -> tts provider server -> client intelligible speech) is nontrivial 👿 After we've sorted everything out (will open-source the benchmark tooling ofc), Bluebell-turbo-exp achieves the world's fastest TTFA 🥳 Guess who's the second one?
Bin Yang@binyangderek

~4 months in 2026 and we've seen 7 new voice design models released by different labs. Excited to see this new trend coming with @BreezeBlueX leading.

English
0
0
2
121
Bin Yang
Bin Yang@binyangderek·
Agree that it's the behavior that defines full-duplex, not architecture. Also, I would highlight that "forecasting" is an important capability of full-duplex systems as it significantly reduces the "perceived latency".
Desh Raj@rdesh26

x.com/i/article/2054…

English
0
0
1
168
Bin Yang
Bin Yang@binyangderek·
~4 months in 2026 and we've seen 7 new voice design models released by different labs. Excited to see this new trend coming with @BreezeBlueX leading.
Bin Yang tweet media
English
0
0
2
405
Bin Yang
Bin Yang@binyangderek·
@AmpCode when will we have opus 4.7 support?
English
0
0
0
29
Mr Panda
Mr Panda@PandaTalk8·
求推荐哪家tts 最具性价比,开源或闭源、付费或免费都可以? 性价比只的是既真实好用价格又跟不要钱一样。
中文
50
12
111
34.8K
Bin Yang
Bin Yang@binyangderek·
@DidiKieran Nice post! The follow-up work of VA-VAE, VTP (arxiv.org/abs/2512.13687) is also highly relevant, where the tokenizer is trained from scratch with a joint contrastive, self-supervised, and reconstruction objective to remove the dependency on pretrained representations.
English
0
0
1
247
Kieran Didi
Kieran Didi@DidiKieran·
Too many REPA / RAE / representation alignment papers lately? I was lost too, so I wrote a blog post that organizes the space into phases and zooms in on what actually matters for general/molecular ML. Curious what folks think - link below! 🔗 Blog: kdidi.netlify.app/blog/ml/2025-1…
Kieran Didi tweet media
English
9
92
534
79.3K
Bin Yang
Bin Yang@binyangderek·
@unilightwf from a tech perspective, human preference aligned TTS evaluation is hard. do you have any proposal here?
English
0
0
0
281
Wen-Chin Huang
Wen-Chin Huang@unilightwf·
Open-source TTS界隈、ありがたいんだけど「高品質」「高速」と言いながらなんお評価結果も載せていなくて謎すぎる😭
日本語
2
3
37
9.1K
Bin Yang
Bin Yang@binyangderek·
@eigensteve Hi, could you share your filming setup? This should be the common practice for all online courses/tutorials.
English
0
0
1
0
Steven Brunton
Steven Brunton@eigensteve·
First new video after being back from Sabbatical!! PDE 101: Separation of Variables... or how I learned to stop worrying and solve Laplace's equation One of the most important concepts in all of partial differential equations youtube.com/watch?v=VjWtMl…
YouTube video
YouTube
English
15
172
1.5K
0
Bin Yang รีทวีตแล้ว
Raquel Urtasun
Raquel Urtasun@RaquelUrtasun·
Today, together with my collaborators Andreas Geiger, Philip Lenz and Christoph Stiller, I was awarded the 2021 Everingham Prize for KITTI at #ICCV21, which enabled many breakthroughs in #SelfDrivingCars. Thank you, truly an honor!
Raquel Urtasun tweet media
English
24
16
332
0