
Eustache Le Bihan
198 posts

Eustache Le Bihan
@eustachelb
Speech & Audio @ Hugging Face 🤗




A bit late to the party but here is my take at running Kyutai's Pocket TTS 🗣️in the browser. Rust compiled to wasm, single threaded CPU only, using simd128, and it's running in real-time on my Pixel 8a without quantization. laurentmazare.github.io/pocket-tts/


New speech recognition models are announced on X almost every day nowadays. But not everyday you see a 250M parameter model beat the 1.5GB Whisper Large v3. Today we are announcing Moonshine Streaming. HF Link: huggingface.co/UsefulSensors/… Paper draft: download.moonshine.ai/docs/moonshine…








Introducing Voxtral Transcribe 2, next-gen speech-to-text models by @MistralAI. State-of-the-art transcription, speaker diarization, sub-200ms real-time latency. Details in 🧵

NVIDIA just dropped PersonaPlex-7B 🤯 A full-duplex voice model that listens and talks at the same time. No pauses. No turn-taking. Real conversation. 100% open source. Free. Voice AI just leveled up. huggingface.co/nvidia/persona…










