Eustache Le Bihan

198 posts

@eustachelb

Speech & Audio @ Hugging Face 🤗

Joined August 2023
322 Following · 756 Followers
Julian Mack @Julianfmack
Happy to share what I've been working on recently: today we release Cohere Transcribe, a state-of-the-art speech recognition model that beats both commercial and open-source models to land at #1 on the Open ASR Leaderboard!
English
3
12
80
3.7K
Eustache Le Bihan @eustachelb
just `uv pip install -U transformers` and play with it!
English
0
0
2
157
Eustache Le Bihan @eustachelb
Been cooking again for you guys! Cohere ASR model, topping the open ASR leaderboard, is supported day 0 in transformers 🤠
English
3
3
30
2.1K
Eustache Le Bihan retweeted
Mistral AI for Developers @MistralDevs
In addition to vLLM, the Hugging Face Transformers library now supports Voxtral Realtime. Thanks to @eustachelb for leading the integration. We’ve been amazed by how quickly the OSS community has shipped implementations of Voxtral Realtime across platforms, backends, and use cases. We expect Transformers support to drive even wider adoption, especially across fine-tuning and quantization libraries. We want to thank Mergen Nachin, Salvatore Sanfilippo, TrevorS, Shreyas Karnik, Awni Hannun, and @limzba for their early contributions. Full list of community integrations: huggingface.co/mistralai/Voxt…
English
3
7
44
5.1K
Eustache Le Bihan retweeted
antirez @antirez
@eustachelb @julien_c @1littlecoder @MistralAI You can find all the architectural details in my voxtral.c GitHub repository, if it can help in some way. There were details scattered among mistral-common and vLLM that I reconstructed. Also note that the exact FFT points are crucial for the model to work well.
English
1
0
1
72
antirez @antirez
Yesterday @MistralAI released an open weights transcription model able to work in real time, Voxtral Mini 4B. Today, following the Whisper.cpp lesson, here is a C inference pipeline ready to use as a library, I hope you'll enjoy it: github.com/antirez/voxtra…
English
28
98
979
54.5K
Eustache Le Bihan @eustachelb
really nothing difficult, the vLLM implementation is unnecessarily complicated (since they don't use a conv cache, they have to provide more context, meaning sending longer overlapping audio chunks). The main complication comes from the STFT computation and where to cut when sending audio chunks
English
0
0
0
33
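For readers unfamiliar with the conv-cache point in the post above, here is a minimal sketch, assuming a toy 1-D causal convolution rather than the actual Voxtral Realtime front end. It is not the Transformers or vLLM code: the function names (`causal_conv1d`, `stream_with_cache`, `stream_with_overlap`) are hypothetical, and the STFT step is not modeled. It only illustrates that carrying the last `k - 1` input samples between chunks gives the same output as re-sending overlapping audio, while sending fewer samples.

```python
from typing import List, Optional, Tuple


def causal_conv1d(signal: List[float], kernel: List[float]) -> List[float]:
    """Reference: causal convolution over the whole signal, zero-padded on the left."""
    k = len(kernel)
    padded = [0.0] * (k - 1) + list(signal)
    return [
        sum(kernel[j] * padded[t + j] for j in range(k))
        for t in range(len(signal))
    ]


def stream_with_cache(
    chunk: List[float],
    kernel: List[float],
    cache: Optional[List[float]] = None,
) -> Tuple[List[float], List[float]]:
    """Process one chunk, using the last k-1 samples kept from earlier chunks."""
    k = len(kernel)
    if cache is None:
        cache = [0.0] * (k - 1)  # nothing seen yet: same as left zero-padding
    padded = list(cache) + list(chunk)
    out = [
        sum(kernel[j] * padded[t + j] for j in range(k))
        for t in range(len(chunk))
    ]
    # Keep the last k-1 input samples as context for the next chunk.
    new_cache = padded[len(padded) - (k - 1):]
    return out, new_cache


def stream_with_overlap(
    signal: List[float], kernel: List[float], chunk_size: int
) -> Tuple[List[float], int]:
    """No cache: each request re-sends up to k-1 earlier samples as context."""
    k = len(kernel)
    out: List[float] = []
    samples_sent = 0
    for start in range(0, len(signal), chunk_size):
        ctx_start = max(0, start - (k - 1))
        window = signal[ctx_start:start + chunk_size]
        samples_sent += len(window)
        y = causal_conv1d(window, kernel)
        # Drop outputs that belong to the re-sent context.
        out.extend(y[start - ctx_start:])
    return out, samples_sent
```

With an 8-sample signal, chunks of 4, and a 3-tap kernel, both streaming variants match the full convolution, but the overlap variant sends 10 samples where the cached variant sends 8.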
antirez @antirez
Yes, for the "easy work" part. The model inference was only half-specified via the vLLM nightly + mistral common Python stuff. I needed to find my way with the inference using Codex 5.2 xhigh, only way to understand enough details. Then, with MODEL.md, Claude did it mostly.
English
4
0
9
1.8K
Eustache Le Bihan retweeted
Seb Johnson @SebJohnsonUK
The founders of @huggingface after turning down $500m from Nvidia so they can stay independent
English
7
4
107
14.7K
Eustache Le Bihan retweeted
Arthur Zucker @art_zucker
Today is a big day, transformers v5 is FINALLY out!!
English
12
55
666
35.2K
Eustache Le Bihan retweeted
Omar Sanseviero @osanseviero
Introducing our latest open model: MedASR 🔬Speech to text model 🏥for healthcare-based voice applications 🤗available in Hugging Face ⚡️run with transformers Download right now huggingface.co/google/medasr
English
40
137
1.2K
81.4K
Eustache Le Bihan @eustachelb
Boom! Big release by Meta in the audio game: Sam Audio and Perception Encoder Audiovisual. "The core innovation in SAM Audio is the Perception Encoder Audiovisual engine"... and it's supported day 0 in transformers!
English
2
1
5
683
Eustache Le Bihan @eustachelb
Without it, using a new model often means diving into a complex research codebase, deciphering experimental features, and dealing with huge engineering overhead. Most people simply give up, and end up paying for a closed solution. transformers solves this. Learn one paradigm. Trust the integrations. Build with any open model. Move faster, not slower. It is the source of truth for open models, and I’m incredibly proud to help build it. Here’s to open source, Hugging Face, and the entire community. ❤️‍🔥
English
1
0
0
70
Eustache Le Bihan @eustachelb
transformers v5 is out! 3 million daily downloads, one of the core foundations of the open AI ecosystem. 4 years after v4, this milestone reminds us why we build open-source AI. It’s genuinely fun to work on, but as technicians, we need to think about the impact of the tools we’re creating.
English
1
0
3
144