Julian Mack
@Julianfmack

253 posts

ML researcher. Multimodal, foundations @cohere

Joined May 2014
581 Following · 2.3K Followers

Pinned Tweet
Julian Mack@Julianfmack·
Happy to share what I've been working on recently: today we release Cohere Transcribe, a state-of-the-art speech recognition model that beats both commercial and open-source models to land at #1 on the Open ASR Leaderboard!
[image]
3 replies · 12 reposts · 80 likes · 3.7K views
Julian Mack reposted
Victor M@victormustar·
Very hyped by the new Cohere Transcribe model 🌍 Works surprisingly well on bad quality audio when the mic doesn't cooperate. 2B params, 14 supported languages and it's Apache 2.0. try the official Hugging Face demo ⬇️
13 replies · 30 reposts · 306 likes · 20.2K views
Julian Mack@Julianfmack·
@kushtrimvisoka Our tokenizer does use byte fallback, though, so while a totally new character set will be challenging for the current vocab, it's not an absolute constraint
1 reply · 0 reposts · 0 likes · 20 views
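A minimal sketch of the byte-fallback idea described in the two replies above, using a toy two-word vocabulary (this is illustrative only, not Cohere's actual tokenizer; real implementations such as SentencePiece reserve 256 dedicated byte tokens):

```python
# Toy subword vocab; byte tokens are appended after it.
VOCAB = {"hello": 0, "world": 1}
BYTE_OFFSET = len(VOCAB)

def encode(word):
    """Return subword token ids, falling back to UTF-8 byte tokens."""
    if word in VOCAB:
        return [VOCAB[word]]
    # Fallback: one token per UTF-8 byte, so any character set can be
    # represented even if it never appeared in the tokenizer's training data.
    return [BYTE_OFFSET + b for b in word.encode("utf-8")]

print(encode("hello"))  # in-vocab word -> single subword token [0]
print(encode("ş"))      # out-of-vocab character -> two byte tokens
```

This is why an unseen script is "challenging but not an absolute constraint": it always remains encodable as bytes, just at a much worse tokens-per-character ratio.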
Julian Mack@Julianfmack·
@kushtrimvisoka We haven't tested this but we'd be very interested in your results if you try! The main practical constraint is the tokenizer, which covers Latin, Greek, Arabic, Chinese, Japanese kana and Korean Hangul. Adaptation to languages outside these would need tokenizer changes
1 reply · 0 reposts · 0 likes · 19 views
Julian Mack reposted
Cohere@cohere·
Introducing: Cohere Transcribe – a new state-of-the-art in open source speech recognition.
81 replies · 295 reposts · 2.6K likes · 591.1K views
Julian Mack reposted
Pierre Richemond 🇪🇺
Excited and proud to introduce our latest: Cohere Transcribe, the best dedicated ASR model in the world. #1 EN HF leaderboard, SotA human evals, ahead of ElevenLabs, Qwen3, Mistral, Kyutai, and OpenAI. 14 supported languages. Apache 2.0, on HF for you to try. Our first audio model and a key step in powering North experiences. huggingface.co/CohereLabs/coh…
[image]
3 replies · 23 reposts · 112 likes · 13.9K views
Julian Mack@Julianfmack·
We validated our quality in human preference evaluations. In head-to-head comparisons we come out ahead (>50% win-rate) against all competitors. Meaning preservation was the key criterion, but we also wanted well-formatted, verbatim responses with correctly cased proper nouns
[image]
1 reply · 0 reposts · 4 likes · 195 views
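The win-rate metric in the tweet above reduces to a simple count over pairwise judgments. A sketch with made-up judgment data (not Cohere's evaluation set), scoring ties as half a win as is common in preference evals:

```python
# Each judgment says which of two transcripts annotators preferred.
judgments = ["A", "A", "B", "tie", "A", "B", "A"]  # toy data

# Count ties as half a win for each side.
wins = judgments.count("A") + 0.5 * judgments.count("tie")
win_rate = wins / len(judgments)

print(f"win rate for A: {win_rate:.2%}")  # above 50% means A is preferred overall
```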
Julian Mack@Julianfmack·
@jeankaddour Maybe an annealed aux loss formatting term to put special tokens <start/stop_thinking> in the right place during sft? As the trajectory is ~unchanged after 250, the aux term isn't adding new knowledge vs the baseline
0 replies · 0 reposts · 0 likes · 112 views
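The "annealed aux loss" guess in the reply above can be sketched as a weighted sum whose auxiliary weight decays over training. Everything here is hypothetical (toy loss values, a linear anneal schedule chosen for illustration):

```python
def aux_weight(step, total_steps, w0=1.0):
    """Linearly anneal the auxiliary weight from w0 down to 0."""
    return w0 * max(0.0, 1.0 - step / total_steps)

def total_loss(task_loss, formatting_loss, step, total_steps):
    # Task loss is always on; the formatting term (e.g. penalizing misplaced
    # special tokens) fades out, so late training matches the baseline objective.
    return task_loss + aux_weight(step, total_steps) * formatting_loss

print(total_loss(2.0, 0.5, step=0, total_steps=1000))     # aux term fully on
print(total_loss(2.0, 0.5, step=1000, total_steps=1000))  # aux term annealed away
```

Once the weight reaches zero the objective is identical to the baseline, consistent with the observation that the trajectory is roughly unchanged after the anneal.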
Jean Kaddour@jeankaddour·
ML interview question: What is happening here?
[image]
156 replies · 19 reposts · 564 likes · 145.3K views
Julian Mack reposted
Davis Blalock@davisblalock·
🚀 Today we’re releasing FlashOptim: better implementations of Adam, SGD, etc, that compute the same updates but save tons of memory. You can use it right now via `pip install flashoptim`. 🚀 arxiv.org/abs/2602.23349 A bunch of cool ideas make this possible: [1/n]
[image]
30 replies · 228 reposts · 1.6K likes · 212.9K views
Julian Mack reposted
gavin leech (Non-Reasoning)@g_leech_·
New paper on a long-shot I've been obsessed with for a year: How much are AI reasoning gains confounded by expanding the training corpus 10000x? How much LLM performance is down to "local" generalisation (pattern-matching to hard-to-detect semantically equivalent training data)?
[image] [image]
32 replies · 133 reposts · 967 likes · 221.5K views
Julian Mack reposted
Siyan Zhao@siyan_zhao·
Introducing 💡On-Policy Self-Distillation💡, a simple method that enables an LLM to teach itself with dense per-token feedback on its own on-policy generations—achieving 4-8x more token efficiency vs. GRPO and outperforming both GRPO and SFT/Off-Policy Distillation.

Key insight: like a student reviewing solutions, rationalizing them, and correcting prior mistakes, an LLM can be conditioned on privileged info (e.g., a correct solution or a reasoning trace) and supervise its weaker self—the version without such access—by matching the privileged-info-induced distribution from itself.

🌐Blog: siyan-zhao.github.io/blog/2026/opsd/ 🧵👇
[image]
31 replies · 157 reposts · 921 likes · 131.6K views
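The core mechanism described above (match the privileged-conditioned distribution per token) can be sketched as a per-token KL loss. The distributions below are toy numbers over a 3-token vocabulary, not outputs of the actual method:

```python
import math

def kl(p, q):
    """KL divergence between two discrete distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Per-token next-token distributions: the same model conditioned on privileged
# info (e.g. the correct solution) acts as teacher for its unconditioned self.
teacher = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]  # with privileged info
student = [[0.4, 0.4, 0.2], [0.2, 0.6, 0.2]]  # without privileged info

# Dense per-token feedback: one KL term per generated token, in contrast to a
# single scalar reward per sequence as in GRPO-style RL.
loss = sum(kl(t, s) for t, s in zip(teacher, student)) / len(teacher)
print(f"distillation loss: {loss:.4f}")
```

Minimizing this loss pulls the unprivileged model toward the distribution its privileged self induces, which is where the claimed token-efficiency gain over scalar-reward methods comes from.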
Julian Mack reposted
Cohere Labs@Cohere_Labs·
Global AI deserves reproducible and transparent evaluation. 🌎 With Global MMLU Lite now part of @kaggle Benchmarks, you can track the multilingual performance of top models as well as test your own! Check out the leaderboard and notebook linked below.
[image]
1 reply · 10 reposts · 19 likes · 7.4K views
Julian Mack reposted
Dwarak@DwaraknathG·
I am hiring highly skilled performance engineers for my team! You will be working on optimising pretraining for models >100B params on O(1000s) of GPUs, and hardware-aligned architecture design. We are cooking a lot of very exciting projects and I can safely say you will have a lot of fun! Link in thread. <3
14 replies · 45 reposts · 458 likes · 67.1K views