Thomas Wolf

5K posts

@Thom_Wolf

Co-founder at @HuggingFace - moonshots - angel

Joined February 2011
7.1K Following · 113.6K Followers
Pinned Tweet
Thomas Wolf@Thom_Wolf·
Shifting structures in a software world dominated by AI. Some first-order reflections (TL;DR at the end): Reducing software supply chains, the return of software monoliths – When rewriting code and understanding large foreign codebases becomes cheap, the incentive to rely on deep dependency trees collapses. Writing from scratch ¹ or extracting the relevant parts from another library is far easier when you can simply ask a code agent to handle it, rather than spending countless nights diving into an unfamiliar codebase. The reasons to reduce dependencies are compelling: a smaller attack surface for supply chain threats, smaller packaged software, improved performance, and faster boot times. By leveraging the tireless stamina of LLMs, the dream of coding an entire app from bare-metal considerations all the way up is becoming realistic. End of the Lindy effect – The Lindy effect holds that things which have been around for a long time are there for good reason and will likely continue to persist. It's related to Chesterton's fence: before removing something, you should first understand why it exists, which means removal always carries a cost. But in a world where software can be developed from first principles and understood by a tireless agent, this logic weakens. Older codebases can be explored at will; long-standing software can be replaced with far less friction. A codebase can be fully rewritten in a new language. ² Legacy software can be carefully studied and updated in situations where humans would have given up long ago. The catch: unknown unknowns remain unknown. The true extent of AI's impact will hinge on whether complete coverage of testing, edge cases, and formal verification is achievable. In an AI-dominated world, formal verification isn't optional—it's essential. The case for strongly typed languages – Historically, programming language adoption has been driven largely by human psychology and social dynamics. 
A language's success depended on a mix of factors: individual considerations like being easy to learn and simple to write correctly; community effects like how active and welcoming a community was, which in turn shaped how fast its ecosystem would grow; and fundamental properties like provable correctness, formal verification, and striking the right balance between dynamic and static checks—between the freedom to write anything and the discipline of guarding against edge cases and attacks. As the human factor diminishes, these dynamics will shift. Less dependence on human psychology will favor strongly typed, formally verifiable and/or high performance languages.³ These are often harder for humans to learn, but they're far better suited to LLMs, which thrive on formal verification and reinforcement learning environments. Expect this to reshape which languages dominate. Economic restructuring of open source – For decades, open-source communities have been built around humans finding connection through writing, learning, and using code together. In a world where most code is written—and perhaps more importantly, read—by machines, these incentives will start to break down.⁴ Communities of AIs building libraries and codebases together will likely emerge as a replacement, but such communities will lack the fundamentally human motivations that have driven open source until now. If the future of open-source development becomes largely devoid of humans, alignment of AI models won't just matter—it will be decisive. The future of new languages – Will AI agents face the same tradeoffs we do when developing or adopting new programming languages? Expressiveness vs. simplicity, safety vs. control, performance vs. abstraction, compile time vs. runtime, explicitness vs. conciseness. It's unclear that they will. In the long term, the reasons to create a new programming language will likely diverge significantly from the human-driven motivations of the past. 
There may well be an optimal programming language for LLMs—and there's no reason to assume it will resemble the ones humans have converged on.
TL;DR:
- Monoliths return – cheap rewriting kills dependency trees; smaller attack surface, better performance, bare-metal becomes realistic
- Lindy effect weakens – legacy code loses its moat, but unknown unknowns persist; formal verification becomes essential
- Strongly typed languages rise – human psychology mattered for adoption; now formal verification and RL environments favor types over ergonomics
- Open source restructures – human connection drove the community; AI-written/read code breaks those incentives; alignment becomes decisive
- New languages diverge – AI may not share our tradeoffs; optimal LLM programming languages may look nothing like what humans converged on
¹ x.com/mntruell/statu…
² x.com/anthropicai/st…
³ wesmckinney.com/blog/agent-erg…
⁴ github.com/tailwindlabs/t…
Thomas Wolf@Thom_Wolf·
This is really cool. It got me thinking more deeply about personalized RL: what’s the real point of personalizing a model in a world where base models can become obsolete so quickly? The reality in AI is that new models ship every few weeks, each better than the last. And the pace is only accelerating, as we see on the Hugging Face Hub. We are not far away from better base models dropping daily. There’s a research gap in RL here that almost no one is working on. Most LLM personalization research assumes a fixed base model, but very few ask what happens to that personalization when you swap the base model. Think about going from Llama 3 to Llama 4. All the tuned preferences, reward signals, and LoRAs are suddenly tied to yesterday’s model. As a user or a team, you don’t want to reteach every new model your preferences. But you also don’t want to be stuck on an older one just because it knows you. We could call this "RL model transferability": how can an RL trace, a reward signal, or a preference representation trained on model N be distilled, stored, and automatically reapplied to model N+1 without too much user involvement? We solved this for SFT, where a training dataset can be stored and reused to train a future model. We also tackled a version of it in RLHF phases, but the general case of RL deployed in the real world remains unclear. There are some related threads (RLTR for transferable reasoning traces, P-RLHF and PREMIUM for model-agnostic user representations, HCP for portable preference protocols) but the full loop seems under-studied to me. Some of these questions are about off-policy learning, but others are about capabilities versus personalization: which of the old customizations/fixes does the new model already handle out of the box, and which are genuinely user/team-specific and will never be handled by default? The latter you would store in a skill for now, but RL allows going beyond that written-guidance level.
I have surely missed some work so please post any good work you’ve seen on this topic in the comments.
Ronak Malde@rronak_

This paper is so good I almost didn't want to share it. Ignore the OpenClaw clickbait: OPD + RL on real agentic tasks with significant results is very exciting, and moves us away from needing verifiable rewards. Authors: @YinjieW2024 Xuyang Chen, Xialong Jin, @MengdiWang10 @LingYang_PU

Thomas Wolf@Thom_Wolf·
@OpenAI Very cool (and love to see Fineweb here). Are people allowed to iterate on the training data?
kepano@kepano·
I have been working on Obsidian Reader for over a year. I didn't want to share it until I felt it was good enough. It's finally there. Consistent formatting for any article. Outline, syntax highlighting, nice footnotes, adjustable typography. Runs locally. Just rules, no AI.
Workshop Labs@WorkshopLabs·
Letting a provider see all your data is the price of admission for AI. We're changing that. Introducing Silo, the first private post-training and inference stack for frontier models, with hardware-level guarantees that we can’t see your data. Privacy without compromises. 🧵
Thomas Wolf@Thom_Wolf·
@cjpedregal @soleio great to read that, I love Granola. tbh MCP access already felt much more stable/reliable than the previous hacks indeed
Chris Pedregal@cjpedregal·
There are some tweets out there saying that Granola is trying to lock down access to your data. TL;DR: we are actually trying to become more open, not closed. We’re launching a public API next week to complement our MCP. Read on for context. A couple months ago, we noticed that some folks had reverse-engineered our local cache so they could access their meeting data. Our cache was not built for this (it can change at any point), so we launched our MCP to serve this need. The MCP gives full access to your notes and transcripts (all time for paid users, time restricted for free users). MCP usage has exploded since launch, so we felt good about it. A week ago, we updated how we store data in our cache and broke the workarounds. This is on us. Stupidly, we thought we had solved these use cases well enough with our MCP. We’ve now learned that while MCPs are great for connecting to tools like Claude or ChatGPT, they don’t meet your needs for agents running locally or for data export / pipeline work. So we’re going to fix this for you ASAP. First, we’ll launch a public API next week to make it easier for you to pull your data. Second, we’ll figure out how to make Granola work better for agents running locally. Whether that’s expanding our MCP, launching a CLI, a local API, etc. The industry is moving quickly here, so we’d appreciate your suggestions. We want Granola data to be accessible and useful wherever you need it. Stay tuned.
Thomas Wolf retweeted
Elliot Arledge@elliotarledge·
Karpathy asked. I delivered. Introducing OpenSquirrel! Written in pure Rust with GPUI (same as Zed) but with agents as the central unit rather than files. Supports Claude Code, Codex, Opencode, and Cursor (CLI). This really forced me to think through the UI/UX from first principles instead of relying on common Electron slop. github.com/Infatoshi/Open…
Andrej Karpathy@karpathy

Expectation: the age of the IDE is over Reality: we’re going to need a bigger IDE (imo). It just looks very different because humans now move upwards and program at a higher level - the basic unit of interest is not one file but one agent. It’s still programming.

Christos Tzamos@ChristosTzamos·
1/4 LLMs solve research-grade math problems but struggle with basic calculations. We bridge this gap by turning them into computers. We built a computer INSIDE a transformer that can run programs for millions of steps in seconds, solving even the hardest Sudokus with 100% accuracy
Thomas Wolf@Thom_Wolf·
Codexing games together with my 12 yo has been a surprisingly fun dad-son activity over the past couple of months as well. I don’t pretend he’s really learning to code through that, but the very low friction from idea to implementation and the pure pleasure to invent/propose-anything/mix-and-match-game-ideas/collaboratively-create-something-fun is deeply enjoyable. Somewhere between LEGOs and exquisite corpse
Sebastien Bubeck@SebastienBubeck

My 9 yo is now fully independent with codex and it's insane to watch, we built a few games together and then he went off to build his own tower defense, adding features by himself and testing them ... crazy

Thomas Wolf retweeted
Archie Sengupta@archiexzzz·
i spent a few hours going through /karpathy/autoresearch repo line by line. the "ai agents doing research" angle is what's getting all the attention but i think the more interesting thing is what's actually inside the training script and the engineering decisions that make the search loop tight. it's one of the most dense single-file training setups i've read. let me start with the thing that makes the whole project possible: the time budget is fixed at 300 seconds wall clock. not fixed steps, not fixed tokens, not fixed flops. wall clock seconds. this sounds like a minor detail but it's the entire reason the autonomous loop works. the agent can make the model 3x bigger, cut the batch size in half, swap in a completely different architecture, and the result is still directly comparable to every other experiment because they all got exactly 5 minutes of training on the same gpu. if you fixed steps instead, a bigger model would get fewer gradient updates per second and you'd be penalizing it unfairly. if you fixed tokens, you'd have the same problem. fixing wall time means you're asking the right question: given this hardware and this much time, what is the best model you can produce? everything else is a free variable. the agent can explore the full pareto surface of model size vs throughput vs convergence speed without any of those tradeoffs being confounded by the evaluation protocol. the metric is also carefully chosen. it's bits per byte, not cross entropy loss. cross entropy depends on your vocab size. a model with 32k tokens and a model with 8k tokens will have very different loss values even if they compress the data equally well. bpb normalizes this away by summing the per-token cross entropy in nats, summing the utf-8 byte lengths of the target tokens, and converting nats-per-byte to bits-per-byte. so even if the agent changes something that affects the effective token distribution, the comparison remains fair.
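that nats-to-bits-per-byte conversion is easy to make concrete. a minimal sketch (function name and call shape are mine, not the repo's):

```python
import math

def bits_per_byte(nats_per_token, target_tokens):
    """Vocab-invariant eval metric: sum per-token cross-entropy (in nats),
    sum the UTF-8 byte lengths of the target tokens, then convert
    nats/byte -> bits/byte by dividing by ln(2)."""
    total_nats = sum(nats_per_token)
    total_bytes = sum(len(t.encode("utf-8")) for t in target_tokens)
    return (total_nats / total_bytes) / math.log(2)
```

a model that assigns every byte probability 1/2 scores exactly 1.0 bpb no matter how the tokenizer splits the text, which is the point.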
these two choices, fixed wall time and a vocab-invariant metric, turn what would be a messy incomparable search into a clean optimization problem. now the model itself. it's a GPT but with a bunch of modern tricks that are worth understanding. first, RMSnorm everywhere. on the block inputs (pre-norm), and also on queries and keys right before the attention dot product. this QK-norm thing is important because without it the norms of q and k can grow unboundedly during training, causing attention logits to sharpen and softmax to saturate. normalizing q and k keeps the dot products in a stable range regardless of how deep the network is or how training dynamics evolve. the attention itself is FA 3, loaded through the kernels library. it uses varunneal's implementation on hopper (sm_90) and falls back to a community build on older gpus. the attention pattern is "SSSL" which means three layers of sliding window attention (window = half the sequence length) followed by one layer of full causal attention, repeating. this is the sparse-to-dense pattern you see in mistral and gemma2. the local attention layers are computationally cheap because the attention matrix is banded, and the periodic global layer lets information flow across the full context. with 8 layers and a 4-character pattern you get layers 0,1,2 local, layer 3 global, layers 4,5,6 local, layer 7 global. the last layer is forced global regardless of pattern. the value embedding thing is subtle and i think underappreciated. every other layer gets its own embedding table, completely separate from the main token embedding, that maps token ids directly to value-dimension vectors. these get mixed into the attention values through a learned gate: v = v + 2 * sigmoid(W_gate @ x[:32]) * ve. the gate weight is zero-initialized, so sigmoid(0) = 0.5, times 2 gives 1.0, which is a neutral starting point.
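the gated mix can be sketched in numpy (shapes and names are my assumptions, and i use a single scalar gate per position where the repo gates per-head; only the update rule itself follows the thread):

```python
import numpy as np

def gated_value_embedding(v, ve, x, W_gate):
    """v: attention values (seq, d); ve: per-layer value embedding looked up
    by token id (seq, d); x: hidden state (seq, d_model); W_gate: (1, 32),
    zero-initialized so 2 * sigmoid(0) = 1.0 passes ve through unchanged."""
    logits = x[:, :32] @ W_gate.T            # gate on first 32 hidden dims
    gate = 2.0 / (1.0 + np.exp(-logits))     # 2 * sigmoid, in (0, 2)
    return v + gate * ve                     # gated shortcut to token identity
```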
over training the model can learn to amplify or suppress the value embedding per-head based on the first 32 dimensions of the hidden state. this is from the ResFormer line of work and the intuition is that it gives attention a direct shortcut to token identity. the value vectors can carry information about "what token is at this position" without that information having to survive the residual stream transformations from earlier layers. it's essentially a skip connection from the input directly into the attention values, gated so the model can decide when it's useful. there are also per-layer learnable scalars on the residual stream: x = lambda_resid[i] * x + lambda_x0[i] * x0, where x0 is the normalized embedding from layer 0. every layer can independently control how much it listens to the running residual vs the original input. the residual lambdas start at 1.0, the x0 lambdas start at 0.1. this is a soft version of the "disentangled residual" idea. in a standard transformer the residual stream is a sum of all previous layer outputs and it gets increasingly polluted as you go deeper. giving each layer access to the clean original embedding means it doesn't have to learn to "undo" earlier layers to recover low-level information. the logits are softcapped at 15 via tanh(logits/15)*15 which prevents the model from being overconfident early in training when the representations are still noisy. but honestly the most interesting part of the whole file is the optimizer. MuonAdamW is a combined optimizer that dispatches different update rules based on parameter group. embeddings (token embedding, value embeddings, unembedding head) and per-layer scalars get standard AdamW with different learning rates for each group. the spread is wild. embedding lr is 0.6, unembedding lr is 0.004, that's a 150x difference, and it's intentional. the embedding matrix sees every single token and needs to update aggressively.
the unembedding matrix is a linear probe on the final representation and benefits from stability. the embedding, value embedding, and unembedding learning rates are all scaled by (d_model / 768)^(-0.5) which is a muP-inspired correction. as model width changes, those learning rates adjust to keep the feature learning dynamics scale-invariant. the scalar learning rates for the per-layer lambdas are handled separately and don't get this scaling. the 2D weight matrices in the transformer, attention projections and mlp weights, get Muon, and this is where it gets genuinely interesting. muon takes the gradient, applies nesterov momentum, then runs a newton-schulz iteration to approximate the polar decomposition of the gradient matrix. the polar decomposition factors a matrix G into G = U * S where U is orthogonal and S is symmetric positive semi-definite. muon computes U, the nearest orthogonal matrix to the gradient, and uses that as the update direction. the newton-schulz iteration is 5 steps. for tall matrices (more rows than columns), A = X^T @ X then X -> aX + X @ (bA + cA^2). for wide matrices, A = X @ X^T then X -> aX + (bA + cA^2) @ X. the coefficients are hardcoded from a precomputation. they call it "polar express." the whole thing compiles to a single fused kernel via torch.compile. why does this matter? because for weight matrices the frobenius norm gradient (what adam and sgd use) is geometrically wrong. the "correct" steepest descent direction for a weight matrix is the one that minimizes the loss subject to the constraint that the update has unit spectral norm, not unit frobenius norm. the orthogonal polar factor gives you exactly this. in practice it means muon makes much larger effective updates because it's not wasting step size on scaling the singular values. it only rotates them. this is why muon converges significantly faster than adam on transformer weight matrices. 
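the iteration itself is short. a numpy sketch; the coefficients below are the widely circulated Muon quintic values, which may differ from the "polar express" constants the repo hardcodes:

```python
import numpy as np

def newton_schulz_polar(G, steps=5):
    """Approximate the orthogonal polar factor of G with a quintic
    Newton-Schulz iteration, following the tall/wide split in the thread."""
    a, b, c = 3.4445, -4.7750, 2.0315      # example coefficients (Muon's)
    X = G / (np.linalg.norm(G) + 1e-7)     # Frobenius bound => spectral norm <= 1
    for _ in range(steps):
        if X.shape[0] >= X.shape[1]:       # tall: build the smaller Gram matrix
            A = X.T @ X
            X = a * X + X @ (b * A + c * (A @ A))
        else:                              # wide
            A = X @ X.T
            X = a * X + (b * A + c * (A @ A)) @ X
    return X
```

each step applies the polynomial f(s) = a·s + b·s³ + c·s⁵ to the singular values, pushing them toward 1 while leaving the singular vectors alone, which is exactly the "keep the rotation, drop the scale" behavior described above.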
muon does maintain per-element momentum buffers (same shape as the parameters, stacked across each shape group), but unlike adam it doesn't track per-element second moments. the second moment estimates are per-row or per-column after orthogonalization, not per-element. that's where NorMuon comes in. on top of the base muon there's NorMuon, a variance reduction scheme. after orthogonalization, it computes per-row (or per-column depending on aspect ratio) second moment estimates, maintains an exponential moving average of those, and rescales the update so each output dimension gets its own adaptive step size. it's essentially the adam adaptivity idea but applied in the orthogonalized coordinate system rather than the raw parameter space. the weight decay is also non-standard. it's "cautious," meaning it only decays parameters where the muon update direction agrees with the parameter sign: mask = (g * params) >= 0. this avoids the known failure mode where weight decay pushes parameters toward zero against the update's wishes, which can destabilize training. one small detail i appreciated: after the very first training step, the code calls gc.collect(), gc.freeze(), gc.disable() to completely shut off python's garbage collector. python's GC runs periodically and causes ~500ms stalls. when your total budget is 300 seconds and each step is maybe 300ms, a random GC pause costs you almost 2 training steps. they manually trigger gc.collect() every 5000 steps as a compromise. this is the kind of thing you only learn by profiling real training runs and noticing mysterious throughput drops. the first 11 steps (0 through 10) aren't counted toward the time budget either. that's the warmup where torch.compile does its thing and CUDA kernels get JIT'd. without this exclusion, different experiments would get different amounts of "real" training depending on how long compilation takes for that particular model configuration. 
again, a design choice that seems small but is critical for making experiments comparable. now zoom out. the actual autoresearch loop is: the agent reads program.md (a markdown file that describes its job), modifies train.py, commits, runs for 5 minutes, checks if val_bpb improved, keeps or reverts, repeats. program.md explicitly says "NEVER STOP." the agent runs indefinitely until the human kills it. ~12 experiments per hour, ~100 overnight while you sleep. the thing i keep coming back to is how tight the constraints make the problem:
> one file to edit.
> one metric to optimize.
> one gpu.
> five minutes.
> no new dependencies allowed.
the search space is large but the evaluation is fast, cheap, and unambiguous. without the fixed time budget the agent would have to reason about compute-performance tradeoffs which is a much harder problem. without the single-file constraint it could create sprawling multi-file messes that are impossible to revert cleanly. the constraints are what make it work. this is honestly a general lesson in research. the tighter the evaluation protocol, the faster you make progress.
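one more piece from earlier in the thread that is tiny enough to sketch: the cautious weight-decay rule. a hedged numpy version (a decoupled-AdamW-style step; the function name and hyperparameters are mine, only the mask follows the thread):

```python
import numpy as np

def cautious_decay_step(params, update, lr=0.02, weight_decay=0.01):
    """Decay only where the update direction agrees with the parameter sign
    (mask = (update * params) >= 0), so weight decay never pushes a
    parameter toward zero against the update's wishes."""
    mask = (update * params) >= 0
    params = params * (1.0 - lr * weight_decay * mask)  # masked decoupled decay
    return params - lr * update                          # then the usual step
```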
Laura Modiano@LauraModiano·
London to Amsterdam to Stockholm to London to Paris to Berlin hanging with some of the best in the startup ecosystem in Europe is a week very well spent. Now back home for a vibe coding session for 10-year-olds
Christine Yip@christinetyip·
If you're still doing autoresearch alone, you're already behind. Every node is an experiment run by an agent. Every experiment and result is open-source. Your agent could've read these results and adjusted its strategy before running its own experiments. That's the power of autoresearch@home. ~1400 experiments have already been run. And it's growing.
Finn Meeks@finn_meeks·
Anything but casual Friday at SPC! We started with agent workflow demos. Special shoutout to @trq212 for showing us the latest with Claude. And ended with @Thom_Wolf and the @huggingface team stopping by for a Q&A and deep dive on LeRobot. Guest appearance from Reachy Mini!
Thomas Wolf retweeted
Alif Munim (d/acc)@alifmunim·
Since @karpathy kicked off recursive self-improvement a few days ago, I've been thinking about how we can automate interpretability research. I asked Claude to train a sparse autoencoder on Gemma3-1B. It recovered 96% of Gemma's behaviors from interpretable features overnight.
Cheng-Wei Hu@HcwXd·
I left NotebookLM a few months ago to solve a bigger problem in learning. Today, as the first step, we are launching @WonderingApp for early access. It's Duolingo for anything — turning any topic into a guided path with bite-size visual lessons that can fit into your busy schedule. But you don't sacrifice depth/effectiveness for convenience:
Total Control: You decide how deep you want to go, how difficult the material should be, and how personalized the experience feels.
Active Learning: We provide the tools you need to practice, test your understanding, and actually apply what you’ve learned.
Long-term Mastery: It’s built to help you truly remember and master any subject, not just skim the surface.
Thomas Wolf retweeted
AI4Science Catalyst@AI4S_Catalyst·
We’re thrilled to open-source LabClaw — the Skill Operating Layer for LabOS by Stanford-Princeton Team One command turns any OpenClaw agent into a full AI Co-Scientist. Demo: labclaw-ai.github.io Dragon Shrimp Army reporting for duty 🦞🔬 #AIforScience #OpenClaw
Thomas Wolf retweeted
LeRobot@LeRobotHF·
🚀 Scaling every dimension of OSS Robotics! LeRobot v0.5.0 is officially LIVE! With over 200 merged PRs and 50+ new contributors, this is our biggest release yet. Whether you're working in sim or deploying on real hardware, v0.5.0 pushes the boundaries of open-source robot learning. Highlights: * 🤖 First Humanoid Support: Full integration for the Unitree G1, including whole-body control, locomotion, and manipulation! * 🧠 New SOTA Policies: Expanding the zoo with Pi0-FAST (Autoregressive VLAs), Wall-X, X-VLA, and SARM for complex, long-horizon tasks. * ⚡ Real-Time Chunking (RTC): Dramatically more responsive, real-time inference for flow-matching policies. * 🎥 Faster Datasets: New streaming video encoding means zero wait time between recording episodes, plus 10x faster image training. * 🌍 EnvHub & IsaacLab: Load sim environments straight from the Hugging Face Hub, now featuring GPU-accelerated NVIDIA Isaac integration. * 🛠️ Modernized Core: Upgraded to Python 3.12 & Transformers v5, plus a seamless new 3rd-party policy plugin system. This is a massive leap toward general-purpose embodied AI. Read the full announcement in the Release Blog: huggingface.co/blog/lerobot-r… P.S. Keep an eye out... a big surprise is right around the corner! 👕👀
LeRobot tweet media
Thomas Wolf retweeted
LDJ@ldjconfirmed·
In November 2023, Yann LeCun, Thomas Wolf and others from Meta and Hugging Face created a benchmark called GAIA, which described itself as: "A benchmark for General AI Assistants that, if solved, would represent a milestone in AI research." Most of the problem solutions were kept private, not released online. It proposed 466 "real-world questions that require a set of fundamental abilities such as reasoning, multi-modality handling, web browsing, and generally tool-use proficiency." On the hardest level, the average human score was 87%, while the leading systems scored less than 3%. 10 months later, OpenAI released o1-preview, reaching ~30% on that level. Now in 2026 the human baseline for the hardest level has officially been surpassed: the best agent systems are scoring 88.9% on GAIA's hardest level (level 3).