Drew Breunig

15.7K posts

@dbreunig

Writing about and working on AI, geo, and data.

Bay Area · Joined March 2008
1.1K Following · 8.9K Followers
Omar Khattab @lateinteraction
This project went through more naming iterations than anything else: magnet RL (actively attract the rollout to the right final answer), lucky RL (stumble upon good + realistic rollouts more often than pass@K), foresight RL (learn to use special knowledge of the future), ...
Souradip Chakraborty @SOURADIPCHAKR18

🚨Typical RL algorithms and on-policy distillation methods are blind samplers: they use privileged info to score rollouts, but not to *find* them. We ask: can we use privileged info to *actively sample* the rollouts RL wishes it can stumble upon with compute? ⤵️ Pedagogical RL

4
2
49
5.2K
Drew Breunig @dbreunig
I am! Here's a drop for dspy.RLM's code interpreter: github.com/dbreunig/dspy-… Isaac and I have been discussing letting the code interpreter provide instructions to the RLM module, which would save a turn or two and prevent the model from trying unsupported imports. But since they added `json` support, the speed of execution is worth the 2-3 burned turns discovering the edges of monty.
0
0
0
39
Drew Breunig @dbreunig
3 years since ChatGPT, hundreds of billions invested, models capable of writing entire codebases and finding novel exploits... And the emdash persists!
3
0
8
780
Drew Breunig @dbreunig
I'm finding paper and pen, marker and whiteboard, to be multiples more valuable in this AI era. It's a different space to think, differently, without the easy chatbot escape hatch.
3
1
13
511
Drew Breunig retweeted
alex zhang @a1zhang
Some awesome initial experiments on training small RLMs :) A direction I think will be super super important moving forward for fully seeing the capabilities of RLMs vs. traditional agentic systems
alphaXiv @askalphaxiv

Reinforcing Recursive Language Models Can a 4B model learn to recursively call itself to answer hard long-context questions? We RL fine-tuned a small model to behave as a native RLM. On evidence selection across scientific papers, our 4B RLM matches Sonnet 4.6 in quality while running significantly faster and cheaper.

8
37
294
27.5K
Michael Ryan @michaelryan207
Super excited to join the 2026 cohort of @KnightHennessy scholars! Has been incredible talking to the other scholars about the major problems they are tackling across healthcare, science, policy, etc. Excited to work towards Human-Centered Open-Source AI that can support people tackling the world's biggest challenges!
KnightHennessy @KnightHennessy

Meet the 2026 cohort of KH scholars! These 87 new scholars make up the most global Knight-Hennessy Scholars cohort to date, and will pursue degrees in 45 graduate programs across all seven graduate schools at @Stanford: knight-hennessy.stanford.edu/news/knight-he… (1/2)

39
19
192
25.3K
Drew Breunig retweeted
Jeff Dean @JeffDean
Great to see @percyliang as a keynote speaker at #cais2026!
ACM Conference on AI and Agentic Systems @CAISconf

🎤 Keynote announcement: @percyliang (Percy Liang), Professor of Computer Science at @Stanford, founding director of the Center for Research on Foundation Models, and co-founder of @togethercompute, is keynoting #CAIS2026. Percy's HELM framework set the standard for holistic evaluation of language models, and his Foundation Model Transparency Index (now in its third year) put every major AI lab on notice for what they do and don't disclose. His current work on Marin takes this further: an open lab where every experiment, successful or not, is public from day one. This one's going to be good. San Jose · May 26–29 caisconf.org

11
19
222
44.4K
Drew Breunig retweeted
ACM Conference on AI and Agentic Systems
🎤 Keynote announcement: @percyliang (Percy Liang), Professor of Computer Science at @Stanford, founding director of the Center for Research on Foundation Models, and co-founder of @togethercompute, is keynoting #CAIS2026. Percy's HELM framework set the standard for holistic evaluation of language models, and his Foundation Model Transparency Index (now in its third year) put every major AI lab on notice for what they do and don't disclose. His current work on Marin takes this further: an open lab where every experiment, successful or not, is public from day one. This one's going to be good. San Jose · May 26–29 caisconf.org
2
20
86
64.6K
vicki @vboykis
Legitimately feels like an unquantifiable vibe shift the last few weeks where the pendulum is swinging back to reasonable takes and people experimenting with model choice 🙏
17
21
216
18K
Drew Breunig @dbreunig
AI is what Donna Haraway called a "power object": something we treat as enormously important without understanding how it works. Reverence plus naivety means everyone fills the empty box with their own pet issue.
Kevin Van Valkenburg @KVanValkenburg

This reader comment on a NY Times column where Ross Douthat ponders that maybe God is speaking to us through A.I. is an absolute fastball on the corner with movement, and deserves a column. "Lightning was once mysterious too; mystery did not make Zeus correct" is perfect.

1
3
9
1.5K
Drew Breunig retweeted
Kevin Van Valkenburg @KVanValkenburg
This reader comment on a NY Times column where Ross Douthat ponders that maybe God is speaking to us through A.I. is an absolute fastball on the corner with movement, and deserves a column. "Lightning was once mysterious too; mystery did not make Zeus correct" is perfect.
102
1.8K
12.2K
357.3K