Drew Breunig
@dbreunig
15.4K posts
Writing about and working on AI, geo, and data.
Bay Area · Joined March 2008
1.1K Following · 7.8K Followers
Kyle Daigle@kdaigle·
Hot take from looking at @github Copilot telemetry: benchmarks make coding models look wildly different. Production workflows make them look much more similar. 👀 We looked at 23M+ Copilot requests and examined one simple metric: code survivability.
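The thread doesn't define "code survivability" precisely. A minimal sketch, assuming it means the fraction of model-suggested lines still present in the final file after review, might look like this (the function and the data shapes are invented for illustration, not GitHub's actual telemetry schema):

```python
# Hypothetical sketch of a "code survivability" metric, assuming it means the
# fraction of model-suggested lines still present in the final file.
# Nothing here reflects GitHub's real pipeline.

def survivability(suggested_lines, final_lines):
    """Fraction of suggested lines that survive into the final file."""
    if not suggested_lines:
        return 0.0
    final = set(final_lines)
    kept = sum(1 for line in suggested_lines if line in final)
    return kept / len(suggested_lines)

suggested = ["def add(a, b):", "    return a + b", "    print('debug')"]
merged    = ["def add(a, b):", "    return a + b"]
print(survivability(suggested, merged))  # 2 of 3 suggested lines survive
```

A real version would need to track line identity through later edits (for example via `git blame`) rather than exact string matches.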
Drew Breunig@dbreunig·
There are so many issues with prompt adherence in a mono prompt, especially with personas (gstack could learn from the RPG hackers on this…). Talked with someone yesterday who has Claude write, Codex do code review, lets them go back and forth, then sorts the changes by how controversial they are.
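The write/review/sort workflow described above can be sketched as a small orchestration loop. Everything here is a hypothetical stand-in: `debate`, the stubbed writer and reviewer callables, and counting objections as a controversy score are assumptions, not anyone's actual tooling:

```python
# Sketch of a "one model writes, another reviews, sort by controversial"
# loop. The writer/reviewer stubs stand in for real model calls.

def debate(change, writer, reviewer, rounds=3):
    """Let a writer and a reviewer go back and forth on one change."""
    objections = 0
    for _ in range(rounds):
        review = reviewer(change)
        if review is None:               # reviewer is satisfied; stop early
            break
        objections += 1
        change = writer(change, review)  # writer revises against the review
    return change, objections

def sort_by_controversy(changes, writer, reviewer):
    """Run the loop per change; surface the most contested changes first."""
    results = [debate(c, writer, reviewer) for c in changes]
    return sorted(results, key=lambda r: r[1], reverse=True)

# Stub agents standing in for the two models:
def reviewer(change):
    return "too risky" if "risky" in change else None

def writer(change, review):
    return change.replace("risky", "safe")

print(sort_by_controversy(["risky refactor", "rename a variable"], writer, reviewer))
```

In practice the objection count would come from the reviewer model's structured output rather than a substring check.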
dex@dexhorthy·
Overall there are n thousand ways to throw more tokens at the problem and we need better ways to eval “review this plan and find all the things I didn’t think about” vs a 100+ instruction monolithic mega prompt
dex@dexhorthy·
Tried plan-review-ceo from gstack yesterday. I’m not sure if this is good or bad, intentional or not intentional, but when I felt like pushing back on the agent*, something in my brain feels like I’m arguing with Garry directly 🤣

Anyways, milestone 1 of a big feature shipping with RPI/QRSPI + Gstack shipping today, will report back.

* (which @garrytan had stated is part of the process: “your job is to know when the model is gassing you up and call it out” or something)

I have some technical concerns with the sheer volume of instructions in the prompt and the amount of adherence you will actually get (@0xblacklight cited an interesting arxiv paper in the post linked below). I think we might be better served by a router that routes to specific modes, rather than explaining every single mode in a single monolithic prompt, but there are tradeoffs to consider in plumbing and UX for the end user.

I think some may complain that it’s overly verbose and thoughtful and brings up things that are irrelevant, but I actually think that’s good. I want a clean braindump of everything that might be relevant so I can edit and prune down to just what’s important.
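The router alternative floated above can be sketched very simply: classify the request into a mode, then load only that mode's focused prompt instead of shipping one monolithic mega prompt. The modes, keywords, and prompt strings below are all invented for illustration, and a production router would more likely use a small classifier model than keyword matching:

```python
# Minimal sketch of routing to mode-specific prompts instead of one
# monolithic prompt. All mode names, keywords, and prompts are hypothetical.

MODE_PROMPTS = {
    "plan_review": "You are a critical plan reviewer. Find gaps ...",
    "code_review": "You are a code reviewer. Flag bugs and risks ...",
    "implement":   "You are an implementer. Write the change ...",
}

MODE_KEYWORDS = {
    "plan_review": ("plan", "spec", "milestone"),
    "code_review": ("review", "diff", "pull request"),
}

def route(request: str) -> str:
    """Pick a mode from keywords; fall back to 'implement'."""
    text = request.lower()
    for mode, keywords in MODE_KEYWORDS.items():
        if any(k in text for k in keywords):
            return mode
    return "implement"

mode = route("review this plan and find what I missed")
print(mode)
```

The payoff is that each mode's prompt stays short enough for the model to actually adhere to, at the cost of extra plumbing for the routing step.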
Pamela Fox@pamelafox·
@dbreunig The DSpy meetup tonight was fantastic - even though I've never used DSpy, the case studies were full of general insights. Will speakers be sharing their slides?
Drew Breunig@dbreunig·
@mjamei @Dropbox If you set your reflection_lm to a Claude, that is what DSPy/GEPA does, but in a systematic manner over many calls.
Mehdi Jamei 🗽@mjamei·
@Dropbox Did you try giving Claude the original prompt and the list of disagreements and asking it to “rewrite the prompt to address the disagreements”? I bet you get 90% of the lift right away.
Drew Breunig reposted
Dropbox@Dropbox·
How we used DSPy to turn our relevance judge into a measurable optimization loop, making it more reliable and scalable in Dropbox Dash.
Drew Breunig@dbreunig·
This is THE sleeper reason teams stick with DSPy.
Mingta Kaivo 明塔 开沃@MingtaKaivo

@dbreunig Exactly. The friction cost of "we should test this new model" was killing us. Most teams just default to whatever they launched with because retooling prompts is a whole sprint. Making that a one-line change is how you actually stay on the frontier instead of reading about it.

Drew Breunig@dbreunig·
@MingtaKaivo It is such a win. You go from "A new model just dropped, looks great, we should test it. But to see its potential we're going to have to tweak the prompt, and that takes weeks," so you never do it. To "A new model just dropped. Update the model string and run GEPA, why not?"
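As a configuration sketch, the "update the model string and run GEPA" loop might look like the following in DSPy. Treat every name here as an assumption: the model strings are placeholders, the metric is a toy, and the exact `dspy.GEPA` signature should be checked against the DSPy docs before use:

```python
# Configuration sketch only (requires API keys; not runnable as-is).
# Model IDs and the metric are placeholders, not recommendations.
import dspy

# The "one-line change" from the thread: swap the task model string.
dspy.configure(lm=dspy.LM("openai/new-model-that-just-dropped"))

def metric(example, prediction, *args, **kwargs):
    # Toy exact-match metric; a real one would score task-specific quality.
    return float(example.answer == prediction.answer)

optimizer = dspy.GEPA(
    metric=metric,
    reflection_lm=dspy.LM("anthropic/claude-model-of-choice"),  # per the tweet above
    auto="light",
)
# optimized_program = optimizer.compile(program, trainset=trainset)
```

The point from the thread is that the program and metric stay fixed while GEPA re-tunes the prompts for whatever model string you configure.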
Mingta Kaivo 明塔 开沃
@dbreunig the model-swap-without-prompt-rewrite is the part I hadn't solved cleanly. at AudioWave we abstracted routing early but still had to retune prompts per model. GEPA optimizing the program itself instead of the prompt is the right layer to fix at
Drew Breunig@dbreunig·
@MilksandMatcha Also matters which em-dash you’re using! The easy one to access on macOS is short enough that I always put spaces around it.
Sarah Chieng@MilksandMatcha·
One way to tell if someone is using claude vs. chatGPT is by looking at the em-dashes. claude ALWAYS adds spaces before and after the em-dash, chatGPT doesn't. Claude: ' — ' ChatGPT: '—'
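The spacing heuristic above can be written as a toy classifier. This is a minimal sketch of the spacing check only; the function name and labels are invented, and real attribution would need far more signal than one punctuation habit:

```python
# Toy check of the em-dash spacing heuristic. "\u2014" is the em-dash (—).

def em_dash_style(text: str) -> str:
    """Classify em-dash usage as 'spaced', 'unspaced', 'mixed', or 'none'."""
    dashes = text.count("\u2014")
    if dashes == 0:
        return "none"
    spaced = text.count(" \u2014 ")   # em-dash with a space on both sides
    if spaced == dashes:
        return "spaced"
    if spaced == 0:
        return "unspaced"
    return "mixed"

print(em_dash_style("code is clean \u2014 and fast"))  # the spaced style
print(em_dash_style("code is clean\u2014and fast"))    # the unspaced style
```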
swyx@swyx·
caption this
Farouk@FaroukAdeleke3·
this reminds me of @dbreunig's chroma context engineering talk. so many concepts and frameworks rooted in common sense that nobody cared about until andrej karpathy tweets about them on a random tuesday x.com/ivanbokii/stat…
Ivan@ivanbokii

It’s quite unfortunate that GEPA Optimize Anything didn’t get enough traction, while very, very similar ideas promoted by Karpathy’s autoresearch + Lütke’s pi-autoresearch - got so much traction, despite being less general

Drew Breunig@dbreunig·
--dangerously-skip-permissions
GIF
Keshav Jindal@Keshavatearth·
@dbreunig I wonder what's the score with newer models. all the models in this chart are non-reasoning models and no longer exist
Drew Breunig@dbreunig·
@swyx @fabknowledge Doing the back of the envelope math...how many tokens will I have to spend to fit in a standard instance...to save $150/month...
Drew Breunig@dbreunig·
Some questions that can be answered with this response:
- Why does context rot occur?
- Why are models so good at coding and meh at other things?
- Why do models fight formatting requests?
- Why are models good at reasoning with code?
- Why do models try not to be shut down during testing?
- Why do models seem so human-like?
- Explain Moltbook?
elie@eliebakouch·
@kiranvodrahalli yes i agree! i was mainly impressed since previous anthropic models were not super good on this benchmark, and now it's sota. also curious to get your take if you have a public long-context eval that you like beyond NIAH-style ones, maybe graphwalk or the ones in HELMET?