

Jonathan @SF

@lightetal
I’m a founding member @AsariAILabs and a PhD researcher @Caltech @RPI @NEC working on LLM agents, reasoning, RL, test-time scaling, and computer-use agents.



Post-training LLMs is like mixing a cocktail:
Too much easy data → no learning
Too much hard data → instability
Wrong balance → collapse
And today, we mix it by hand.
What if the data mixture could be learned instead of hand-tuned?
arxiv.org/abs/2602.20532 🧵👇
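The general idea of a learned mixture can be sketched as a bandit over difficulty buckets (this toy is my own illustration, not the paper's algorithm; the gradient-bandit update, bucket names, and reward function are all assumptions):

```python
import math
import random

def learn_mixture(buckets, reward_fn, steps=500, lr=0.1, seed=0):
    """Learn sampling weights over data buckets with a gradient-bandit
    update: buckets whose samples yield more learning progress (reward)
    get a larger share of the mixture."""
    rng = random.Random(seed)
    prefs = {b: 0.0 for b in buckets}   # one preference score per bucket
    baseline, n = 0.0, 0                # running mean of observed rewards
    for _ in range(steps):
        z = sum(math.exp(p) for p in prefs.values())
        probs = {b: math.exp(p) / z for b, p in prefs.items()}  # softmax
        b = rng.choices(list(probs), weights=list(probs.values()))[0]
        r = reward_fn(b)                # learning-progress signal
        n += 1
        baseline += (r - baseline) / n
        for a in prefs:                 # push toward rewarding buckets
            indicator = 1.0 if a == b else 0.0
            prefs[a] += lr * (r - baseline) * (indicator - probs[a])
    z = sum(math.exp(p) for p in prefs.values())
    return {b: math.exp(p) / z for b, p in prefs.items()}

# Toy reward: medium-difficulty data gives the most learning progress,
# matching the intuition that too-easy and too-hard data teach little.
mix = learn_mixture(["easy", "medium", "hard"],
                    lambda b: {"easy": 0.1, "medium": 1.0, "hard": 0.3}[b])
```

The mixture ends up concentrated on the "medium" bucket, i.e. the balance is discovered rather than hand-tuned.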




Full house at the Computer History Museum today. Great speakers, AI enthusiasts, and people who truly care about memories, all under one roof.
We shared our thoughts on:
• Memory and why it matters
• The future of memory infrastructure
• The future of the memory OS
This was also the grand finale of our Genesis 2026 competition. The quality of the projects, the presentations, and the sheer volume of code submitted truly blew us away. It is clear that this community is pushing the boundaries of what memory infrastructure can be.
More to come. Stay tuned.









(1/8) 🚀 New preprint: stop training reasoning models uniformly. Uniform prompt sampling + fixed rollouts waste compute on easy questions. We adapt, online, (1) *what* we train on and (2) *how much compute* we spend. #ContinualLearning #lifelonglearning #Reasoningmodels
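One simple way to picture "adapt how much compute we spend online" is a per-prompt rollout budget keyed to the prompt's pass rate (this heuristic is my own sketch, not the preprint's method; the function name and budget parameters are assumptions):

```python
def rollout_budget(pass_rate, base=8, max_rollouts=32):
    """Toy heuristic: spend extra rollouts where the prompt is neither
    trivially solved nor hopeless. The variance of a Bernoulli pass/fail
    outcome, p * (1 - p), peaks at p = 0.5, which is where additional
    rollouts buy the most training signal."""
    signal = pass_rate * (1.0 - pass_rate)  # in [0, 0.25]
    return base + round((max_rollouts - base) * signal / 0.25)

# Solved (p=1) and unsolvable (p=0) prompts get only the base budget;
# prompts near a 50% pass rate get the full budget.
budgets = [rollout_budget(p) for p in (0.0, 0.5, 1.0)]
```

A fixed-rollout scheme would assign `max_rollouts` everywhere, including to the easy questions where the extra samples carry almost no gradient signal.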


I actually think “Claude Code can solve it” is a prerequisite for a great research problem, because it lets you explore hypotheses much faster. In fact, if CC can’t solve it, I’d flag it as a bad problem: it will be able to solve it within six months, months that you’d otherwise waste on it.


We scored 36.08% on ARC-AGI-3 in one day using the Agentica SDK.






Simply adding Gaussian noise to LLMs (one step: no iterations, no learning rate, no gradients) and ensembling them can achieve performance comparable to, or even better than, standard GRPO/PPO on math reasoning, coding, writing, and chemistry tasks. We call this algorithm RandOpt.
To verify that this is not limited to specific models, we tested it on Qwen, Llama, OLMo3, and VLMs.
What's behind this? We find that in the Gaussian search neighborhood around pretrained LLMs, diverse task experts are densely distributed, a regime we term Neural Thickets.
Paper: arxiv.org/pdf/2603.12228
Code: github.com/sunrainyg/Rand…
Website: thickets.mit.edu
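The one-step perturb-and-ensemble recipe can be sketched on a toy weight vector (a minimal illustration of the idea as stated in the post; the top-k selection, weight averaging, and all parameter names are my assumptions, not necessarily RandOpt's actual details):

```python
import random

def perturb_and_ensemble(weights, score_fn, n_samples=64, sigma=0.1, k=8, seed=0):
    """Draw Gaussian perturbations of the base weights in ONE step
    (no gradients, no iterations), then keep the top-k scoring
    neighbours and ensemble them by averaging their weights."""
    rng = random.Random(seed)
    candidates = []
    for _ in range(n_samples):
        w = [wi + rng.gauss(0.0, sigma) for wi in weights]
        candidates.append((score_fn(w), w))
    candidates.sort(key=lambda t: t[0], reverse=True)
    top = [w for _, w in candidates[:k]]
    # average the top-k neighbours into a single ensemble model
    return [sum(ws) / k for ws in zip(*top)]

# Toy task: the "expert" lies near [1, 1]; score = negative squared distance.
target = [1.0, 1.0]
score = lambda w: -sum((wi - ti) ** 2 for wi, ti in zip(w, target))
base = [0.8, 0.9]
tuned = perturb_and_ensemble(base, score)
```

On this toy landscape the ensembled neighbour scores at least as well as the base weights, mirroring the claim that good task experts sit densely in the Gaussian neighbourhood of the pretrained model.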



