Samuel Chen

9 posts

Samuel Chen

Samuel Chen

@samchenn_

phys/cs @ stanford | v6 plateau

Katılım Şubat 2025
89 Takip Edilen30 Takipçiler
Michael Y. Li
Michael Y. Li@michaelyli_·
Can a language model learn, end-to-end, what to keep in its own KV cache and what to throw away? Can it learn to forget while it learns to reason? Deep learning's central lesson: capability emerges from end-to-end optimization, not heuristics/strong inductive biases. But for efficiency, we rely heavily on hand-designed approaches. 🗑️ Introducing Neural Garbage Collection (NGC): we train a language model to jointly reason and manage its own KV cache, using reinforcement learning with outcome-based task reward alone. No SFT, no proxy objectives, no summarization in natural language. New paper with @jubayer_hamid, Emily Fox, and @noahdgoodman!
Michael Y. Li tweet media
English
30
133
905
163.1K
Samuel Chen
Samuel Chen@samchenn_·
@Andercot yes Islands of Stability, but we should probably also secure the Twin Peaks of unstable isotopes that hospitals need
Samuel Chen tweet media
English
0
0
0
30
Andrew Côté
Andrew Côté@Andercot·
We should consider immediately negotiating for the purchase of the Islands of Stability to safeguard the West's future need of stable isotopes. Thank you for your attention to this matter
Andrew Côté tweet media
English
5
3
124
7.7K
Nous Research
Nous Research@NousResearch·
Introducing NousCoder-14b, a competitive olympiad programming model. Our latest blog details the full findings from extensive experiments and logs with the full stack released - the RL environment, benchmark, and harness built in Atropos, all fully reproducible with our open training stack. NousCoder-14b was post-trained on Qwen3-14B by researcher in residence @JoeLi5050 using 48 B200s over the course of 4 days, our Atropos framework, and @modal's autoscaler. It achieves a Pass@1 accuracy of 67.87%,+7.08% over Qwen's baseline accuracy using verifiable execution rewards.
Nous Research tweet media
English
57
109
1.2K
675.8K
Ryan Rong
Ryan Rong@RyanNeverWrong·
Super exciting news to share. I’m starting at @xai on the post training team and just wrapped up my first day today. I’m taking a leave from @Stanford and I’m super excited about this new chapter of my life and making @grok the best model. More to come.
Ryan Rong tweet media
English
145
118
1.5K
74.8K
Linda Xue
Linda Xue@xuelinda7·
@samchenn_ sam i saw this tweet and thought it came from a very credible x user good work
English
1
0
1
217
Samuel Chen
Samuel Chen@samchenn_·
Same batch as Chad IDE btw. Glad there’s still smart people focused on innovation instead of launch videos
Samuel Chen tweet media
English
2
0
10
716
Samuel Chen
Samuel Chen@samchenn_·
Training llms is like raising kids
Samuel Chen tweet media
English
1
0
6
249