Samuel Chen

@samchenn_

phys/cs @ stanford | v6 plateau

Katılım Şubat 2025

89 Takip Edilen30 Takipçiler

Samuel Chen@samchenn_·23 Nis

@michaelyli__ This guys big on eliminating inductive biases🔥

English

354

Michael Y. Li@michaelyli_·22 Nis

Can a language model learn, end-to-end, what to keep in its own KV cache and what to throw away? Can it learn to forget while it learns to reason? Deep learning's central lesson: capability emerges from end-to-end optimization, not heuristics/strong inductive biases. But for efficiency, we rely heavily on hand-designed approaches. 🗑️ Introducing Neural Garbage Collection (NGC): we train a language model to jointly reason and manage its own KV cache, using reinforcement learning with outcome-based task reward alone. No SFT, no proxy objectives, no summarization in natural language. New paper with @jubayer_hamid, Emily Fox, and @noahdgoodman!

English

133

905

163.1K

Samuel Chen@samchenn_·21 Oca

@Andercot yes Islands of Stability, but we should probably also secure the Twin Peaks of unstable isotopes that hospitals need

English

Andrew Côté@Andercot·21 Oca

We should consider immediately negotiating for the purchase of the Islands of Stability to safeguard the West's future need of stable isotopes. Thank you for your attention to this matter

English

124

7.7K

Samuel Chen@samchenn_·7 Oca

@NousResearch Sum light for @JoeLi5050 😸

English

268

Nous Research@NousResearch·6 Oca

Introducing NousCoder-14b, a competitive olympiad programming model. Our latest blog details the full findings from extensive experiments and logs with the full stack released - the RL environment, benchmark, and harness built in Atropos, all fully reproducible with our open training stack. NousCoder-14b was post-trained on Qwen3-14B by researcher in residence @JoeLi5050 using 48 B200s over the course of 4 days, our Atropos framework, and @modal's autoscaler. It achieves a Pass@1 accuracy of 67.87%,+7.08% over Qwen's baseline accuracy using verifiable execution rewards.

English

109

1.2K

675.8K

Samuel Chen@samchenn_·18 Ara

@RyanNeverWrong @xai @Stanford @grok So so so well deserved🔥

English

Ryan Rong@RyanNeverWrong·9 Ara

Super exciting news to share. I’m starting at @xai on the post training team and just wrapped up my first day today. I’m taking a leave from @Stanford and I’m super excited about this new chapter of my life and making @grok the best model. More to come.