Julia Kempe

230 posts

Julia Kempe

@KempeLab

Silver Professor at NYU Courant & CDS Director & Research Scientist at MetaFAIR Research in Machine Learning, past in Quantum Computing & Finance Posts my own.

انضم Nisan 2024

230 يتبع2.6K المتابعون

Julia Kempe@KempeLab·20h

@chi_t_williams @ziv_ravid Way before

English

Kai Williams@chi_t_williams·1d

@ziv_ravid Quick Q: was this recorded before or after the unit distance result?

English

529

Julia Kempe أُعيد تغريده

Ravid Shwartz Ziv@ziv_ravid·1d

Math is starting to fall — so what's next? 🎙️ New episode of The Information Bottleneck is out! We've all seen the recent wave of Erdős problems being solved by frontier models, and the question now is what it actually means for the future of mathematics, and for AI research more broadly. We sit down with @KempeLab - Professor at NYU's Center for Data Science and researcher at Meta FAIR's Foundations of Reasoning team, to dig into exactly that. Julia makes the case that math is the next Go. With formal verification and LLM agents that can propose, formalize, and check proofs at scale, a new industry of automated mathematical discovery is closer than most mathematicians believe. We also get into: → Why physics is harder than math → Model collapse, synthetic data, and what's left to squeeze from the internet → Scaling limits, energy costs, and where academia still has the edge → How to advise PhD students when Claude can already do their first-year work → AI safety, agent security, and the Wild West of deployed agents → Why the Renaissance researcher is finally back One of our favorite conversations yet. Listen now 👇

English

2.6K

Julia Kempe أُعيد تغريده

NYU Center for Data Science@NYUDataScience·6d

The 2nd Sci4DL (@scifordl) workshop at #ICLR2026 in Rio drew a packed room. Organized by CDS PhD alumni @ZKadkhodaie & @LotfiSanae, CDS Instructor @FlorentinGuth, and CDS-associated Prof. Eero Simoncelli, with CDS Silver Prof. @KempeLab as a speaker. scienceofdlworkshop.github.io/2026/

English

Julia Kempe@KempeLab·18 May

@scottnarmstrong ... and of all further derived statements end lemmata!! Intergenerationally...

English

200

Scott Armstrong@scottnarmstrong·18 May

Even this is far too lenient. If you really cared about scientific integrity you’d not only do a retroactive banning of their publications, but you’d axiomatize the negation of all the theorems they proved.

TotientQuotient@t0tientqu0tient

Funny how people complain that the new arXiv policy is too harsh. If anything, it's still way too lenient. Getting caught with AI slop should result in a permanent, publicly viewable (in the form of a hall of shame) ban and retroactive redaction of all previous arXiv submissions.

English

4.5K

Julia Kempe أُعيد تغريده

NYU Center for Data Science@NYUDataScience·18 May

CDS Silver Prof @KempeLab working with @arnal_charles, @TacoCohen, Vivien Cabannes, and Remi Munos studied how LLMs can train more efficiently by reusing past experience instead of constantly generating new data. Accepted to the ICML '26 conference. arxiv.org/abs/2604.08706

English

1.1K

Julia Kempe@KempeLab·16 May

3/3 Scaling of foundation models & large-scale engineering continues, but further progress in machine intelligence will require new ideas & breakthroughs. I believe academia will continue to play a key role, particularly through published and opensource research. Exciting times!

English

5.2K

Julia Kempe@KempeLab·16 May

2/3 I am deeply grateful for the opportunity to collaborate with so many amazing colleagues at FAIR and MSL @AIatMeta. I also want to thank FAIR leadership, past and present, especially @ylecun, @jpineau1, @NailaMurray, David Lopez Paz, @rob_fergus for letting us explore.

English

6.2K

Julia Kempe@KempeLab·16 May

1/3 My time at Meta FAIR will soon come to a close. I joined nearly two years ago full-time to help advance LLM reasoning. It has been a remarkable journey working with and leading an exceptionally talented team.

English

269

99.2K

Julia Kempe@KempeLab·10 May

@arnal_charles Lucky for our team to have you, Charles!

English

Charles Arnal@arnal_charles·7 May

Very excited to be joining Meta Superintelligence Labs as a Research Scientist! I’ll be continuing my work on RL and AI for maths with @KempeLab, Rémi Munos, and my longtime partner in crime, Vivien Cabannes.

English

469

33.7K

Julia Kempe@KempeLab·26 Nis

@scifordl @ZKadkhodaie An amazing, super well organised (and popular!) workshop! Thanks to all the organisers!

English

203

Scientific Methods for Understanding Deep Learning@scifordl·26 Nis

Closing remarks: @ZKadkhodaie is happy!

Scientific Methods for Understanding Deep Learning tweet media

English

2.6K

Julia Kempe أُعيد تغريده

Scientific Methods for Understanding Deep Learning@scifordl·26 Nis

"Insights into LLMs reasoning" by Julia Kempe @KempeLab

English

4.5K

Julia Kempe@KempeLab·26 Nis

Tomorrow's Frontiers in Assciative Memory workshop at ICLR26 (room 201C) will be an exciting event!

Dmitry Krotov@DimaKrotov

For me the highlight of this year’s #ICLR2026 is the New Frontiers in Associative Memory workshop. Memory is an essential part of human cognition, yet it is present only in rudimentary forms in modern AI networks. The workshop will tackle recent advances in artificial memory models and new ideas for the future developments in this space. Amazing lineup of speakers including: Jay McClelland, Paul Liang, Xueyan Niu, and many others. 📍📅 Auditorium 201 C, Sunday April 26 9am-5pm. 👉 Additional info: nfam2026.amemory.net Join us tomorrow for the exciting conversations about Associative Memory! @iclr_conf @KempeLab @RogerioFeris @HildeKuehne @Ben_Hoov @krizna_b @pliang279 @JLMcCelland @p_ram_p @andre_t_martins @du_yilun @dlipshutz @meisamrr

English

1.8K

Julia Kempe@KempeLab·25 Nis

Our work today (Sat) @iclr2026: ORAL: - OpenApps 3:39pm 204A/B POSTERS: -10:30am: Soft Tokens, Hard Truths, P3-#1020 -10:30am: OpenApps, P3-#308 - 3:15pm: How reinforcement learning after next-token prediction facilitates learning, P3-#806 Come to discuss!

English

1.1K

Julia Kempe@KempeLab·23 Nis

Heading to Rio for #ICLR2026 (there 24-27): Giving a talk & on panel #Sci4DL workshop Today's (Wed) poster on interpretability: • From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers Thu Apr 23, 10:30 AM-1:00 PM | P4-#4002

English

714

Julia Kempe أُعيد تغريده

Natasha Butt@NatashaEve4·20 Nis

Excited to be at #ICLR2026 in Rio this week! Presenting “Soft Tokens, Hard Truths” Saturday 10.30am (Pavilion 3, P3-#1020). Feel free to DM me to chat about self-improvement, reasoning, code gen. I’m also on the job market for industry research positions.

Natasha Butt@NatashaEve4

🔥New preprint: Soft Tokens, Hard Truths Introduces the first scalable continuous-token RL method for LLMs - no reference CoTs needed; scales to hundreds of thought tokens. Best to train soft, infer hard! Pass @1 parity ⚖️, Pass@32 gains 📈& better robustness 🛡️ vs. hard CoT 1/🧵

English

3.1K

Julia Kempe أُعيد تغريده

finbarr@finbarrtimbers·13 Nis

love this replay buffer paper from Meta: arxiv.org/abs/2604.08706 "methods like PPO or GRPO typically operate as on-policy as possible, meaning rollouts are generated, used for a single gradient update, and immediately discarded." this is crazy and we shouldn't do this!

English

483

56.6K

Julia Kempe أُعيد تغريده

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr·13 Nis

Efficient RL Training for LLMs with Experience Replay "Empirically, we show that a well-designed replay buffer can drastically reduce inference compute without degrading – and in some cases even improving – final model performance, while preserving policy entropy."

Tanishq Mathew Abraham, Ph.D. tweet media

English

350

22.1K

Julia Kempe أُعيد تغريده

Charles Arnal@arnal_charles·14 Nis

(1/9) Experience replay can cut LLM RL training compute by up to ~40% (without hurting final accuracy—and sometimes improving it). Paper: arxiv.org/abs/2604.08706

English

199

16.3K

Julia Kempe أُعيد تغريده

Rosinality@rosinality·13 Nis

Experience replay of LLM RL. Hyperparameters for replay buffers, buffer sizes N, number of new rollouts R, minibatch size B depend on compute imbalance of rollout generation and training, and task-specific statistics. It could be hard to optimize this well.

English

13.1K

Julia Kempe أُعيد تغريده

Scott Armstrong@scottnarmstrong·8 Nis

@KempeLab and I just vibe-coded a Lean 4 formalization of elliptic De Giorgi–Nash–Moser theory. This is a cornerstone of modern elliptic PDE: local boundedness, weak Harnack, Harnack, and Hölder regularity for weak solutions with merely bounded measurable coefficients.

English

150

18.7K

اكتشف

@chi_t_williams @ziv_ravid @scifordl @ZKadkhodaie @LotfiSanae @FlorentinGuth @scottnarmstrong @arnal_charles