Julia Kempe

230 posts

Julia Kempe banner
Julia Kempe

Julia Kempe

@KempeLab

Silver Professor at NYU Courant & CDS Director & Research Scientist at MetaFAIR Research in Machine Learning, past in Quantum Computing & Finance Posts my own.

انضم Nisan 2024
230 يتبع2.6K المتابعون
Kai Williams
Kai Williams@chi_t_williams·
@ziv_ravid Quick Q: was this recorded before or after the unit distance result?
English
1
1
1
529
Julia Kempe أُعيد تغريده
Ravid Shwartz Ziv
Ravid Shwartz Ziv@ziv_ravid·
Math is starting to fall — so what's next? 🎙️ New episode of The Information Bottleneck is out! We've all seen the recent wave of Erdős problems being solved by frontier models, and the question now is what it actually means for the future of mathematics, and for AI research more broadly. We sit down with @KempeLab - Professor at NYU's Center for Data Science and researcher at Meta FAIR's Foundations of Reasoning team, to dig into exactly that. Julia makes the case that math is the next Go. With formal verification and LLM agents that can propose, formalize, and check proofs at scale, a new industry of automated mathematical discovery is closer than most mathematicians believe. We also get into: → Why physics is harder than math → Model collapse, synthetic data, and what's left to squeeze from the internet → Scaling limits, energy costs, and where academia still has the edge → How to advise PhD students when Claude can already do their first-year work → AI safety, agent security, and the Wild West of deployed agents → Why the Renaissance researcher is finally back One of our favorite conversations yet. Listen now 👇
Ravid Shwartz Ziv tweet media
English
5
4
30
2.6K
Julia Kempe
Julia Kempe@KempeLab·
@scottnarmstrong ... and of all further derived statements end lemmata!! Intergenerationally...
English
0
0
1
200
Scott Armstrong
Scott Armstrong@scottnarmstrong·
Even this is far too lenient. If you really cared about scientific integrity you’d not only do a retroactive banning of their publications, but you’d axiomatize the negation of all the theorems they proved.
TotientQuotient@t0tientqu0tient

Funny how people complain that the new arXiv policy is too harsh. If anything, it's still way too lenient. Getting caught with AI slop should result in a permanent, publicly viewable (in the form of a hall of shame) ban and retroactive redaction of all previous arXiv submissions.

English
3
2
44
4.5K
Julia Kempe
Julia Kempe@KempeLab·
3/3 Scaling of foundation models & large-scale engineering continues, but further progress in machine intelligence will require new ideas & breakthroughs. I believe academia will continue to play a key role, particularly through published and opensource research. Exciting times!
English
1
1
42
5.2K
Julia Kempe
Julia Kempe@KempeLab·
2/3 I am deeply grateful for the opportunity to collaborate with so many amazing colleagues at FAIR and MSL @AIatMeta. I also want to thank FAIR leadership, past and present, especially @ylecun, @jpineau1, @NailaMurray, David Lopez Paz, @rob_fergus for letting us explore.
English
1
1
24
6.2K
Julia Kempe
Julia Kempe@KempeLab·
1/3 My time at Meta FAIR will soon come to a close. I joined nearly two years ago full-time to help advance LLM reasoning. It has been a remarkable journey working with and leading an exceptionally talented team.
English
11
4
269
99.2K
Charles Arnal
Charles Arnal@arnal_charles·
Very excited to be joining Meta Superintelligence Labs as a Research Scientist! I’ll be continuing my work on RL and AI for maths with @KempeLab, Rémi Munos, and my longtime partner in crime, Vivien Cabannes.
Charles Arnal tweet media
English
24
12
469
33.7K
Julia Kempe
Julia Kempe@KempeLab·
Tomorrow's Frontiers in Assciative Memory workshop at ICLR26 (room 201C) will be an exciting event!
Dmitry Krotov@DimaKrotov

For me the highlight of this year’s #ICLR2026 is the New Frontiers in Associative Memory workshop. Memory is an essential part of human cognition, yet it is present only in rudimentary forms in modern AI networks. The workshop will tackle recent advances in artificial memory models and new ideas for the future developments in this space. Amazing lineup of speakers including: Jay McClelland, Paul Liang, Xueyan Niu, and many others. 📍📅 Auditorium 201 C, Sunday April 26 9am-5pm. 👉 Additional info: nfam2026.amemory.net Join us tomorrow for the exciting conversations about Associative Memory! @iclr_conf @KempeLab @RogerioFeris @HildeKuehne @Ben_Hoov @krizna_b @pliang279 @JLMcCelland @p_ram_p @andre_t_martins @du_yilun @dlipshutz @meisamrr

English
0
0
9
1.8K
Julia Kempe
Julia Kempe@KempeLab·
Our work today (Sat) @iclr2026: ORAL: - OpenApps 3:39pm 204A/B POSTERS: -10:30am: Soft Tokens, Hard Truths, P3-#1020 -10:30am: OpenApps, P3-#308 - 3:15pm: How reinforcement learning after next-token prediction facilitates learning, P3-#806 Come to discuss!
English
0
2
9
1.1K
Julia Kempe
Julia Kempe@KempeLab·
Heading to Rio for #ICLR2026 (there 24-27): Giving a talk & on panel #Sci4DL workshop Today's (Wed) poster on interpretability: • From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers Thu Apr 23, 10:30 AM-1:00 PM | P4-#4002
English
0
2
12
714
Julia Kempe أُعيد تغريده
Natasha Butt
Natasha Butt@NatashaEve4·
Excited to be at #ICLR2026 in Rio this week! Presenting “Soft Tokens, Hard Truths” Saturday 10.30am (Pavilion 3, P3-#1020). Feel free to DM me to chat about self-improvement, reasoning, code gen. I’m also on the job market for industry research positions.
Natasha Butt@NatashaEve4

🔥New preprint: Soft Tokens, Hard Truths Introduces the first scalable continuous-token RL method for LLMs - no reference CoTs needed; scales to hundreds of thought tokens. Best to train soft, infer hard! Pass@1 parity ⚖️, Pass@32 gains 📈& better robustness 🛡️ vs. hard CoT 1/🧵

English
0
6
22
3.1K
Julia Kempe أُعيد تغريده
finbarr
finbarr@finbarrtimbers·
love this replay buffer paper from Meta: arxiv.org/abs/2604.08706 "methods like PPO or GRPO typically operate as on-policy as possible, meaning rollouts are generated, used for a single gradient update, and immediately discarded." this is crazy and we shouldn't do this!
English
12
51
483
56.6K
Julia Kempe أُعيد تغريده
Tanishq Mathew Abraham, Ph.D.
Tanishq Mathew Abraham, Ph.D.@iScienceLuvr·
Efficient RL Training for LLMs with Experience Replay "Empirically, we show that a well-designed replay buffer can drastically reduce inference compute without degrading – and in some cases even improving – final model performance, while preserving policy entropy."
Tanishq Mathew Abraham, Ph.D. tweet media
English
5
53
350
22.1K
Julia Kempe أُعيد تغريده
Charles Arnal
Charles Arnal@arnal_charles·
(1/9) Experience replay can cut LLM RL training compute by up to ~40% (without hurting final accuracy—and sometimes improving it). Paper: arxiv.org/abs/2604.08706
Charles Arnal tweet media
English
4
41
199
16.3K
Julia Kempe أُعيد تغريده
Rosinality
Rosinality@rosinality·
Experience replay of LLM RL. Hyperparameters for replay buffers, buffer sizes N, number of new rollouts R, minibatch size B depend on compute imbalance of rollout generation and training, and task-specific statistics. It could be hard to optimize this well.
Rosinality tweet media
English
4
8
77
13.1K
Julia Kempe أُعيد تغريده
Scott Armstrong
Scott Armstrong@scottnarmstrong·
@KempeLab and I just vibe-coded a Lean 4 formalization of elliptic De Giorgi–Nash–Moser theory. This is a cornerstone of modern elliptic PDE: local boundedness, weak Harnack, Harnack, and Hölder regularity for weak solutions with merely bounded measurable coefficients.
Scott Armstrong tweet media
English
4
42
150
18.7K