Andrew Saxe

732 posts

Andrew Saxe banner
Andrew Saxe

Andrew Saxe

@SaxeLab

Prof at @GatsbyUCL and @SWC_Neuro, trying to figure out how we learn. Bluesky: @SaxeLab Mastodon: @[email protected]

London, UK Inscrit le Kasım 2019
379 Abonnements5.7K Abonnés
Tweet épinglé
Andrew Saxe
Andrew Saxe@SaxeLab·
Why don’t neural networks learn all at once, but instead progress from simple to complex solutions? And what does “simple” even mean across different neural network architectures? Sharing our new paper @iclr_conf led by Yedi Zhang with Peter Latham arxiv.org/abs/2512.20607
GIF
English
9
51
409
21.7K
Andrew Saxe retweeté
Francis Bach
Francis Bach@BachFrancis·
Looking for alternatives to quadratic functions for closed-form analysis in optimization? This post explores matrix Riccati dynamics and their applications to neural networks. francisbach.com/closed-form-dy…
GIF
English
0
23
154
8.7K
Andrew Saxe retweeté
Andrew Lampinen
Andrew Lampinen@AndrewLampinen·
What is the relationship between memorization and generalization in AI? Is there a fundamental tradeoff? In a new blog post I’ve reviewed some of the evolving perspectives on memorization & generalization in machine learning, from classic perspectives through LLMs. Link below:
Andrew Lampinen tweet media
English
12
45
424
23.3K
Andrew Saxe retweeté
Stefano Sarao Mannelli
Stefano Sarao Mannelli@stefsmlab·
📢 We’re now accepting applications for the 2026 School on Analytical Connectionism dedicated this year to Language Acquisition. 📍 Gothenburg, Sweden 🗓️ August 17–28, 2026 ☠️ Apply by April 17! 🔗 analytical-connectionism.net/school/2026/ 👇 Meet the experts joining us this summer!
Stefano Sarao Mannelli tweet media
English
1
9
30
4.4K
Andrew Saxe
Andrew Saxe@SaxeLab·
We’re hiring postdocs/research scientists! Your interests can be anywhere on the spectrum from pure theory to empirically testing predictions relevant to AI safety. Our theoretical work relies on dynamical systems and tools from statistical physics. 3
English
2
2
47
2.8K
Andrew Saxe
Andrew Saxe@SaxeLab·
Excited to launch Principia, a nonprofit research organisation at the intersection of deep learning theory and AI safety. Our goal is to develop theory for modern machine learning that can help us understand network behaviors, including those critical for AI safety. 1
English
8
36
300
18.3K
Andrew Saxe
Andrew Saxe@SaxeLab·
Equipped with this theory, we make new predictions about how network width, data distribution, and initialization affect learning dynamics. For example, increasing the number of attention heads in linear attention shortens the plateaus in learning.
English
1
0
7
701
Andrew Saxe
Andrew Saxe@SaxeLab·
Why don’t neural networks learn all at once, but instead progress from simple to complex solutions? And what does “simple” even mean across different neural network architectures? Sharing our new paper @iclr_conf led by Yedi Zhang with Peter Latham arxiv.org/abs/2512.20607
GIF
English
9
51
409
21.7K
Andrew Saxe retweeté
Clémentine Dominé, Phd 🍊@NeurIPS
🎓Thrilled to share I’ve officially defended my PhD!🥳 At @GatsbyUCL, my research explored how prior knowledge shapes neural representations. I’m deeply grateful to my mentors, @SaxeLab and Caswell Barry, my incredible collaborators, and everyone who supported me! Stay tuned!
Clémentine Dominé, Phd 🍊@NeurIPS tweet mediaClémentine Dominé, Phd 🍊@NeurIPS tweet media
English
13
11
357
17.8K