Tiberiu Mușat

75 posts

@Tiberiu_Musat_

Trying to figure out how AI works 🔍🧠 Currently at @ETH Zurich, previously @EPFL 🇨🇭 LLMs, interpretability, emergence, grokking 🤖

Zurich, Switzerland · Joined February 2025
952 Following · 754 Followers
Pinned Tweet
Tiberiu Mușat (@Tiberiu_Musat_)
🚨 New paper at ICLR 2025 on LLM Interpretability. Why do LLMs need so many layers? 👀 👇 Key findings from my latest paper "Mechanism and Emergence of Stacked Attention Heads in Multi-Layer Transformers" (1/6)
1
7
23
3K
Lucius Bushnaq ⏹️ (@BushnaqLucius)
@Tiberiu_Musat_ For the first two, don't you run into trouble because your activations and parameters are finite float precision, meaning the positional encoding in the attention mechanism doesn't work anymore past a certain size? Or are we macgyvering some kind of different access mechanism?
1
1
2
426
Tiberiu Mușat (@Tiberiu_Musat_)
Why does deep learning generalize? What does weight decay really do? Can algorithmic information theory address these questions? In my latest preprint, I give a proof that the minimum neural weight norm matches the minimum program length (aka Kolmogorov Complexity), up to a logarithmic factor. In other words, the neural network with the smallest possible weight norm (that fits the data) must encode the shortest program (that fits the data). The result only holds for fixed-precision neural nets: infinite precision nets can store infinite information with finite (small) weights. arxiv.org/abs/2605.10878
26
155
1.1K
138.1K
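The closing remark of the tweet above, that infinite-precision weights can store unbounded information while fixed-precision weights cannot, can be illustrated with a toy sketch. This is not the construction from the preprint; the helper names `encode_bits` and `decode_bits` are hypothetical, exact rationals stand in for "infinite precision", and a float64 stands in for "fixed precision".

```python
from fractions import Fraction

def encode_bits(bits: str) -> Fraction:
    """Pack a bitstring into one exact rational 'weight' in [0, 1)."""
    value = Fraction(0)
    for i, b in enumerate(bits, start=1):
        if b == "1":
            value += Fraction(1, 2 ** i)
    return value

def decode_bits(weight, n: int) -> str:
    """Read back the first n binary digits of a weight in [0, 1)."""
    out = []
    for _ in range(n):
        weight *= 2
        if weight >= 1:
            out.append("1")
            weight -= 1
        else:
            out.append("0")
    return "".join(out)

# A 128-bit message stored in a single small weight (illustrative data).
message = "1011001110001111" * 8

# Exact arithmetic: every bit survives, however long the message is.
exact = encode_bits(message)
assert decode_bits(exact, len(message)) == message

# Fixed precision: float64 keeps only a 53-bit significand,
# so the tail of the message is lost after rounding.
rounded = float(exact)
recovered = decode_bits(rounded, len(message))
print(recovered == message)            # False
print(recovered[:48] == message[:48])  # True
```

The exact weight stays below 1 in magnitude no matter how many bits it carries, which is why a norm bound alone says little about information content unless precision is fixed.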
Tiberiu Mușat (@Tiberiu_Musat_)
@BushnaqLucius The proof works for looped neural architectures that can access an unbounded tape. Examples include chain-of-thought transformers, looped transformers with large context, neural computers, etc.
2
1
3
2.3K
Lucius Bushnaq ⏹️ (@BushnaqLucius)
@Tiberiu_Musat_ Actually wait I'm confused, the paper seems to say the result holds for vanilla K-complexity, not K-complexity under some kind of memory bound. How does that work memory-wise when the mlp has a fixed finite width?
2
0
3
2.7K
Tiberiu Mușat (@Tiberiu_Musat_)
@BushnaqLucius I agree, implicit biases usually induce a similar prior without explicit regularization.
0
0
2
2.8K
Lucius Bushnaq ⏹️ (@BushnaqLucius)
@Tiberiu_Musat_ Nice. Notably, the proof is actually about the number of non-zero parameters in the network. So this solution would also be favoured in the training prior on account of its size in the loss landscape, not just because of explicit weight regularisation.
2
0
13
3.4K
Andres (@Andres_Nava_12)
Very happy to share my first first-author preprint with @MatthieuWyart. Meaning is hierarchical: dog → mammal → animal. This hierarchy appears as geometry in LLM embeddings. But where does that geometry come from? We show that word co-occurrence statistics are sufficient to induce it. arxiv.org/abs/2605.23821
9
20
161
9.4K
Tiberiu Mușat retweeted
Imbue (@imbue_ai)
Mechanistic interpretability aspires to be the biology of deep learning. @KuninDaniel and @learning_mech say that an emerging theory of deep learning they and their team call 🛠️ learning mechanics 🛠️ will be the physics.
2
3
22
2.1K
Uzay Macar (@uzaymacar)
🧵New Anthropic Fellows research: We studied mechanisms of "introspective awareness" in LLMs. LLMs can sometimes detect steering vectors injected into their residual stream. But is this worthy of being called introspection, or attributable to some uninteresting confound?👇
28
70
425
46.6K
Tiberiu Mușat retweeted
Dan McAteer (@daniel_mac8)
LLM pretraining may follow a hidden curriculum. Across 9 open models, component skills tend to emerge before composite skills in a predictable order. The order is legible enough to predict held-out task trajectories. What this tells you is that a loss curve dashboard is not enough when training models. We need training milestone evals too, so you know where in the curriculum the model is.
6
19
107
8.7K
Tiberiu Mușat (@Tiberiu_Musat_)
(11/11) Try it in your browser! Works on mobile too. Link in my bio.
0
0
1
135
Tiberiu Mușat (@Tiberiu_Musat_)
A few more photos from my Mandelbrot Set fractal explorer👇(1/11)
1
0
3
289