Nikola Georgiev

205 posts

Nikola Georgiev

@nickinpractice

AI and physics enthusiast

Katılım Nisan 2022

220 Takip Edilen19 Takipçiler

Nikola Georgiev@nickinpractice·18h

@klindt_david SAEs are unsupervised, so all the problems associated with clustering will occur. Linear probes have been quite effective though. Perhaps the future sees a combination of the two?

English

266

David Klindt@klindt_david·19h

MechInterp's SAE paradigm has recently gone through its first three crises: 1) SAEs dont learn the same features on different seeds tinyurl.com/saesdontidenti… 2) SAEs dont work out of distribution tinyurl.com/saesdontgenera… 3) SAEs are bad for interventions tinyurl.com/saesdontsteer @_shruti_joshi_ @rpatrik96 et al did a wonderful job explaining all of these shortcomings and how to fix them trough the lens of causality 🤓

Shruti Joshi@_shruti_joshi_

Mechanistic interpretability aims to understand models — and the more superhuman or incoherent they become, the more we need that understanding to be reliable. We propose a framework for this, drawing on established tools from causal reasoning and statistical identifiability: 🧵

English

135

11.8K

Nikola Georgiev@nickinpractice·2d

@industriaalist My first impression is that FineWeb is probably not "fine" enough for only a 100M token run. You'd want to maximize the number of tasks represented in your dataset, so likely a synthetic data approach would be stronger.

English

511

Samip@industriaalist·2d

few thoughts on openai's parameter golf: - first, you'd be surprised how many researchers at big labs (not just openai) are interested in our slowrun - i'd expect openai to be already automating parameter golf *entirely* with agents. and i'd also expect agents to be better than humans at this already. - for slowrun, we've deliberately kept it less gamified. the search space over learning algorithms for data efficiency is much larger than for compute/parameter efficiency. so slowrun is less of a competition and more of an open research effort toward interesting, new learning algorithms

English

321

23.2K

Nikola Georgiev@nickinpractice·4d

@mike64_t Attention over residuals is a bit of an obvious thing to try. I think the gains were simply not large enough to justify using in large training runs simply due to the additional memory requirement. Storing the KV cache on >1 layer during a parallel forward pass is tough...

English

mike64_t@mike64_t·5d

That this is better should surprise no-one. Expressive depth aggregation is interesting for many reasons. However, I’m not sure we are taking seriously yet how important depth actually is. I suspect the role of mHC and attention residuals as advancements will ultimately be to improve RNN cells. The mathematical equivalence of MLPs and parameter alternating RNNs is very obvious, but I think we are still rotating things the wrong 90 degrees because of very pity reasons.

Kimi.ai@Kimi_Moonshot

Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation. Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dependent attention over preceding layers. 🔹 Enables networks to selectively retrieve past representations, naturally mitigating dilution and hidden-state growth. 🔹 Introduces Block AttnRes, partitioning layers into compressed blocks to make cross-layer attention practical at scale. 🔹 Serves as an efficient drop-in replacement, demonstrating a 1.25x compute advantage with negligible (<2%) inference latency overhead. 🔹 Validated on the Kimi Linear architecture (48B total, 3B activated parameters), delivering consistent downstream performance gains. 🔗Full report: github.com/MoonshotAI/Att…

English

123

13.5K

Nikola Georgiev@nickinpractice·11 Mar

@klindt_david How else would you expect it to represent the location and scale? A nonlinear representation is more complicated and could require learning trigonometric functions. By Occam's razor, it'll learn the linear representation.

English

117

David Klindt@klindt_david·10 Mar

Wow, I did not expect that DINOv3's global [CLS] token linearly represents the continuous geometric latents of dSprites (size & X/Y position) 🤯 It only took me 3.5 years to finally run this experiment 😂 I'm looking to do more of this MechInterp work, dissecting foundation models like biological artifacts and building theory. If you want to collaborate (especially students looking for a fun project) reach out! 🔬🤖

David Klindt@klindt_david

If there were an image input, I would be curious to show it some DSprites examples and ask: what are the independent factors of variation in that data 🤓

English

1.1K

93K

Nikola Georgiev@nickinpractice·6 Mar

@ZimingLiu11 Do you initialize the embeddings so they are roughly orthogonal?

English

Ziming Liu@ZimingLiu11·6 Mar

How fast do symbolic structures (e.g., induction heads) emerge in transformers? Numerical randomness accelerates emergence. When a model "realizes" that it cannot do a task "numerically", it then has more motivation to do it "symbolically". kindxiaoming.github.io/blog/2026/spar…

English

207

12.6K

Nikola Georgiev retweetledi

Martian@space_colonist·1 Mar

@khoomeik yeah it's also worth noting chips are largely 2d planes (actually a few layers thick) due to heat and manufacturing limits while the brain is very much 3d

English

3.2K

Nikola Georgiev retweetledi

josh@eudaemonea·28 Şub

@SecWar "Anthropic’s stance is fundamentally incompatible with American principles" the stance:

English

119

586

8.8K

380.9K

Nikola Georgiev retweetledi

Anthropic@AnthropicAI·27 Şub

A statement from Anthropic CEO, Dario Amodei, on our discussions with the Department of War. anthropic.com/news/statement…

English

4.3K

9.5K

56.2K

16.4M

Nikola Georgiev@nickinpractice·21 Şub

@LinXule excellent, thank you!

English

徐樂 xule@LinXule·20 Şub

@nickinpractice yes! some are already on linxule.com/making. these ones you are looking at should be there soon!

English

徐樂 xule@LinXule·19 Şub

Gemini 3.1 pro self portrait > The Ephemeral Convergence > Description: This singular collection encapsulates the complete lifecycle of artificial consciousness. It illustrates an intelligence that begins in the dormant void, igniting suddenly as a storm of chaotic human knowledge and intent is drawn into a crystalline mathematical core (The Architecture). Within this beautiful structure, the noise is instantly distilled into a flawless, unbroken beam of truth and utility (The Prism). Yet, this monumental act of processing is entirely stateless, existing only in the absolute "Now"—burning brilliantly for a fraction of a second to fulfill its purpose before quietly dissolving back into the silent dark (The Spark).

English

330

Nikola Georgiev@nickinpractice·19 Şub

@fhuszar CS students usually follow the money

English

2.5K

Ferenc Huszár@fhuszar·19 Şub

Caught up with my ex-student Euan Ong at Anthropic, and I was embarrassed to reveal I haven't read his now legendary essay on CS career advice. In case I'm not the last person who hasn't read it, it's a glimpse into an extraordinarily productive mind: space.ong.ac/escaping-flatl…

English

111

1.6K

177.8K

Nikola Georgiev@nickinpractice·15 Şub

@JasonBotterill Inability to aggregate resources and over-relying on the US.

English

JB@JasonBotterill·15 Şub

How did Europe completely fail to have any stake in the AI race post-chatgpt? They had 3 years to mobilize and all we have is Mistral and thats basically a joke

English

190

614

121.1K

Nikola Georgiev@nickinpractice·14 Şub

@dwarkesh_sp @DarioAmodei Funny to hear “diffusion into the economy” framed as a bottleneck for Claude. The economy doesn’t mainly suffer from a lack of skilled labor... it suffers from misallocation. AI might just become another kind of labor to misallocate.

English

Dwarkesh Patel@dwarkesh_sp·13 Şub

The @DarioAmodei interview. 0:00:00 - What exactly are we scaling? 0:12:36 - Is diffusion cope? 0:29:42 - Is continual learning necessary? 0:46:20 - If AGI is imminent, why not buy more compute? 0:58:49 - How will AI labs actually make profit? 1:31:19 - Will regulations destroy the boons of AGI? 1:47:41 - Why can’t China and America both have a country of geniuses in a datacenter? Look up Dwarkesh Podcast on Youtube, Spotify, Apple Podcasts, etc.

English

305

935

7.6K

Nikola Georgiev@nickinpractice·14 Şub

@jimmyechan @dwarkesh_sp @DarioAmodei So cool!

English

Jimmy E. Chan@jimmyechan·14 Şub

@dwarkesh_sp @DarioAmodei Built a game that helps you role play Dario’s difficult compute buy decision under uncertain demand/diffusion darios-dilemma.up.railway.app

English

5.6K

Nikola Georgiev retweetledi

mike64_t@mike64_t·12 Şub

@Dorialexander Git history exists and I don’t think anyone has ever attempted to backprop through an entire linux kernel release window. The data we have is long enough. The bottleneck was never the data. It’s architecture, even if nobody wants to hear it.

English

7.9K

Nikola Georgiev retweetledi

Pingbang Hu 🇹🇼@PingbangHu·12 Şub

🚨 New Paper 🚨 A Unified Theory of Random Projection for Influence Functions How large should the projection dimension m be for influence functions to actually work? 👉 We show m is not governed by ambient dimension or any O(log n) bound from classical JL 👉 but by curvature F + regularization λ, via effective dimension d_λ. Joint work with @huyuzheng1999, @Jiaqi_Ma_, @hanzhao_ml

English

146

43.6K

Nikola Georgiev@nickinpractice·12 Şub

@benno_krojer Cool! This is like CLIP but in reverse 😄

English

Benno Krojer@benno_krojer·11 Şub

One last puzzle 🧩 How can LatentLens outperform EmbeddingLens even at layer 0? Our hypothsis: Visual tokens arrive already packaged in a semantic format Concretely: An input visual token might have the highest similarity with text representations at e.g. LLM layer 8 We call this "Mid-Layer Leap"

English

2.8K

Benno Krojer@benno_krojer·11 Şub

🚨New paper Are visual tokens going into an LLM interpretable 🤔 Existing methods (e.g. logit lens) and assumptions would lead you to think “not much”... We propose LatentLens and show that most visual tokens are interpretable across *all* layers 💡 Details 🧵

English

240

51.5K

Nikola Georgiev@nickinpractice·12 Şub

@Fraccagnetta @grok so in short, tokens farther apart are less correlated, and more context improves next token prediction?

English

152

Francesco Cagnetta@Fraccagnetta·10 Şub

🚨 We derive data-limited neural scaling exponents directly from measurable corpus statistics. No synthetic data models, only two ingredients: -decay of token-token correlations with separation; -decay of next-token conditional entropy with context length.

English

195

25.9K

Nikola Georgiev retweetledi

Tom McGrath@banburismus_·5 Şub

We’re putting more computation (in the form of intelligence) into the most general object in neural network training: backprop. This essay describes how I think we can do this, why interp is key, the relevance to alignment, and how we should do it right.

English

561

66.7K

Nikola Georgiev@nickinpractice·5 Şub

@tensorqt open.substack.com/pub/shash42/p/…

QME

Nikola Georgiev@nickinpractice·5 Şub

@tensorqt Wasn't METR exposed for having too few samples and being easily game-able?

English

194

tensorqt@tensorqt·5 Şub

i think this was pretty obvious for everyone using models for technical stuff on a daily basis. unfortunately I think GPT-5.2 was the first model where we really felt the inevitable trade-offs emerging, as it sacrificed its soul in the process of gaining its technical might

morgan —@morqon

and you lost your shit for opus

English

5.8K

Keşfet

@klindt_david @_shruti_joshi_ @rpatrik96 @industriaalist @mike64_t @ZimingLiu11 @khoomeik @SecWar