Federico Tiblias (@akatief48) - Twitter Profili

Mingyu_Jin19@fnruji316625·17 May

A really interesting paper on representation geometry in LLMs written by my friend @frankniujc : “Hypothesis-Driven Feature Manifold Analysis in LLMs via SMDS” proposes a model-agnostic way to test geometric hypotheses about latent representations instead of assuming everything is just linear directions. They find that different concepts naturally form different structures like circles, lines, clusters, and that these manifolds remain surprisingly stable across model families/sizes while also dynamically reshaping with context. Very cool bridge between mechanistic interpretability and representation geometry. 🔥 Especially liked the framing that reasoning may operate over structured manifolds rather than isolated features. Paper: openreview.net/pdf?id=vCKZ40Y… Code: github.com/UKPLab/tmlr202… #LLM #MechanisticInterpretability #AIResearch #RepresentationLearning #TMLR #Interpretability #DeepLearning

English

220

21.4K

Federico Tiblias@akatief48·18 May

@coponder @frankniujc @fnruji316625 Happy to hear our work left such a good impression! Please do reach out via DMs! (looks like I can't message you first)

English

Robin Goins@coponder·17 May

@frankniujc @fnruji316625 @akatief48 On the off-chance @akatief48 or you will be in SF this June/July/August, we'd love for you to give a talk at Mox (moxsf.com) on this research!! Happy to DM with more details. :)

English

Federico Tiblias

Keşfet