Daniel Scalena

101 posts

Daniel Scalena

@daniel_sc4

PhDing @unimib 🇮🇹 & @GroNlp 🇳🇱, interpretability et similia

Katılım Şubat 2015

777 Takip Edilen134 Takipçiler

Sabitlenmiş Tweet

Daniel Scalena@daniel_sc4·16 Eki

You can easily save up to 65% of compute while improving performance on reasoning tasks 🤯 👀 Meet EAGer: We show that monitoring token-level uncertainty lets LLMs allocate compute dynamically - spending MORE on hard problems, LESS on easy ones. 🧵👇

English

1.9K

Daniel Scalena@daniel_sc4·13 Mar

@GoodfireAI Nice work! I wonder, probe trained on answer choices needs known options. What if you probe model confidence and early exit there regardless of the answer it's thinking? I feel like after some t the model already knows and the rest is just overthinking

English

192

Goodfire@GoodfireAI·12 Mar

LLMs often reason “performatively” well after deciding on a final answer - something that CoT monitors are slow to catch. Our new paper finds that: - probes can help monitor for this - it seems to track with task difficulty - probes enable early CoT exit, saving tokens! (1/7)

English

328

41.7K

Daniel Scalena@daniel_sc4·13 Mar

@paradigmainc Ok I was trying to cook something to improve model’s scientific creativity, throwing the repo into flywheel feels like the next logical step

English

172

Paradigma@paradigmainc·12 Mar

introducing Flywheel: the infrastructure for autonomous research.

English

513

91.9K

Daniel Scalena@daniel_sc4·4 Şub

@gsarti_ Gemini listened to its AGI intrusive thoughts on this

English

Gabriele Sarti@gsarti_·4 Şub

Surely a good omen, thanks Gemini

English

285

Daniel Scalena@daniel_sc4·2 Şub

@thelokasiffers They were pioneers vibecoding it with gpt2 back then

English

giulio@thelokasiffers·1 Şub

I think the city of Rome vibecoded their public transport system

English

Daniel Scalena retweetledi

Gabriele Sarti@gsarti_·9 Oca

Happy to announce I will be mentoring a SPAR project this Spring! ✨Check out the programme and apply by Jan 14th to work with me on understanding and mitigating implicit personalization in LLMs, i.e. how models form hidden beliefs about users that shape their responses.

English

807

Daniel Scalena retweetledi

Gabriele Sarti@gsarti_·4 Oca

Now accepted at EACL main! Check it out! ⬇️

Daniel Scalena@daniel_sc4

📢 New paper: Applied interpretability 🤝 MT personalization! We steer LLM generations to mimic human translator styles on literary novels in 7 languages. 📚 SAE steering can beat few-shot prompting, leading to better personalization while maintaining quality. 🧵1/

English

1.4K

Daniel Scalena@daniel_sc4·4 Oca

Want models to translate in the style you actually like? Our paper just got accepted at EACL Main 🚀, check out our work on using interpretability for MT personalization! And, see you in Morocco! 🇲🇦

Daniel Scalena@daniel_sc4

English

253

Daniel Scalena@daniel_sc4·14 Ara

@thelokasiffers Woo big congrats on the launch and shoutout for starting in Rome. Can’t wait to hear more!

English

giulio@thelokasiffers·13 Ara

Arriviamo

tensorqt@tensorqt

announcement: I will be founding a new company with @thelokasiffers and @EmanueleRodola. it seems very clear to us that we're on the verge of completely re-imagining many of the institutions humans have consolidated across history. one of these is the way we do, interpret, review and utilize science. we will be laying the foundations to build a breakthrough factory, and will approach the problem from a research-heavy perspective, while at the same time offering a product we hope many of you will use soon. crucially, we will do this in Europe, starting from Rome. over the next few weeks we will bring together a small number of angels to support our already well underway efforts, before making both our product and research available to the public. in the near future, we will also be expanding the team, with the specific purpose of building the single most talent-dense company in this space. if automating research taste sounds like an inevitable challenge, if you can feel that the current way we're doing science will change, if you think we don't have enough frontier labs in Europe, please reach out to any of us in DMs.

Italiano

9.4K

Daniel Scalena@daniel_sc4·29 Eki

@Turn_Trout @GladiaLab SVs are approximate directions in the latent space. They look for exact matches in the latent space. This could make things harder, but I’m still curious to know!

English

119

Alex Turner@Turn_Trout·28 Eki

@GladiaLab I'm curious what "prompt" you'd recover if you add steering vectors to a representation!

English

4.9K

GLADIA Research Lab@GladiaLab·27 Eki

LLMs are injective and invertible. In our new paper, we show that different prompts always map to different embeddings, and this property can be used to recover input tokens from individual embeddings in latent space. (1/6)

English

280

1.3K

10.9K

Daniel Scalena@daniel_sc4·27 Eki

@andy_peng05 @Cohere_Labs @UvA_Amsterdam Hi, thank you very much! Good catch on the p@1, it was meant to be p@k (pass@k). We’ll fix it asap in the preprint!

English

Andy Peng@andy_peng05·27 Eki

@Cohere_Labs @UvA_Amsterdam Nice paper! Is p@1 in table 3 in the appendix a typo?

English

Cohere Labs@Cohere_Labs·8 Kas

We are committed to making meaningful progress in machine learning research through open collaboration. Follow this 🧵to stay on top of our research contributions.

English

368

Daniel Scalena@daniel_sc4·16 Eki

@ZotosLeo @FersiniE @MalvinaNissim @ahmetustun89 Also cc @jiawzhao and @FuYichao123 — I guess your great work on DeepConf is highly relevant to this project!

English

Daniel Scalena@daniel_sc4·16 Eki

Takeaway: EAGer shows we can be MORE efficient & MORE effective by letting models focus compute where it matters most. 📄Paper: arxiv.org/abs/2510.11170 💻Code: github.com/DanielSc4/EAGer ✨Huge thanks to my mentors and collaborators @ZotosLeo @FersiniE @MalvinaNissim @ahmetustun89

English

Daniel Scalena@daniel_sc4·16 Eki

English

1.9K

Daniel Scalena@daniel_sc4·16 Eki

Had so much fun working on this! Thank you @Cohere_Labs for sharing our work!

Cohere Labs@Cohere_Labs

How can we make reasoning models more efficient without sacrificing performance? Introducing EAGER, our new entropy-aware generation method, saving compute by up to 65% while lifting Pass @k by up to 37% on benchmarks like AIME.

English

401

Daniel Scalena@daniel_sc4·7 Eyl

@thelokasiffers as a non citizen, best way to discover Rome ruins: getting lost through the ATAC connection graph

English

giulio@thelokasiffers·6 Eyl

a fun cope of living in a city devoid of proper mobility solutions is that each time you wanna go somewhere you get to play a game coming up with creative ways of getting to destination

English

700

Daniel Scalena@daniel_sc4·21 Ağu

More about the work👇

Daniel Scalena@daniel_sc4

📘 Literary translation isn't just about accuracy, but also creatively conveying meaning across languages. But LLMs prompted for MT are very literal. Prompting & steering to the rescue! Can we personalize LLM’s MT when few examples are available, without further tuning? 🔍 2/

English

Daniel Scalena@daniel_sc4·21 Ağu

I’ll be attending the NEMI 2025 workshop this Friday and presenting a poster👇. Happy to chat about cool interpretability stuff there!

David Bau@davidbau

This Friday NEMI 2025 is at Northeastern in Boston, 8 talks, 24 roundtables, 90 posters; 200+ attendees. Thanks to @GoodfireAI for sponsoring! nemiconf.github.io/summer25/ If you can't make it in person, the livestream will be here: youtube.com/live/4BJBisHk1…

English

193

Keşfet

@GoodfireAI @paradigmainc @gsarti_ @thelokasiffers @Turn_Trout @GladiaLab @andy_peng05 @Cohere_Labs