Lucy Farnik

497 posts

@lucyfarnik

Meowing at the shoggoths. PhDing. Poking spherical cows with a stick. DMs open!

London, UK · Joined March 2022
370 Following · 843 Followers
Pinned Tweet
Lucy Farnik @lucyfarnik
🚨NEW PAPER ALERT 🚨 SAEs can give us insight into the representations of LLMs. But what about the LLMs' computations? If we want to understand LLMs, we don't just need sparse SAE activations, but also a sparse computational graph connecting them. So how do we get them? A 🧵
6 replies · 24 retweets · 256 likes · 24K views
Lucy Farnik retweeted
Erika Lee @erikalee
"I'm at my limit" emotional or claude?
337 replies · 2.8K retweets · 19.3K likes · 467.2K views
Lucy Farnik @lucyfarnik
@So8res “X is a problem but Y is worse and I don’t know how to address X without exacerbating Y” seems like a coherent position?
0 replies · 0 retweets · 0 likes · 31 views
Nate Soares ⏹️ @So8res
AI execs when talking about the danger vs the exact same AI execs talking about how we should respond:
10 replies · 56 retweets · 314 likes · 21.9K views
Lucy Farnik @lucyfarnik
@zudasworld @ESYudkowsky I'm confused, if an LLM thinks that you're wrong about something, do you want it to push back or do you want it to be sycophantic? I want the former, I explicitly have that in my system prompt, and Claude 4.5 has been much better than most models at following that instruction.
1 reply · 0 retweets · 8 likes · 272 views
Lucy Farnik retweeted
Larry the Cat @Number10cat
The most important news from today's reshuffle:
331 replies · 2.9K retweets · 31.2K likes · 646.4K views