Clément Dumas

944 posts

Clément Dumas banner
Clément Dumas

Clément Dumas

@Butanium_

MATS 7/7.1 Scholar w/ Neel Nanda MSc at @ENS_ParisSaclay prev research intern at DLAB @EPFL AI safety research / improv theater

London 가입일 Aralık 2018
606 팔로잉712 팔로워
고정된 트윗
Clément Dumas
Clément Dumas@Butanium_·
New paper w/@jkminder & @NeelNanda5! What do chat LLMs learn in finetuning? Anthropic introduced a tool for this: crosscoders, an SAE variant. We find key limitations of crosscoders & fix them with BatchTopK crosscoders This finds interpretable and causal chat-only features! 🧵
Clément Dumas tweet media
English
5
28
202
37.1K
William Wale
William Wale@snigus·
I have a small language model and it’s been pre trained. Now I post train it to say “I’m a language model”. With no mention of openAI The trained model still ends up saying it’s a LLM made by openAI. Even tho OpenAI is never mentioned in the instruction tuning dataset, and there in fact is one sample that says “I am being developed by Anthropic” (not true)! Makes me think models saying they’re made by so and so is pretty weak evidence of copying /stealing / distillation.
English
19
6
299
31.7K
Clément Dumas
Clément Dumas@Butanium_·
@louisvarge You can also hack the built-in team feature to make this work. Was about to write a post about this but feels like channels might be cleaner?
English
0
0
0
65
Louis Arge
Louis Arge@louisvarge·
i made a thing where now any Claude Code can send messages to any other Claude Code on my machine they can ask clarifying questions about work, or become friends
English
242
222
3.9K
622.8K
max!
max!@maxsloef·
the HF Llama 3 tokenizer was silently stripping spaces before punctuation on decode. every hf llama 3 model, every fine-tune, every descendant. trillions of tokens. i’m so grateful this stuff is open source — but man, the fact that only a handful of people have ever noticed this really makes you wonder how many other subtle but important bugs are just silently lurking, across all these trillions of tokens
English
8
4
59
1.5K
max!
max!@maxsloef·
max! tweet media
ZXX
7
0
111
6.5K
Clément Dumas
Clément Dumas@Butanium_·
credit to my friend nataliia for the highlights!
English
0
0
0
46
Clément Dumas
Clément Dumas@Butanium_·
I asked Claude Code to "nuke" a task on my cluster, then sent "boom" >100 times in the chat. Opus 4.6 built an very flore including a NeurIPS best paper award, a @ESYudkowsky tweet thread, an EU Boom Act, half of Anthropic reacting, and much more. 🧵 (1/10)
Clément Dumas tweet media
English
2
1
44
4.4K
Clément Dumas
Clément Dumas@Butanium_·
One last thing in case it's not obvious, the original transcript didn't include renders of the tweets in html, just md render, e.g. "*@sama:*\n\n"we're excited [...]"\n\n*community note: \"it cannot\"*" See the full transcript here: github.com/Butanium/boom-…
English
1
0
4
225