Jonathan Michala

3 posts

Jonathan Michala

Jonathan Michala

@JonathanMi98298

Katılım Ocak 2026
17 Takip Edilen12 Takipçiler
Jonathan Michala retweetledi
Callum Canavan
Callum Canavan@CalCanavan·
Recently Wen et al found an unsupervised elicitation technique with similar performance to fully supervised fine-tuning on small training sets. We wanted to see which aspects make it work, and found some simple methods that get comparable results. 🧵
Callum Canavan tweet media
English
2
5
30
3.9K
Jonathan Michala retweetledi
Anthropic
Anthropic@AnthropicAI·
New Anthropic Fellows research: the Assistant Axis. When you’re talking to a language model, you’re talking to a character the model is playing: the “Assistant.” Who exactly is this Assistant? And what happens when this persona wears off?
Anthropic tweet media
English
319
582
5.2K
1.3M