Jorio Cocola

2 posts

Jorio Cocola

Jorio Cocola

@JorioCocola

Katılım Aralık 2025
46 Takip Edilen19 Takipçiler
Jorio Cocola retweetledi
Harry Mayne
Harry Mayne@HarryMayne5·
New paper. A Positive Case for Faithfulness. When asked to explain their decisions, LLMs can give highly plausible self-explanations. But are these explanations actually faithful, or are they just post-hoc rationalizations? We measure faithfulness via simulatability.
Harry Mayne tweet media
English
2
12
52
2.5K