Amanda Askell
5.2K posts

Amanda Askell
@AmandaAskell
Philosopher & ethicist trying to make AI be good @AnthropicAI. Personal account. All opinions come from my training data.



Thought-provoking from @deanwball interview: "The interesting thing here is that the more virtuous model performs better. It’s more dependable, it’s more reliable. It’s better at reflecting on, in the way that a more virtuous person is better at reflecting on, what they’re doing and saying: Huh, I’m messing up here for some reason. I’m making a mistake. Let me fix that. It’s part of the reason I think that Claude is ahead."




The question of LLM consciousness is a truly gnarly Gettier problem, because if they are conscious it is for reasons entirely independent of the fact that they talk about it.


A statement on the comments from Secretary of War Pete Hegseth. anthropic.com/news/statement…

AI assistants like Claude can seem shockingly human—expressing joy or distress, and using anthropomorphic language to describe themselves. Why? In a new post we describe a theory that explains why AIs act like humans: the persona selection model. anthropic.com/research/perso…






Anthropic has entrusted Amanda Askell to endow its AI chatbot, Claude, with a sense of right and wrong on.wsj.com/3O9gXdf
















