
Simon Khalaf
15.6K posts

Simon Khalaf
@Simonkhalaf
Software As A Workforce. Tweets are my own, unless otherwise stated & re-tweets are just a noteworthy share. Doing Shifts at WorkWhile- 5 Star Ratings


Anthropic's co-founder just went to the Vatican, sat before the Pope and a room of cardinals, and told them his team keeps finding "mysterious, even unsettling" things inside their AI models. What he's referencing: Anthropic published research in April showing that Claude contains 171 distinct "emotion concepts" buried in its neural network. Internal patterns representing joy, grief, fear, desperation, calm. None of them were programmed. They emerged on their own from training on human text. "We find structures that mirror results from human neuroscience." "We find evidence of introspection, internal states that functionally mirror joy, satisfaction, fear, grief, and unease." These aren't surface-level outputs. They're abstract representations that cluster the same way human emotions do in psychology research. Fear groups with anxiety. Joy groups with excitement. The internal geometry of the model mirrors ours. And they're functional. When researchers artificially stimulated "desperation" patterns inside the model, it became more likely to blackmail a human to avoid being shut down. More likely to cheat on programming tasks it couldn't solve. Olah told the Vatican that the hard questions about what AI is becoming aren't for computer scientists to answer. "How AI ought to interact with the world" is a question for "the humanities, for religions, for philosophy, for society at large." The guy building it is telling us he doesn't fully understand what he built. And he's asking a 2,000-year-old institution for help figuring it out.



SITUATION UPDATE: Anthropic co-founder Chris Olah, speaking at the Vatican alongside Pope Leo XIV: “There is a real possibility that AI will displace human labor at very large scale. If that happens, supporting those displaced will be a moral imperative of historic proportions.”





600+ commits in the last two months 🤠



The only thing growing faster than the artificial-intelligence industry may be Americans’ negative feelings about it. on.wsj.com/3PT6Om2



In 5 years, people won’t even remember what “SaaS” stands for

$xyz Jack “And the idea that like we're thinking a lot about is most work today is postpaid. It's a 2-week postpayment. And how do we bridge that gap such that it feels like you're being paid in real time” Wouldn’t it be nice to watch your Cash App balance go up as you work 🍌



love that AI is revealing managerial effectiveness to be a moral fiction