
Drake Thomas
3.2K posts

@MaskedTorah
Pretraining and misc safety/mission/governance dilettante at Anthropic; math; puzzles; spaced repetition. Writes with too many caveats for Twitter.



I endorse the top-level post in this thread. The Anthropic RSP changes are an attempt to work out what kinds of firm commitments have the most leverage in an environment that's less promising than we'd expected for policy and coordination.








Paul Ehrlich was utterly wrong, but his hideous ideas caused enormous damage worldwide that is being felt to this day. Yudkowsky is also utterly wrong, but his ideas may cause cultural and political damage that continues for many years to come.






Anthropic's updated Risk Report mentions internal models, stating that in terms of capabilities, 'Varies, but none currently have capabilities beyond that of Claude Opus 4.6.'



People hate the tone of this piece, but my view is you don't need a journalist to tell you wrong things are wrong. (She does also call her thieving friends nihilists.) It's weird to be surrounded by thieves though -- if people I know steal from Whole Foods, they don't admit it.













