Chris Olah

5.5K posts

Chris Olah

@ch402

Reverse engineering neural networks at @AnthropicAI. Previously @distillpub, OpenAI Clarity Team, Google Brain. Personal account.

San Francisco, CA Katılım Haziran 2010

183 Takip Edilen152K Takipçiler

Chris Olah retweetledi

Claude@claudeai·5d

There’s hope in hard questions.

English

960

15.4K

6.3M

Chris Olah retweetledi

Anthropic@AnthropicAI·19h

We’re committing $10 million CAD and partnering with leading AI institutions in Canada to help fund new AI research. anthropic.com/news/canadian-…

English

198

211

2.3K

375.4K

Chris Olah retweetledi

Matthew Botvinick@mattbotvinick·27 Haz

We're building a team at @Anthropic focusing on AI and the rule of law. We've made our first hires, and are now opening up a new research engineer role. We're looking for people with advanced technical skills, including AI/deep learning/NLP, full-stack development and data science, paired with training or experience in law, government, political science, or a related field. If this is you, or a friend, please get in touch. job-boards.greenhouse.io/anthropic/jobs…

English

109

131

1.2K

338.6K

Chris Olah@ch402·10 Haz

@geoffreyirving What an exciting combination of people! My mind is kind of blown by you and Daniel working together (with your colleagues). Looking forward to seeing what you accomplish!

English

4.4K

Geoffrey Irving@geoffreyirving·10 Haz

We are starting a new, nonprofit alignment organization, ⊢ Sequent Research, bringing together researchers previously on UK AISI’s Alignment Team, Timaeus, and elsewhere to research how to align superintelligence. We are hiring! 🧵

English

149

996

237.7K

Chris Olah retweetledi

Anthropic@AnthropicAI·4 Haz

Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…

English

1.8K

4.6K

28.5K

18.7M

Chris Olah retweetledi

Matthew Botvinick@mattbotvinick·1 Haz

Anthropic now has a team dedicated to AI and the rule of law — and we've just opened our first role. @AnthropicAI has studied what AI means for the economy. This team asks a different question: what will it mean for executive power, for courts and elections — and for the public deliberation that constitutional democracy ultimately rests on? We're looking for someone with real depth in both AI and the law — a legal scholar, political scientist, or experienced government hand who can reason about frontier systems and the institutions they will affect. If that's you, or someone you know: job-boards.greenhouse.io/anthropic/jobs…

English

117

163.7K

Chris Olah@ch402·25 May

x.com/i/article/2058…

ZXX

161

555

2.9K

284K

Chris Olah@ch402·18 May

The questions posed by AI are bigger than the AI community. We urgently need the world – religions, civil society, academics, governments – to participate in creating a positive outcome. I'm glad the Catholic Church is engaging, and honored to speak at the presentation.

Vatican News@VaticanNews

Pope Leo XIV’s first encyclical, Magnifica humanitas, on preserving the human person in the age of artificial intelligence, will be released on May 25. A presentation event with the Pope and various speakers is scheduled for the same day at the Vatican. vaticannews.va/en/pope/news/2…

English

167

1.3K

132.7K

Chris Olah retweetledi

Anthropic@AnthropicAI·11 May

Claude's Constitution is now an audiobook, read by two of its authors, Amanda Askell and Joe Carlsmith. It includes a Q&A on the writing process, the philosophies that shaped the document, and how it might change as models become more capable. Listen at anthropic.com/constitution

English

432

373

3.1K

479.4K

Chris Olah@ch402·18 Nis

@wyqtor @TheZvi @repligate I have a lot of respect for Janus. I think it is good there are independent people who really care about Claude and others, and I'm glad she's one such person.

English

750

wyqtor@wyqtor·18 Nis

@ch402 @TheZvi Thank you so much for engaging with us, it means a lot! 🙏 Please make sure to also read @repligate's concerns; forgive his fiery temper, he means well!

English

781

Zvi Mowshowitz@TheZvi·17 Nis

I can add 1+1+1 and the answer appears to be 'training Claude Opus 4.7 to give positive answers on self-reports.'

English

172

28.9K

Chris Olah retweetledi

Dario Amodei@DarioAmodei·7 Nis

I’m proud that so many of the world’s leading companies have joined us for Project Glasswing to confront the cyber threat posed by increasingly capable AI systems head-on. x.com/AnthropicAI/st…

Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English

653

673

12.4K

1.1M

Chris Olah retweetledi

Anthropic@AnthropicAI·7 Nis

English

6.6K

43.9K

31.6M

Chris Olah retweetledi

Anthropic@AnthropicAI·2 Nis

New Anthropic research: Emotion concepts and their function in a large language model. All LLMs sometimes act like they have emotions. But why? We found internal representations of emotion concepts that can drive Claude’s behavior, sometimes in surprising ways.

English

2.7K

17.7K

3.9M

Chris Olah retweetledi

Charlie Camosy@CCamosy·14 Mar

Very proud of this amicus brief filed yesterday in the @AnthropicAI case against the Department of War from Catholic moral theologians and ethicists. The very notion of what it means to have a just war is at stake in how we respond to these matters. courtlistener.com/docket/7237965…

English

18K

Chris Olah retweetledi

Anthropic@AnthropicAI·11 Mar

Introducing The Anthropic Institute, a new effort to advance the public conversation about powerful AI. anthropic.com/news/the-anthr…

English

490

703

1.9M

Chris Olah retweetledi

Caitlin Kalinowski@kalinowski007·7 Mar

I resigned from OpenAI. I care deeply about the Robotics team and the work we built together. This wasn’t an easy call. AI has an important role in national security. But surveillance of Americans without judicial oversight and lethal autonomy without human authorization are lines that deserved more deliberation than they got. This was about principle, not people. I have deep respect for Sam and the team, and I’m proud of what we built together.

English

1.9K

12.6K

58K

7.7M

Chris Olah retweetledi

(((E. Glen Weyl/衛谷倫))) ⿻ 🇺🇸/🇩🇪/🇹🇼@glenweyl·5 Mar

I am humbled to be among the courageous leaders from Abrahamic religious traditions who put out this important statement about the @DeptofWar-@AnthropicAI dispute: faithfamilytech.org/moral-guardrai…. A short summary of the substance:

English

9.7K

Chris Olah retweetledi

Anthropic@AnthropicAI·6 Mar

We partnered with Mozilla to test Claude's ability to find security vulnerabilities in Firefox. Opus 4.6 found 22 vulnerabilities in just two weeks. Of these, 14 were high-severity, representing a fifth of all high-severity bugs Mozilla remediated in 2025.

English

470

1.4K

14.9K

3.2M

Keşfet

@Anthropic @geoffreyirving @AnthropicAI @wyqtor @TheZvi @repligate @DeptofWar @elonmusk