Michael Chen

444 posts

Michael Chen

@miclchen

AI policy @METR_Evals, @aigioxford, prev @stripe Tweets don't represent my org

Berkeley Katılım Kasım 2012

736 Takip Edilen900 Takipçiler

Michael Chen retweetledi

Charles Foster@CFGeek·4d

Interactive logistic is live! Hope y’all like it

Joel Becker@joel_bkr

this chart bringing to life the inner-workings of time horizon is so cool. from my super-talented colleague @CFGeek.

English

5.2K

Michael Chen@miclchen·2d

@Andrxwtran exciting!

English

Andrew Tran@Andrxwtran·2d

@miclchen hmm, i’m inspired to look into this. Will follow up! 🤩

English

Michael Chen@miclchen·4d

is there an accessible AI agent that lets me use my computer with my eyes fully closed?

English

Michael Chen@miclchen·2d

@Andrxwtran however autonomous AI agents can be!

English

Andrew Tran@Andrxwtran·3d

@miclchen At what level of autonomy? (e.g. a specific site you choose or full end to end web surfing)

English

Michael Chen@miclchen·2d

@pannous apps.apple.com/us/app/ducktor… ooh sadly I do not have an iPhone

English

Pannous@pannous·3d

@miclchen ducktor on app store

English

Michael Chen@miclchen·3d

@ZiChengCaoHuang nsaphra.net/post/hands/ yeah? This is from 2019 but I feel like there should be something higher level and more AI-powered, like Claude Cowork feels like it should be so close to the type of thing that could be totally hands-free and eyes-closed.

English

Michael Chen@miclchen·4d

@adamcurtisbroll just unplug it!

English

922

Future Adam Curtis B-Roll@adamcurtisbroll·4d

A malfunctioning service robot dances uncontrollably at a Haidilao hotpot restaurant in San Jose, California, knocking over tableware as staff members attempt to restrain it, March 2026.

English

183

726

8.3K

1.9M

Michael Chen@miclchen·5d

@ChrisPainterYup Happy birthday! Wow you're like the same age as my brother (give or take a few days)

English

Chris Painter@ChrisPainterYup·6d

Today’s my 30th birthday! 🥳

English

167

Michael Chen@miclchen·5d

@ArunDemeure prove it!

English

Arun Demeure@ArunDemeure·28 Şub

In these troubling times, I am grateful to be a p-zombie.

English

Michael Chen retweetledi

METR@METR_Evals·12 Mar

After Anthropic published their Sabotage Risk Report for Claude Opus 4.6, they shared an unredacted version with METR for us to review. We agree with their conclusion that catastrophic sabotage risk from Opus 4.6 is very low but not negligible, in light of the available evidence.

Anthropic@AnthropicAI

When we released Claude Opus 4.5, we knew future models would be close to our AI Safety Level 4 threshold for autonomous AI R&D. We therefore committed to writing sabotage risk reports for future frontier models. Today we’re delivering on that commitment for Claude Opus 4.6.

English

389

45.2K

Michael Chen retweetledi

Alexander Long@AlexanderLong·6 Mar

insane sequence of statements buried in an Alibaba tech report

English

230

946

6.9K

2.8M

Michael Chen@miclchen·1 Mar

@usmananwar391 this is a good flag, thanks!

English

Usman Anwar@usmananwar391·28 Şub

@miclchen This reads like narrative pushing and wrong to me. Scaling up things needed algorithmic improvement.. e.g. mu-p / hyperparameter transfer kind of stuff, same for RL scaling.

English

Michael Chen@miclchen·26 Şub

ZXX

856

Michael Chen@miclchen·27 Şub

Claude has basically automated the art of writing Wikipedia articles. Turns a multi-hour project into a 10-minute task. Just ask it to: - Output wikitext that you can copy-paste into Wikipedia - Follow Wikipedia's manual of style, policies on verifiability and notability, etc. - Double-check that all claims are directly supported by sources - Avoid close paraphrasing of sources - Check that all wikilinks are live - Avoid the stylistic features of "Wikipedia:Signs of AI writing" Check the output, ask Claude for revisions, and disclose LLM usage in your edit summary, and that's it!

English

Michael Chen@miclchen·26 Şub

ok I'm turning this pronouns field back off, my fault for not previewing

English

1.1K

Michael Chen@miclchen·25 Şub

(this visual refresh was part of METR's site redesign from earlier this month)

English

171

Michael Chen@miclchen·25 Şub

metr.org/fsp has a fresh new layout and includes links to past versions of safety policies too!

English

2.2K

Michael Chen@miclchen·24 Şub

@MichaelTontchev If Chinese AI models were "catching up" *independent* of US AI model capabilities, that's a very different dynamic than if they're just distilling

English

Michael Tontchev@MichaelTontchev·24 Şub

@miclchen Why is this exaggeration? Capabilities are what they are - just the method they got them is different, no?

English

Michael Chen@miclchen·24 Şub

the reports of the US–China gap in AI capabilities closing were an exaggeration

English

1.1K

Michael Chen@miclchen·24 Şub

The update can be read at anthropic.com/responsible-sc…

English

527

Michael Chen@miclchen·24 Şub

Overall, I think the following elements are useful for a frontier safety policy: (1) Pragmatic and ambitious safety goals (2) Candid transparency about risk, including from internal models (3) Advocating for universal AI safety regulation that mitigates risk acceptably

English

507

Michael Chen@miclchen·24 Şub

Here’s what’s new in Anthropic’s Responsible Scaling Policy, version 3: - No more implication of unilateral commitment to pause AI development and deployment in relevant conditions - Public roadmaps of safety and security goals - Risk reports and third-party review - Advocacy of industry-wide safety standards based on safety cases

English

16.3K

Keşfet

@Andrxwtran @pannous @ZiChengCaoHuang @adamcurtisbroll @ChrisPainterYup @ArunDemeure @usmananwar391 @elonmusk