Michael Chen

444 posts

Michael Chen banner
Michael Chen

Michael Chen

@miclchen

AI policy @METR_Evals, @aigioxford, prev @stripe Tweets don't represent my org

Berkeley Katılım Kasım 2012
736 Takip Edilen900 Takipçiler
Andrew Tran
Andrew Tran@Andrxwtran·
@miclchen hmm, i’m inspired to look into this. Will follow up! 🤩
English
1
0
1
9
Michael Chen
Michael Chen@miclchen·
is there an accessible AI agent that lets me use my computer with my eyes fully closed?
English
4
0
9
1K
Andrew Tran
Andrew Tran@Andrxwtran·
@miclchen At what level of autonomy? (e.g. a specific site you choose or full end to end web surfing)
English
1
0
0
25
Michael Chen
Michael Chen@miclchen·
@ZiChengCaoHuang nsaphra.net/post/hands/ yeah? This is from 2019 but I feel like there should be something higher level and more AI-powered, like Claude Cowork feels like it should be so close to the type of thing that could be totally hands-free and eyes-closed.
English
1
0
1
25
Future Adam Curtis B-Roll
Future Adam Curtis B-Roll@adamcurtisbroll·
A malfunctioning service robot dances uncontrollably at a Haidilao hotpot restaurant in San Jose, California, knocking over tableware as staff members attempt to restrain it, March 2026.
English
183
726
8.3K
1.9M
Michael Chen
Michael Chen@miclchen·
@ChrisPainterYup Happy birthday! Wow you're like the same age as my brother (give or take a few days)
English
0
0
0
31
Chris Painter
Chris Painter@ChrisPainterYup·
Today’s my 30th birthday! 🥳
English
28
0
167
5K
Arun Demeure
Arun Demeure@ArunDemeure·
In these troubling times, I am grateful to be a p-zombie.
English
2
0
1
83
Michael Chen retweetledi
METR
METR@METR_Evals·
After Anthropic published their Sabotage Risk Report for Claude Opus 4.6, they shared an unredacted version with METR for us to review. We agree with their conclusion that catastrophic sabotage risk from Opus 4.6 is very low but not negligible, in light of the available evidence.
Anthropic@AnthropicAI

When we released Claude Opus 4.5, we knew future models would be close to our AI Safety Level 4 threshold for autonomous AI R&D. We therefore committed to writing sabotage risk reports for future frontier models. Today we’re delivering on that commitment for Claude Opus 4.6.

English
5
26
389
45.2K
Michael Chen retweetledi
Alexander Long
Alexander Long@AlexanderLong·
insane sequence of statements buried in an Alibaba tech report
Alexander Long tweet media
English
230
946
6.9K
2.8M
Usman Anwar
Usman Anwar@usmananwar391·
@miclchen This reads like narrative pushing and wrong to me. Scaling up things needed algorithmic improvement.. e.g. mu-p / hyperparameter transfer kind of stuff, same for RL scaling.
English
1
0
1
24
Michael Chen
Michael Chen@miclchen·
Claude has basically automated the art of writing Wikipedia articles. Turns a multi-hour project into a 10-minute task. Just ask it to: - Output wikitext that you can copy-paste into Wikipedia - Follow Wikipedia's manual of style, policies on verifiability and notability, etc. - Double-check that all claims are directly supported by sources - Avoid close paraphrasing of sources - Check that all wikilinks are live - Avoid the stylistic features of "Wikipedia:Signs of AI writing" Check the output, ask Claude for revisions, and disclose LLM usage in your edit summary, and that's it!
English
0
2
13
1K
Michael Chen
Michael Chen@miclchen·
ok I'm turning this pronouns field back off, my fault for not previewing
Michael Chen tweet mediaMichael Chen tweet media
English
0
0
5
1.1K
Michael Chen
Michael Chen@miclchen·
(this visual refresh was part of METR's site redesign from earlier this month)
English
0
0
2
171
Michael Chen
Michael Chen@miclchen·
metr.org/fsp has a fresh new layout and includes links to past versions of safety policies too!
Michael Chen tweet media
English
1
1
43
2.2K
Michael Chen
Michael Chen@miclchen·
@MichaelTontchev If Chinese AI models were "catching up" *independent* of US AI model capabilities, that's a very different dynamic than if they're just distilling
English
0
0
0
17
Michael Tontchev
Michael Tontchev@MichaelTontchev·
@miclchen Why is this exaggeration? Capabilities are what they are - just the method they got them is different, no?
English
1
0
0
21
Michael Chen
Michael Chen@miclchen·
the reports of the US–China gap in AI capabilities closing were an exaggeration
Michael Chen tweet media
English
2
3
16
1.1K
Michael Chen
Michael Chen@miclchen·
Overall, I think the following elements are useful for a frontier safety policy: (1) Pragmatic and ambitious safety goals (2) Candid transparency about risk, including from internal models (3) Advocating for universal AI safety regulation that mitigates risk acceptably
English
1
0
9
507
Michael Chen
Michael Chen@miclchen·
Here’s what’s new in Anthropic’s Responsible Scaling Policy, version 3: - No more implication of unilateral commitment to pause AI development and deployment in relevant conditions - Public roadmaps of safety and security goals - Risk reports and third-party review - Advocacy of industry-wide safety standards based on safety cases
Michael Chen tweet media
English
4
10
98
16.3K