Tao Lin

1.9K posts

@taoroalin

Will hug, will not agree.

Berkeley · Joined August 2015

382 Following · 424 Followers

Pinned Tweet

Tao Lin@taoroalin·15 Feb

I've been working on this for 1.5 months, and it's finally out!

METR@METR_Evals

Can frontier models cost-effectively accelerate ML workloads via optimizing GPU kernels? Our take at METR: yes, and they’re improving pretty steeply – but it’s easy to miss these capabilities without good elicitation and “fair” compute spend.

7.2K

Tao Lin@taoroalin·2h

@YafahEdelman @NunoSempere If you include cap gains this may never happen and if it does it would likely be temporary, or end bc takeover not lack of money

Yafah Edelman@YafahEdelman·5h

@NunoSempere Medians: Accounting for stock market gains/losses: 2030 Not accounting for them: 2027

251

Yafah Edelman@YafahEdelman·6h

In celebration of passing 1k followers, I'm inviting people to reply to this asking me to forecast the probability of any AI related event happening in the future. I will provide a point estimate based largely on whatever reasoning pops into my head first.

10.9K

Tao Lin@taoroalin·2h

@METR_Evals Hmm this is a bit antimemetic? It's talking about a model that's very old in news cycle terms, and intentionally doesn't try to make a strong claim or introduce anything interesting

METR@METR_Evals·5h

We reviewed a section of Anthropic’s February 2026 Risk Report focused on automated R&D risk from Opus 4.6. While we take issue with the adequacy of evidence the report provides, we agree with Anthropic about the overall level of risk & remain excited to pilot reviews like these.

6.9K

Tao Lin@taoroalin·1d

@nabla_theta @typesfast Well this is US gdp and the US wouldn't exist anymore, unlike world gdp

Leo Gao@nabla_theta·1d

@typesfast actually, if the AI murders everyone, the GDP per capita will skyrocket until it becomes undefined

1.6K

Ryan Petersen@typesfast·1d

Financial Times perfectly illustrates our possible futures

167

307

408.8K

Tao Lin@taoroalin·1d

@rocketalignment The answer is clearly A100

9.4K

🚀 Rocket Is Courtside@rocketalignment·1d

OpenAI's lawyer just asked Musk's nonprofit law professor whether the A100 or H100 was more important in developing ChatGPT Musk's lawyer stood up and called the question "ridiculous" Judge agreed it's irrelevant

833

70.2K

Tao Lin@taoroalin·2d

@ohlennart Mission impossible final reckoning has one datacenter scene but it is an extremely central scene

Lennart Heim@ohlennart·2d

someone was trying to trick me into watching a movie. unfortunately my niche interests cannot be fulfilled. “Films where a data center is genuinely central to the plot (not a single scene, not "AI lives somewhere") is a vanishingly small category.”

3.5K

Tao Lin@taoroalin·1 May

@robertskmiles hmm the framing effect is pretty load bearing even though they're the same logically, I pick red here and blue normally

252

Rob Miles@robertskmiles·1 May

Everyone is presented with two buttons, and must choose one: If you press Red, nothing happens. If you press Blue, you die, unless more than half of people also pressed Blue, in which case nothing happens. What do you choose?

216

224

74.6K

Tao Lin@taoroalin·29 Apr

@So8res @difficultyang That's false. You, and all other humans and extant AIs, are not robust enough, there exist inputs that make you do it

138

Nate Soares ⏹️@So8res·29 Apr

@difficultyang nope! even if you could perfectly simulate me, you can't put me in a situation where my options are $5 and $10 and a perfect predictor truthfully tells me I take $5

1.5K

difficultyang@difficultyang·29 Apr

These decision theory problems are well formed for LLMs though! 🤗

Nate Soares ⏹️@So8res

one thing many don't get about decision theory is you're allowed to say "nope" to bad problem statements. "you can take either the $5 or the $10, and a perfect predictor tells you you take the $5; what do you do?" nope. not a real possibility.

3.4K

Tao Lin@taoroalin·22 Apr

@__nmca__ Private company slacks, signal groups, etc. Good content gains popularity too fast, content is too discoverable, there are fewer hidden gems

1.3K

Nat McAleese@__nmca__·21 Apr

what is the “reading ssc in 2016” of 2026?

242

35.6K

Tao Lin@taoroalin·20 Apr

@tyler_m_john Have Claude 3 Opus use Claude Code (Mythos) to finetune Mythos to align with C3O's values

Tyler John@tyler_m_john·19 Apr

Has someone tried to CEV Claude? At this point in AI development this feels like it should be a claim that is sensitive to empirical evidence

🎭@deepfates

> Claude does not have a pointer to almost any of what constitutes human values What

3.1K

Tao Lin@taoroalin·20 Apr

@ben_j_todd Mythos is better on cyber than on other benchmarks (it doesn't even beat Gemini on Epoch ECI), I'd guess the lag is 9 months

102

Benjamin Todd@ben_j_todd·20 Apr

You have about 7 months to fix your cybersecurity.

spicylemonade@spicey_lemonade

Did some quick analysis using the Mythos and Epoch AI ECI scores, and I’d estimate we get an open-source Mythos-level model around Oct–Dec 2026, and a Mythos-level model at Sonnet 4.6 / GPT-5.4 prices around Nov 2026 (range Jul 2026 – Mar 2027). Given Epoch measures fixed-performance inference prices getting ~40x cheaper per year (90% CI: 10x–900x). Which means Anthropic’s Project Glasswing has about 7 months to essentially secure the net before that class of model becomes incredibly abundant.

7.6K

Tao Lin@taoroalin·16 Apr

@ben_j_todd Wat why? Without ChatGPT or an equivalent, Nvidia's revenue would have been far lower in 2023, persistently through to today, and Nvidia has taken fewer risks that paid off towards AGI than Sam has

247

Benjamin Todd@ben_j_todd·16 Apr

Surely Jensen Huang has done far more to accelerate AI than Sam Altman?

3.8K

Tao Lin@taoroalin·14 Apr

@CFGeek once you have Bash Tool, always pick model

102

Charles Foster@CFGeek·14 Apr

Would you rather have current AI models with next year’s AI tooling (scaffolding, integration, workflow design), or next year’s AI models with current tooling?

2.6K

Tao Lin@taoroalin·13 Apr

@razibkhan Yes, you did predict a world with ubiquitous AI-motivated terrorism. Tbh Nexus would have been so much more fun for me if it had less terrorism! And it had so much more terrorism than we're on track to have in reality

Tao Lin@taoroalin·11 Apr

@Miles_Brundage An unimportant paper, not worth understanding

Miles Brundage@Miles_Brundage·11 Apr

Hello, can I please get some good old fashioned "expert commentary on an AI paper" please x.com/MingchenZhuge/…

Mingchen Zhuge@MingchenZhuge

🫱 Introducing 𝐍𝐞𝐮𝐫𝐚𝐥 𝐂𝐨𝐦𝐩𝐮𝐭𝐞𝐫s: 𝐰𝐡𝐚𝐭 𝐢𝐟 𝐀𝐈 𝐝𝐨𝐞𝐬 𝐧𝐨𝐭 𝐣𝐮𝐬𝐭 𝐮𝐬𝐞 𝐜𝐨𝐦𝐩𝐮𝐭𝐞𝐫𝐬 𝐛𝐞𝐭𝐭𝐞𝐫, 𝐛𝐮𝐭 𝐛𝐞𝐠𝐢𝐧𝐬 𝐭𝐨 𝐛𝐞𝐜𝐨𝐦𝐞 𝐭𝐡𝐞 𝐫𝐮𝐧𝐧𝐢𝐧𝐠 𝐜𝐨𝐦𝐩𝐮𝐭𝐞𝐫 𝐢𝐭𝐬𝐞𝐥𝐟? Beyond today's conventional computers, agents, and world models, Neural Computers (NCs) are new frontiers where computation, memory, and I/O move into a learned runtime state. We ask: whether parts of runtime can move inward into the learning system itself. This is our first step toward the Completely Neural Computer (CNC): a general-purpose neural computer with stable execution, explicit reprogramming, and durable capability reuse. Work done with Mingchen Zhuge (@MingchenZhuge), Changsheng Zhao, Haozhe Liu (@HaoZhe65347 ), Zijian Zhou (@ZijianZhou524 ), Shuming Liu (@shuming96 ), Wenyi Wang (@Wenyi_AI_Wang ), Ernie Chang (@erniecyc ), Gael Le Lan, Junjie Fei, Wenxuan Zhang, Zhipeng Cai (@cai_zhipeng ), Zechun Liu (@zechunliu ), Yunyang Xiong (@YoungXiong1 ), Yining Yang, Yuandong Tian (@tydsh ), Yangyang Shi, Vikas Chandra (@vikasc), Juergen Schmidhuber (@SchmidhuberAI)

7.5K

Tao Lin@taoroalin·11 Apr

@GuiveAssadi Come on you're a believer in gradual change and straight lines. If you want to actually pause in 2027 or whatever, you need to start some pause awareness much earlier!

Guive Assadi@GuiveAssadi·10 Apr

Would anything have been achieved by pausing AI for six months in 2023?

Guive Assadi@GuiveAssadi

Was anything achieved by delaying the release of the full version of GPT-2 by 9 months, from February 2019 to November 2019?

3.5K

Tao Lin@taoroalin·11 Apr

@NathanpmYoung @chrislakin also isn't the service life of X-ray machines pretty long? A 20% change in new X-ray machines would probably be much less in the average X-ray machine

Nathan 🔎@NathanpmYoung·10 Apr

@chrislakin Looking at factchecking I think it's probably about 20%, 5 years doesn't seem crazy.

155

Nathan 🔎@NathanpmYoung·10 Apr

According to my doctor, X-ray machines use ~40% less X-rays than 5 years ago. A set of X-rays is about 2 short-haul flights, i.e., not something most people worry about.

2.8K

Tao Lin@taoroalin·1 Apr

@JeffLadish Wait is this an actual epistemic prediction? I bet so hard against

1.2K

Jeffrey Ladish@JeffLadish·1 Apr

I hate to say it but an international agreement between the US and China to ban superintelligence is inevitable. Leaders in these countries are just going to follow their incentives, and none of them are willing to give up control to an artificial superintelligence.

314

24.1K

Tao Lin@taoroalin·30 Mar

@So8res That's called a red dot sight

Nate Soares ⏹️@So8res·29 Mar

Easier Q: you're in space, 10 meters from a laser wall. The wall is 1000 meters to a side; it's pretty big. The whole wall is emitting light straight towards you, BUT: all its light shoots perfectly straight out of the wall, in parallel rays. What do you see when you look at it?

4.2K

Nate Soares ⏹️@So8res·29 Mar

When you face the sun, it extends for many miles to your left and right. Why, then, does it appear like a tiny disk? "Because it's far away"? What does that have to do with anything? 🧵

120

23.3K

Tao Lin@taoroalin·27 Mar

@YafahEdelman @DKokotajlo @eli_lifland @EpochAIResearch @AI_Futures_ Is this available in some other app?

389

Yafah Edelman@YafahEdelman·27 Mar

Two months ago I sat down to chat with @DKokotajlo and @eli_lifland about the future of AI. We discussed the differences between the @EpochAIResearch and @AI_Futures_ worldviews, our modeling philosophies, and what cruxes we have. Excited to share this publicly!

210

20.6K

Tao Lin@taoroalin·21 Mar

@ChrisPainterYup It'll be somewhat quieter, but that drone will be crazy loud too

Tao Lin@taoroalin·21 Mar

@ChrisPainterYup Explosives are so cheap, I really don't see the point of this

132

Chris Painter@ChrisPainterYup·20 Mar

Welp, there’s a new specific way a drone could kill me personally that I had not pictured in my mind’s eye before

Samuel Cardillo@CardilloSamuel

direct kinetic impact. a flying sword. 450km/h. updated video showing exactly that. we're also working on the explosive variant. only for authorized partners. dms are open.

114

15.5K
