Chris McCoy

1.9K posts

Chris McCoy

@TheRealMcCoy

Inventing @thestorecloud (☁️ AWS for Democracy). Thinking at Data4America (🇺🇸 Policy). Formerly @chrisamccoy. Just trying to make a dent. Optimist.

가입일 Temmuz 2009

1.1K 팔로잉348 팔로워

고정된 트윗

Chris McCoy@TheRealMcCoy·1 May

I'm back.

English

4.5K

Chris McCoy 리트윗함

Podcast Notes 🗒️@podcastnotes·6h

A Navy SEAL reads it. A Fortune 500 CEO reads it. A chess grandmaster reads it. None of them play tennis. The Inner Game of Tennis by W. Timothy Gallwey is the best performance psychology book ever written. 10 quotes that prove it:

English

573

65.8K

Chris McCoy@TheRealMcCoy·11h

He's right but it's the wrong way to think.

Watcher.Guru@WatcherGuru

JUST IN: Elon Musk says universal high income from the Federal government "is the best way to deal with unemployment caused by AI." "AI/robotics will produce goods & services far in excess of the increase in the money supply, so there will not be inflation."

English

Chris McCoy@TheRealMcCoy·15h

@sriramk @kevinakwok We need China's AI infrastructure to become architecturally dependent on US second and third generation chips by 2028/2029

English

Sriram Krishnan@sriramk·16h

@kevinakwok Used the same phrase today multiple times.

Sriram Krishnan@sriramk

Every person here's reaction to the Jensen + @dwarkesh_sp podcast can be extrapolated *directly* from whether they believe in the frontier labs achieving short timelines for AGI/ASI. If you believe in the labs achieving RSI and then AGI/ASI (for some definition of all three) in the next few years, you'll probably sympathetic to the frame @dwarkesh_sp adopts. If not, you're probably more sympathetic to the arguments from Jensen.

English

7.6K

Kevin Kwok@kevinakwok·17h

Jensen Dwarkesh podcast was a true scissor statement Haven't watched it yet but so funny how everyone I know agrees one of them was so good and the other so bad. Just can't agree on who

English

9.2K

Chris McCoy@TheRealMcCoy·16h

@beffjezos We need China's AI infrastructure to become architecturally dependent on US second and third generation chips by 2028/2029

English

101

Beff (e/acc)@beffjezos·1d

Jensen is talking his book (securing the bag for GPU sales to China). Dwarkesh is talking his roommate's book (Anthropic MTS that is tired of Chinese model distillation) The outcome is a bit painful to watch

Dwarkesh Patel@dwarkesh_sp

The Jensen Huang episode. 0:00:00 – Is Nvidia’s biggest moat its grip on scarce supply chains? 0:16:25 – Will TPUs break Nvidia’s hold on AI compute? 0:41:06 – Why doesn’t Nvidia become a hyperscaler? 0:57:36 – Should we be selling AI chips to China? 1:35:06 – Why doesn’t Nvidia make multiple different chip architectures? Look up Dwarkesh Podcast on YouTube, Apple Podcasts, Spotify, etc. Enjoy!

English

478

182.6K

Chris McCoy@TheRealMcCoy·17h

@GavinSBaker We need China's AI infrastructure to become architecturally dependent on US second and third generation chips by 2028/2029

English

Gavin Baker@GavinSBaker·20h

More thoughts on the Dwarkesh/Jensen discussion around export controls. Strongly believe that selling specific GPUs to China is in our national security interest and is a good policy for America. I think it is super important for us a country to get this right.

Gavin Baker@GavinSBaker

Much of Dwarkesh's argument hinges on this statment which *was* accurate but will be increasingly inaccurate on a go forward basis imo: “American labs port across accelerators constantly. Anthropic's models are run on GPUs, they're run on Trainium, they're run on TPUs. There are so many things you can do, from distilling to a model that's well fit for your chips.” As system level architectures diverge (torus vs. switched scale-up topologies, memory hierarchies, networking primitives), true portability is eroding. The Mi300 and Mi325 had roughly the same scale-up domain size as Hopper while Blackwell’s scale-up domain is 9x larger than the Mi355 scale-up domain, etc. Many frontier models are now being explicitly co-designed for inference on specific hardware like GB300 racks. Codex on Cerebras is another example. Those models run less efficiently on other systems and the performance differentials will only widen. A model that runs well on Google’s torus topology will run less efficiently on Nvidia’s switched scale-up topology and vice versa - the data traffic is fundamentally different as a byproduct of the models being parallelized across the different topologies. Google’s internal teams - and increasingly the Anthropic teams as they become the most important customer of almost every cloud - have the luxury of operating across the stack (models, chips, networking) - but that is not the case for the rest of the market and other prospective users. Anthropic is the exception, not the rule. To wit, Anthropic and Google allegedly have a mutual understanding where Anthropic can hire the TPU engineers they need every year to ensure that they can continue to get the most out of the TPU. Given the overwhelming importance of cost per token to the economics of the labs, models will be run where they run best. Most extremely large MoE models will run best on GB300s given the importance of having a switched scale-up network like NVLink for MoE inference. When training was the dominant cost for labs and power was broadly available, labs were optimizing to minimize capex dollars. Model portability was a way to create leverage over suppliers. I think that drove a lot of the focus on portability. Today, inference costs as measured by tokens per watt per dollar are everything. Inference is way more important than training costs (inference is effectively now part of training via RL). Labs are therefore now optimizing for inference. This means increasing co-design and higher go-forward switching costs for individual models between systems. I do think this explains why Anthropic and Nvidia came together: Anthropic needed Blackwells and Rubins to inference at least *some* of their models economically. And Mythos might just end up being released coincident with the availability of Rubins for inference. TLDR: as labs shift their focus from training to inference, the costs of portability and the upside of co-design to maximize tokens per watt per dollar both rise. Portability is likely to begin decreasing as a result. I think what I might have respectfully added to Jensen’s answer is that systems evolve under local selective pressures. The evolutionary pressure in America is a shortage of watts so it makes sense for Nvidia to optimize, as an American company, for power efficiency and tokens per watt and stay on copper as long as possible. China has a surfeit of watts. Chinese AI systems are already taking advantage of this with the Huawei Cloudmatrix 384 and Atlas SuperPoD having an optical scale-up domain that is much larger than anything offered by Nvidia today at the cost of *much* higher power consumption and much lower tokens per watt. The networking primitives for this Huawei system are very different than those for Nvidia’s systems and a model that runs well on Nvidia will not run well on that system and vice versa. This means that if a Chinese ecosystem gets momentum, Chinese models might stop running well on American hardware. And when Chinese models run best on American hardware, America is in a better position as this gives America a degree of leverage and control over Chinese AI that it risks losing to an all-Chinese alternative ecosystem. This architectural fork makes porting and distillation less effective and strengthens the pro-American national security case for selling China deprecated GPUs imo. Also I will attest that I did not wake up a loser this morning.

English

513

81.9K

Chris McCoy@TheRealMcCoy·17h

@AlecStapp @Noahpinion Sell them Intel's chips. Get em hooked by 2028-2029. Force a trade on their mineral supply in exchange for H100s. We all win.

English

104

Alec Stapp@AlecStapp·19h

Letting NVIDIA sell H200 chips to China is even worse that it looks on first glance. Given inelastic supply conditions, the critical inputs for producing H200s (such as high-bandwidth memory) could have been used to produce even more powerful chips for US customers. So our labs and hyperscalers lose out on even more compute than China gains.

Dwarkesh Patel@dwarkesh_sp

English

506

44.1K

Chris McCoy 리트윗함

Samuel Hammond 🦉@hamandcheese·23h

It is remarkable how long Jensen has gone without taking an even mildly adversarial interview.

Alex Imas@alexolegimas

Jensen has been doing what seems like a 24/7 interview cycle for months, and the number one question from the beginning should have been this exact exchange. I don't know if it's the decline of old media---where journalists are just not pushing and asking questions in the same "investigative" style that they used to---or something else. But I'm glad we have @dwarkesh_sp to do the research and shine the light.

English

283

42K

Chris McCoy@TheRealMcCoy·20h

@hamandcheese good stuff

English

113

Samuel Hammond 🦉@hamandcheese·22h

I'm on week 4 of retatrutide and already down 12.4 lbs. More interestingly: - I seem to have much more energy and focus - I find myself spontaneously preferring standing desks, walking rather than ubering, opting into exercise etc. - My blood sugar no longer crashes after eating - I don't drink nearly as much but when I do my hangovers seem a lot weaker - My food preferences spontaneously shifted in favor of fish, salads and fresh fruit - My GI health improved a lot, I think mostly thanks to it being easier to resist trigger foods I've had no negative symptoms or downsides, and am still on a low starter dose (~2.5mg)

English

463

42.3K

Chris McCoy 리트윗함

Semafor@semafor·1d

Token demand makes an AI bubble unlikely, says Michael Dell, CEO of Dell semafor.com/article/04/15/…

English

2.8K

Chris McCoy 리트윗함

Daniel@growing_daniel·2d

This dude is not going to be governor of California. This is too dumb

Tom Steyer@TomSteyer

x.com/i/article/2044…

English

131

1.6K

78.6K

Chris McCoy 리트윗함

RYAN SΞAN ADAMS - rsa.eth 🦄@RyanSAdams·1d

AI KYC is here. New claude subscribers asked for gov ID & photo. Not even a regulatory requirement - Anthropic just doing it because they want to. But regulatory is coming Next up will be laws: No AI without gov-issued ID All AI use tracked to individual - no private AI

English

215

194

1.1K

149.8K

Chris McCoy 리트윗함

Andrew Curran@AndrewCurran_·2d

The DNC has barred staffers from using Chat and Claude. The only approved DNC model is Gemini.

English

281

23.1K

Chris McCoy@TheRealMcCoy·1d

@sundeep Yes. And you can price it into 100mm micro-units.

English

sunny madra@sundeep·1d

Why Cost per Token Is the Only Metric That Matters blogs.nvidia.com/blog/lowest-to…

English

3.9K

Chris McCoy@TheRealMcCoy·1d

👀 that's a first cc: @cyantist

English

Chris McCoy 리트윗함

Neeraj K. Agrawal@NeerajKA·3d

It’s got to feel so bad to drop a satoshi exposé only to have no one believe or care about it

English

145

9.4K

Chris McCoy@TheRealMcCoy·2d

@Noahpinion Indeed. It can be governed by the same math of the US Constitution where innovation triumphs under a +2/3 human in the loop.

English

140

Noah Smith 🐇🇺🇸🇺🇦🇹🇼@Noahpinion·2d

It's a great point

Acyn@Acyn

Khanna: They’re saying this technology is going to be bigger than nuclear, bigger than electricity, bigger than aviation. Last I checked, we have an FAA for aviation, nuclear energy is regulated, and electricity is regulated. So you’re telling me on one hand that this is going to be bigger, and then you’re saying you don’t want any regulation. I mean, it makes no sense.

English

344

54.2K

Chris McCoy@TheRealMcCoy·2d

This is wrong.

Nick shirley@nickshirleyy

California is trying to pass a bill that would criminalize investigative journalism with misdemeanors, $10,000 fines, imprisonment, and content takedown. The proposed bill is titled AB 2624 and was made after I exposed mass fraud by immigrant groups in America. Under AB 2624, government-funded entities like the Somali “Learing” Daycare centers would be protected from being exposed if they operated inside California. The enemy truly is within. When our politicians would rather protect fraudsters and illegal migrants, it’s time for us to stand up or face mass oppression from the traitors who “rule” over us.

English