Chris McCoy

1.9K posts


@TheRealMcCoy

Inventing @thestorecloud (☁️ AWS for Democracy). Thinking at Data4America (🇺🇸 Policy). Formerly @chrisamccoy. Just trying to make a dent. Optimist.

Joined July 2009
1.1K Following · 348 Followers
Pinned Tweet
Chris McCoy @TheRealMcCoy
I'm back.
5 replies · 2 reposts · 42 likes · 4.4K views
Chris McCoy @TheRealMcCoy
@sriramk @kevinakwok We need China's AI infrastructure to become architecturally dependent on US second- and third-generation chips by 2028/2029
0 replies · 0 reposts · 0 likes · 31 views
Sriram Krishnan @sriramk
@kevinakwok Used the same phrase today multiple times.
Sriram Krishnan @sriramk

Every person here's reaction to the Jensen + @dwarkesh_sp podcast can be extrapolated *directly* from whether they believe in the frontier labs achieving short timelines for AGI/ASI. If you believe in the labs achieving RSI and then AGI/ASI (for some definition of all three) in the next few years, you'll probably be sympathetic to the frame @dwarkesh_sp adopts. If not, you're probably more sympathetic to the arguments from Jensen.

3 replies · 0 reposts · 16 likes · 7.4K views
Kevin Kwok @kevinakwok
The Jensen/Dwarkesh podcast was a true scissor statement. Haven't watched it yet, but so funny how everyone I know agrees one of them was so good and the other so bad. Just can't agree on who.
9 replies · 2 reposts · 61 likes · 8.8K views
Chris McCoy @TheRealMcCoy
@beffjezos We need China's AI infrastructure to become architecturally dependent on US second- and third-generation chips by 2028/2029
0 replies · 0 reposts · 0 likes · 100 views
Chris McCoy @TheRealMcCoy
@GavinSBaker We need China's AI infrastructure to become architecturally dependent on US second- and third-generation chips by 2028/2029
0 replies · 0 reposts · 0 likes · 59 views
Gavin Baker @GavinSBaker
More thoughts on the Dwarkesh/Jensen discussion around export controls. Strongly believe that selling specific GPUs to China is in our national security interest and is a good policy for America. I think it is super important for us as a country to get this right.
Gavin Baker @GavinSBaker

Much of Dwarkesh's argument hinges on this statement which *was* accurate but will be increasingly inaccurate on a go-forward basis imo: “American labs port across accelerators constantly. Anthropic's models are run on GPUs, they're run on Trainium, they're run on TPUs. There are so many things you can do, from distilling to a model that's well fit for your chips.”

As system-level architectures diverge (torus vs. switched scale-up topologies, memory hierarchies, networking primitives), true portability is eroding. The Mi300 and Mi325 had roughly the same scale-up domain size as Hopper while Blackwell’s scale-up domain is 9x larger than the Mi355 scale-up domain, etc. Many frontier models are now being explicitly co-designed for inference on specific hardware like GB300 racks. Codex on Cerebras is another example. Those models run less efficiently on other systems and the performance differentials will only widen.

A model that runs well on Google’s torus topology will run less efficiently on Nvidia’s switched scale-up topology and vice versa - the data traffic is fundamentally different as a byproduct of the models being parallelized across the different topologies. Google’s internal teams - and increasingly the Anthropic teams as they become the most important customer of almost every cloud - have the luxury of operating across the stack (models, chips, networking) - but that is not the case for the rest of the market and other prospective users. Anthropic is the exception, not the rule. To wit, Anthropic and Google allegedly have a mutual understanding where Anthropic can hire the TPU engineers they need every year to ensure that they can continue to get the most out of the TPU.

Given the overwhelming importance of cost per token to the economics of the labs, models will be run where they run best. Most extremely large MoE models will run best on GB300s given the importance of having a switched scale-up network like NVLink for MoE inference.

When training was the dominant cost for labs and power was broadly available, labs were optimizing to minimize capex dollars. Model portability was a way to create leverage over suppliers. I think that drove a lot of the focus on portability. Today, inference costs as measured by tokens per watt per dollar are everything. Inference is way more important than training costs (inference is effectively now part of training via RL). Labs are therefore now optimizing for inference. This means increasing co-design and higher go-forward switching costs for individual models between systems. I do think this explains why Anthropic and Nvidia came together: Anthropic needed Blackwells and Rubins to inference at least *some* of their models economically. And Mythos might just end up being released coincident with the availability of Rubins for inference.

TLDR: as labs shift their focus from training to inference, the costs of portability and the upside of co-design to maximize tokens per watt per dollar both rise. Portability is likely to begin decreasing as a result.

I think what I might have respectfully added to Jensen’s answer is that systems evolve under local selective pressures. The evolutionary pressure in America is a shortage of watts so it makes sense for Nvidia to optimize, as an American company, for power efficiency and tokens per watt and stay on copper as long as possible. China has a surfeit of watts.
Chinese AI systems are already taking advantage of this with the Huawei Cloudmatrix 384 and Atlas SuperPoD having an optical scale-up domain that is much larger than anything offered by Nvidia today at the cost of *much* higher power consumption and much lower tokens per watt. The networking primitives for this Huawei system are very different than those for Nvidia’s systems and a model that runs well on Nvidia will not run well on that system and vice versa. This means that if a Chinese ecosystem gets momentum, Chinese models might stop running well on American hardware. And when Chinese models run best on American hardware, America is in a better position as this gives America a degree of leverage and control over Chinese AI that it risks losing to an all-Chinese alternative ecosystem.

This architectural fork makes porting and distillation less effective and strengthens the pro-American national security case for selling China deprecated GPUs imo. Also I will attest that I did not wake up a loser this morning.

25 replies · 30 reposts · 498 likes · 79.6K views
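To make the economics in Baker's post concrete, here is a minimal sketch of the tokens-per-watt-per-dollar comparison he describes. Every figure in it is a hypothetical placeholder, not a benchmark of any real system; it only shows how a portability haircut on throughput compounds into worse per-watt and per-dollar economics.

```python
# Toy illustration of the "tokens per watt per dollar" framing.
# All numbers below are invented for the sketch; none is a real
# benchmark for any actual system.

from dataclasses import dataclass

@dataclass
class System:
    name: str
    tokens_per_sec: float    # sustained inference throughput for one model
    watts: float             # rack power draw at that throughput
    dollars_per_hour: float  # amortized capex + opex

    def tokens_per_watt(self) -> float:
        return self.tokens_per_sec / self.watts

    def tokens_per_dollar(self) -> float:
        return self.tokens_per_sec * 3600 / self.dollars_per_hour

systems = [
    # A model co-designed for its target rack runs at full throughput there...
    System("co-designed rack", 1_000_000, 120_000, 400.0),
    # ...and takes a throughput haircut when ported to a mismatched topology.
    System("ported rack", 550_000, 130_000, 380.0),
]

for s in systems:
    print(f"{s.name:>16}: {s.tokens_per_watt():6.2f} tok/W, "
          f"{s.tokens_per_dollar():14,.0f} tok/$")
```

Under these made-up numbers the ported model pays roughly a 2x penalty per watt and a large penalty per dollar, which is the co-design argument in miniature.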
Chris McCoy @TheRealMcCoy
@AlecStapp @Noahpinion Sell them Intel's chips. Get 'em hooked by 2028-2029. Force a trade on their mineral supply in exchange for H100s. We all win.
0 replies · 0 reposts · 0 likes · 98 views
Alec Stapp @AlecStapp
Letting NVIDIA sell H200 chips to China is even worse than it looks at first glance. Given inelastic supply conditions, the critical inputs for producing H200s (such as high-bandwidth memory) could have been used to produce even more powerful chips for US customers. So our labs and hyperscalers lose out on even more compute than China gains.
[image]
Dwarkesh Patel @dwarkesh_sp

The Jensen Huang episode.
0:00:00 – Is Nvidia’s biggest moat its grip on scarce supply chains?
0:16:25 – Will TPUs break Nvidia’s hold on AI compute?
0:41:06 – Why doesn’t Nvidia become a hyperscaler?
0:57:36 – Should we be selling AI chips to China?
1:35:06 – Why doesn’t Nvidia make multiple different chip architectures?
Look up Dwarkesh Podcast on YouTube, Apple Podcasts, Spotify, etc. Enjoy!

26 replies · 78 reposts · 493 likes · 43.1K views
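Stapp's inelastic-supply point reduces to a small opportunity-cost calculation. A sketch follows; the stack counts, chip counts, and compute ratio in it are all invented for illustration, not figures from the post.

```python
# Back-of-the-envelope version of the opportunity-cost argument: under a
# fixed HBM supply, memory that goes into H200s for export cannot go into
# more powerful chips for US customers. All quantities are invented.

stacks_per_h200 = 6           # assumed HBM stacks per H200
stacks_per_newer_chip = 8     # assumed stacks per more powerful chip
newer_chip_compute = 2.5      # assumed compute, in H200-equivalents

h200s_exported = 50_000
stacks_diverted = h200s_exported * stacks_per_h200

# The same memory could instead have shipped as newer chips:
newer_chips_forgone = stacks_diverted // stacks_per_newer_chip

compute_gained_by_china = h200s_exported * 1.0               # H200-equivalents
compute_forgone_by_us = newer_chips_forgone * newer_chip_compute

print(f"China gains ~{compute_gained_by_china:,.0f} H200-equivalents of compute")
print(f"US forgoes  ~{compute_forgone_by_us:,.0f} H200-equivalents of compute")
```

With these placeholder numbers the US side forgoes nearly twice the compute China gains, which is exactly the asymmetry the post asserts.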
Chris McCoy reposted
Samuel Hammond 🦉 @hamandcheese
I'm on week 4 of retatrutide and already down 12.4 lbs. More interestingly:
- I seem to have much more energy and focus
- I find myself spontaneously preferring standing desks, walking rather than ubering, opting into exercise, etc.
- My blood sugar no longer crashes after eating
- I don't drink nearly as much, but when I do my hangovers seem a lot weaker
- My food preferences spontaneously shifted in favor of fish, salads, and fresh fruit
- My GI health improved a lot, I think mostly thanks to it being easier to resist trigger foods
I've had no negative symptoms or downsides, and am still on a low starter dose (~2.5mg)
29 replies · 8 reposts · 435 likes · 39.9K views
Chris McCoy reposted
Semafor @semafor
Token demand makes an AI bubble unlikely, says Michael Dell, CEO of Dell semafor.com/article/04/15/…
3 replies · 7 reposts · 20 likes · 2.8K views
Chris McCoy reposted
RYAN SΞAN ADAMS - rsa.eth 🦄
AI KYC is here. New Claude subscribers asked for gov ID & photo. Not even a regulatory requirement - Anthropic just doing it because they want to. But regulation is coming. Next up will be laws:
No AI without gov-issued ID
All AI use tracked to individual - no private AI
[images]
214 replies · 194 reposts · 1.1K likes · 149.3K views
Chris McCoy reposted
Andrew Curran @AndrewCurran_
The DNC has barred staffers from using Chat and Claude. The only approved DNC model is Gemini.
[image]
24 replies · 21 reposts · 281 likes · 23.1K views
Chris McCoy @TheRealMcCoy
@sundeep Yes. And you can price it into 100mm micro-units.
0 replies · 0 reposts · 0 likes · 23 views
Chris McCoy reposted
Neeraj K. Agrawal @NeerajKA
It’s got to feel so bad to drop a Satoshi exposé only to have no one believe or care about it
17 replies · 3 reposts · 145 likes · 9.4K views
Chris McCoy @TheRealMcCoy
@Noahpinion Indeed. It can be governed by the same math as the US Constitution, where innovation triumphs under a +2/3 human in the loop.
0 replies · 0 reposts · 1 like · 140 views
Chris McCoy @TheRealMcCoy
@AlexFast8 Added in both leagues last week alongside Jose, no idea how he does it
0 replies · 0 reposts · 0 likes · 1.7K views
Alex Fast @AlexFast8
This player with multi-position eligibility is rostered in 3.7% of ESPN leagues and 3% of Yahoo leagues.
[image]
42 replies · 3 reposts · 266 likes · 262.8K views
Chris McCoy reposted
Watcher.Guru @WatcherGuru
JUST IN: 105,000 blocks remaining until the next Bitcoin Halving, officially reaching the halfway point.
[image]
296 replies · 676 reposts · 4.8K likes · 322.8K views
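The halfway claim is simple protocol arithmetic: Bitcoin halves the block subsidy every 210,000 blocks, so 105,000 blocks remaining is exactly the midpoint of the current epoch. A quick sketch of the check; the 10-minute block time is only the nominal protocol target, so the calendar estimate is approximate.

```python
# Halving arithmetic behind the post. The 210,000-block interval and
# ~10-minute target block time are Bitcoin protocol constants; the
# remaining-block count comes from the post itself.

HALVING_INTERVAL = 210_000   # blocks per halving epoch
TARGET_BLOCK_MIN = 10        # nominal minutes per block

blocks_remaining = 105_000   # figure quoted in the post
progress = (HALVING_INTERVAL - blocks_remaining) / HALVING_INTERVAL
print(f"Epoch progress: {progress:.0%}")  # -> 50%

days_remaining = blocks_remaining * TARGET_BLOCK_MIN / (60 * 24)
print(f"~{days_remaining:.0f} days to the next halving at target pace")  # ~729
```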
Chris McCoy reposted
Surajit @surajit_ghosh2
First recovery footage of the Artemis II crew has just been released
948 replies · 10.7K reposts · 141.8K likes · 6M views