Max Kan

29 posts

Max Kan

@maxkan_

primarily for consumption purposes

Katılım Mayıs 2020

276 Takip Edilen35 Takipçiler

Max Kan@maxkan_·3d

The complete 180 in Groq sentiment is crazy. Seems everyone made the mistake of evaluating accelerator startups as general purpose inference chips, when in reality, they only need to do one specific workflow really well (e.g. high interactivity FFN decode) to be extremely valuable

English

Max Kan@maxkan_·3d

Did Nvidia just silently kill Rubin CPX at GTC? Is it because GDDR7 is too expensive now?

English

Max Kan@maxkan_·13 Mar

@dwarkesh_sp @dylan522p @dylan522p if H100s are worth $2.40/hr today and newer chips are potentially over 100x better for frontier workloads (FP4, MoE, long context, etc) per InferenceX, why isn't Nvidia raising the price of Rubin?

English

912

Dwarkesh Patel@dwarkesh_sp·13 Mar

.@dylan522p gives a deep dive on the 3 big bottlenecks to scaling AI compute: logic, memory, and power. And walks through the economics of labs, hyperscalers, foundries, and fab equipment manufacturers. Learned a ton about every single level of the stack. 0:00:00 – Why an H100 is worth more today than 3 years ago 0:24:52 – Nvidia secured TSMC allocation early; Google is getting squeezed 0:34:34 – ASML will be the #1 constraint for AI compute scaling by 2030 0:56:06 – Can’t we just use TSMC’s older fabs? 1:05:56 – When will China outscale the West in semis? 1:16:20 – The enormous incoming memory crunch 1:42:53 – Scaling power in the US will not be a problem 1:55:03 – Space GPUs aren't happening this decade 2:14:26 – Why aren’t more hedge funds making the AGI trade? 2:18:49 – Will TSMC kick Apple out from N2? 2:24:35 – Robots and Taiwan risk Look up Dwarkesh Podcast on YouTube, Apple Podcasts, or Spotify. Enjoy!

English

182

1.7K

472.7K

Max Kan@maxkan_·12 Mar

@TheZvi Counterexample: Jake Paul

English

155

Zvi Mowshowitz@TheZvi·12 Mar

The reason you can just do things is most of them don't involve inviting Mike Tyson to punch you in the face.

Nate Silver@NateSilver538

"You can just do things" is basically the same heuristic as "Everyone has a plan until they get punched in the face" except for people who haven't been punched in the face yet.

English

192

10.2K

Max Kan@maxkan_·6 Oca

Kinda ironic how Dario is the most scale-pilled, yet Anthropic is the most conservative when it comes to buying compute

English

Max Kan@maxkan_·24 Ara

This would be a gigantic strategic blunder by OpenAI

English

Max Kan@maxkan_·20 Ara

Jake Paul is a testament to how far self belief can get you. Honestly feels like this fight went perfectly to plan. Charmin ultra soft knockout

English

475

Max Kan@maxkan_·17 Ara

ChatGPT app is borderline unusable with how slow 5.2 thinking is + the fact that the output just doesn’t render 90% of the time if you switch apps while it’s thinking and then later come back

English

216

Max Kan@maxkan_·20 Kas

Kinda funny how a much larger percentage of SF has TSA pre than NY

English

Max Kan@maxkan_·20 Kas

How does 100% utilization for A100s make sense? Aren’t Blackwells supposed to make Hopper uneconomical from a tokens per watt/dollar perspective?

English

Max Kan@maxkan_·18 Kas

I guess choosing to forgo their own ASIC gave Anthropic the chance to co-design with Nvidia? Interesting development

English

Max Kan@maxkan_·12 Kas

@HotAisle What are your thoughts on Coreweave reporting that they’re resigning long-term H100 contracts at only a 5% discount right after the original expires? Doesn’t this imply the ~$2 per hour is quite sticky?

English

385

Hot Aisle@HotAisle·10 Kas

Thanks to Mr. Burry, depreciation seems to be the hot new debate topic, so here’s some additional thoughts from someone who buys and runs the hardware: Depreciation ≠ Useful Life. Depreciation is an accounting schedule that accounts for how you're going to pay back the hardware. Depreciation itself says nothing about how long an H100 remains useful. Hardware can run for a decade, the profit window is what’s shrinking faster than ever. You buy $5M of GPUs, depreciate over ~6 years, and that $833K/yr quietly drags down profit whether the hardware is running full tilt or sitting idle. $5M / $30k = ~150 GPUs $2 * 150 * 8760 = $2.6M $2.6M - $833K = $1.8M potential revenue / yr That $2 goes down by $0.50 every time there is a new release or new cloud comes online. This cuts into profits. Factor in idle / free time, OPEX (staff, offices, data centers, insurance, certifications, licenses, etc.) and those margins turn negative very quickly. Side note: You wouldn't believe how many very well funded startups come to me asking for free / discounted compute and then get mad at me and go elsewhere when I say no. WTF! Revenue ≠ Profit. Fact is that clouds are losing money every hour of every day. Depreciation guarantees that. Companies will now try to pull whatever accounting tricks they can in order to remain profitable. Got investment from BigCo to run your cloud? They are likely going to continue propping you up because they don't want to lose face and shake the market up. Contract Lock-In is a Financial Survival Mechanism. Clouds take on CAPEX / OPEX and need 1–3 year customer contracts to justify the loans (DDTL). Contracts prove revenue; they do not prove profit. Quality of contracts has no external oversight. Release Cycles Are Accelerating. NVIDIA and AMD are shipping new generations faster, and state-of-the-art AI requires the newest hardware. The upgrade treadmill is speeding up while margins shrink. There’s More Hardware Than Users. Everyone's buying GPUs and trying to rent them out and we have an ever increasing supply of "old" hardware. Reality is that spare capacity is everywhere. Idle hardware still depreciates. That’s pure loss, compounding daily. You Escape Depreciation by: Lowering costs, and / or raising prices. Neither are happening today because competition and the hardware cadence is endless. This Looks a Lot Like 2007 Subprime. Banks and VCs are funding DDTLs under the assumption that the contracts are solid and that the hardware can be resold if things go sideways. Much like they thought they could trust the lenders to have solid contracts and also their ability to sell the houses. Sure, that works for the big players with rock-solid contracts. For everyone else… this is shaky AF. The Resale Safety Net Is an Illusion. If waves of hardware hit secondary markets, prices collapse (just like houses). Everyone who thought they could “just sell the GPUs later” discovers they’re holding fully-functional hardware nobody can profit from. If other more experienced players couldn’t make it profitable, why would you? I know this comes off a bit heavy. That’s not the point. I’m actually quite optimistic for my own business. I see a path forward. I just don’t want to spell it out yet in public. You’ll have to watch what we build over the long term. Maybe it works, maybe it doesn’t. Time will tell.

English

195

70.6K

Max Kan@maxkan_·18 Eki

Did they nerf GPT 5 Thinking? Why’s it always respond so fast now

English

Max Kan@maxkan_·6 Eki

So OpenAI’s gonna use Nvidia GPUs, AMD GPUs, Google TPUs, and their own ASIC? Wonder if this only makes sense in a world where software is ~free

English

Max Kan@maxkan_·20 May

Did not realize that AI voice for announcing students names at graduation is already fully mainstream lol

English

Max Kan@maxkan_·22 Nis

storage.googleapis.com/deepmind-media…

ZXX

Max Kan@maxkan_·22 Nis

Was never able to fully shake the nagging feeling that RLHF + SFT on CoT data + Mercor’s humans expert evals were once again ignoring the hard-won Bitter Lesson. Who better than Richard Sutton and David Silver to so clearly pinpoint the limitations of these methods and outline a pure RL path to superintelligence. Welcome to the era of experience

English

125

Max Kan@maxkan_·19 Nis

Wasn’t expecting The Power Broker to double as Long Island tourism propaganda, but here we are

English

Keşfet

@dwarkesh_sp @dylan522p @TheZvi @HotAisle @elonmusk @BarackObama @taylorswift13 @cristiano