Max Kan

29 posts

Max Kan banner
Max Kan

Max Kan

@maxkan_

primarily for consumption purposes

Katılım Mayıs 2020
276 Takip Edilen35 Takipçiler
Max Kan
Max Kan@maxkan_·
The complete 180 in Groq sentiment is crazy. Seems everyone made the mistake of evaluating accelerator startups as general purpose inference chips, when in reality, they only need to do one specific workflow really well (e.g. high interactivity FFN decode) to be extremely valuable
English
0
0
0
13
Max Kan
Max Kan@maxkan_·
Did Nvidia just silently kill Rubin CPX at GTC? Is it because GDDR7 is too expensive now?
English
0
0
0
38
Max Kan
Max Kan@maxkan_·
@dwarkesh_sp @dylan522p @dylan522p if H100s are worth $2.40/hr today and newer chips are potentially over 100x better for frontier workloads (FP4, MoE, long context, etc) per InferenceX, why isn't Nvidia raising the price of Rubin?
English
0
0
0
912
Dwarkesh Patel
Dwarkesh Patel@dwarkesh_sp·
.@dylan522p gives a deep dive on the 3 big bottlenecks to scaling AI compute: logic, memory, and power. And walks through the economics of labs, hyperscalers, foundries, and fab equipment manufacturers. Learned a ton about every single level of the stack. 0:00:00 – Why an H100 is worth more today than 3 years ago 0:24:52 – Nvidia secured TSMC allocation early; Google is getting squeezed 0:34:34 – ASML will be the #1 constraint for AI compute scaling by 2030 0:56:06 – Can’t we just use TSMC’s older fabs? 1:05:56 – When will China outscale the West in semis? 1:16:20 – The enormous incoming memory crunch 1:42:53 – Scaling power in the US will not be a problem 1:55:03 – Space GPUs aren't happening this decade 2:14:26 – Why aren’t more hedge funds making the AGI trade? 2:18:49 – Will TSMC kick Apple out from N2? 2:24:35 – Robots and Taiwan risk Look up Dwarkesh Podcast on YouTube, Apple Podcasts, or Spotify. Enjoy!
English
81
182
1.7K
472.7K
Max Kan
Max Kan@maxkan_·
@TheZvi Counterexample: Jake Paul
English
1
0
3
155
Max Kan
Max Kan@maxkan_·
Kinda ironic how Dario is the most scale-pilled, yet Anthropic is the most conservative when it comes to buying compute
English
0
0
0
96
Max Kan
Max Kan@maxkan_·
This would be a gigantic strategic blunder by OpenAI
Max Kan tweet media
English
0
0
0
99
Max Kan
Max Kan@maxkan_·
Jake Paul is a testament to how far self belief can get you. Honestly feels like this fight went perfectly to plan. Charmin ultra soft knockout
English
0
0
0
475
Max Kan
Max Kan@maxkan_·
ChatGPT app is borderline unusable with how slow 5.2 thinking is + the fact that the output just doesn’t render 90% of the time if you switch apps while it’s thinking and then later come back
English
0
0
0
216
Max Kan
Max Kan@maxkan_·
Kinda funny how a much larger percentage of SF has TSA pre than NY
English
0
0
0
40
Max Kan
Max Kan@maxkan_·
How does 100% utilization for A100s make sense? Aren’t Blackwells supposed to make Hopper uneconomical from a tokens per watt/dollar perspective?
English
0
0
0
37
Max Kan
Max Kan@maxkan_·
I guess choosing to forgo their own ASIC gave Anthropic the chance to co-design with Nvidia? Interesting development
English
0
0
0
50
Max Kan
Max Kan@maxkan_·
@HotAisle What are your thoughts on Coreweave reporting that they’re resigning long-term H100 contracts at only a 5% discount right after the original expires? Doesn’t this imply the ~$2 per hour is quite sticky?
English
1
0
0
385
Hot Aisle
Hot Aisle@HotAisle·
Thanks to Mr. Burry, depreciation seems to be the hot new debate topic, so here’s some additional thoughts from someone who buys and runs the hardware: Depreciation ≠ Useful Life. Depreciation is an accounting schedule that accounts for how you're going to pay back the hardware. Depreciation itself says nothing about how long an H100 remains useful. Hardware can run for a decade, the profit window is what’s shrinking faster than ever. You buy $5M of GPUs, depreciate over ~6 years, and that $833K/yr quietly drags down profit whether the hardware is running full tilt or sitting idle. $5M / $30k = ~150 GPUs $2 * 150 * 8760 = $2.6M $2.6M - $833K = $1.8M potential revenue / yr That $2 goes down by $0.50 every time there is a new release or new cloud comes online. This cuts into profits. Factor in idle / free time, OPEX (staff, offices, data centers, insurance, certifications, licenses, etc.) and those margins turn negative very quickly. Side note: You wouldn't believe how many very well funded startups come to me asking for free / discounted compute and then get mad at me and go elsewhere when I say no. WTF! Revenue ≠ Profit. Fact is that clouds are losing money every hour of every day. Depreciation guarantees that. Companies will now try to pull whatever accounting tricks they can in order to remain profitable. Got investment from BigCo to run your cloud? They are likely going to continue propping you up because they don't want to lose face and shake the market up. Contract Lock-In is a Financial Survival Mechanism. Clouds take on CAPEX / OPEX and need 1–3 year customer contracts to justify the loans (DDTL). Contracts prove revenue; they do not prove profit. Quality of contracts has no external oversight. Release Cycles Are Accelerating. NVIDIA and AMD are shipping new generations faster, and state-of-the-art AI requires the newest hardware. The upgrade treadmill is speeding up while margins shrink. There’s More Hardware Than Users. Everyone's buying GPUs and trying to rent them out and we have an ever increasing supply of "old" hardware. Reality is that spare capacity is everywhere. Idle hardware still depreciates. That’s pure loss, compounding daily. You Escape Depreciation by: Lowering costs, and / or raising prices. Neither are happening today because competition and the hardware cadence is endless. This Looks a Lot Like 2007 Subprime. Banks and VCs are funding DDTLs under the assumption that the contracts are solid and that the hardware can be resold if things go sideways. Much like they thought they could trust the lenders to have solid contracts and also their ability to sell the houses. Sure, that works for the big players with rock-solid contracts. For everyone else… this is shaky AF. The Resale Safety Net Is an Illusion. If waves of hardware hit secondary markets, prices collapse (just like houses). Everyone who thought they could “just sell the GPUs later” discovers they’re holding fully-functional hardware nobody can profit from. If other more experienced players couldn’t make it profitable, why would you? I know this comes off a bit heavy. That’s not the point. I’m actually quite optimistic for my own business. I see a path forward. I just don’t want to spell it out yet in public. You’ll have to watch what we build over the long term. Maybe it works, maybe it doesn’t. Time will tell.
Hot Aisle tweet media
English
18
20
195
70.6K
Max Kan
Max Kan@maxkan_·
Did they nerf GPT 5 Thinking? Why’s it always respond so fast now
English
0
0
0
44
Max Kan
Max Kan@maxkan_·
So OpenAI’s gonna use Nvidia GPUs, AMD GPUs, Google TPUs, and their own ASIC? Wonder if this only makes sense in a world where software is ~free
English
0
0
0
59
Max Kan
Max Kan@maxkan_·
Did not realize that AI voice for announcing students names at graduation is already fully mainstream lol
English
0
0
1
66
Max Kan
Max Kan@maxkan_·
Was never able to fully shake the nagging feeling that RLHF + SFT on CoT data + Mercor’s humans expert evals were once again ignoring the hard-won Bitter Lesson. Who better than Richard Sutton and David Silver to so clearly pinpoint the limitations of these methods and outline a pure RL path to superintelligence. Welcome to the era of experience
English
1
0
0
125
Max Kan
Max Kan@maxkan_·
Wasn’t expecting The Power Broker to double as Long Island tourism propaganda, but here we are
English
0
0
0
46