Daniel Nishball

66 posts

Daniel Nishball banner
Daniel Nishball

Daniel Nishball

@dnishball

Katılım Mayıs 2009
205 Takip Edilen455 Takipçiler
Daniel Nishball retweetledi
Dylan Patel
Dylan Patel@dylan522p·
The Nvidia Tensor Core is the most important evolution of computer architecture in the last decade We explain why / how it's evolved Shout out to collaborators @bfspector @tri_dao @colfaxintl @charles_irl @ia_buck Neil Movva Jonah Alben esp @simonguozirui for the cutest cover pic
SemiAnalysis@SemiAnalysis_

NVIDIA Tensor Core Evolution From Volta To Blackwell Amdahl’s Law, Strong Scaling Asynchronous Execution Blackwell, Hopper, Ampere, Turing, Volta semianalysis.com/2025/06/23/nvi…

English
8
23
317
50.2K
Daniel Nishball retweetledi
SemiAnalysis
SemiAnalysis@SemiAnalysis_·
SemiAnalysis is hosting an Nvidia Blackwell GPU Hackathon on Sunday March 16th. It is the ultimate playground for Blackwell PTX tech enthusiasts, offering hands-on exploration of Blackwell & PTX infrastructure while collaborating on open-source projects.
English
4
29
251
91.6K
Daniel Nishball retweetledi
Bg2 Pod
Bg2 Pod@BG2Pod·
BG2. AI Compute Landscape ‘25-26, scaling intelligence, chip competition, memory tech, inference time reasoning & more. 👊💥@altcap @bgurley feat @dylan522p
English
49
145
1.2K
470.5K
Daniel Nishball retweetledi
Dylan Patel
Dylan Patel@dylan522p·
Met with @LisaSu today for 1.5 hours as we went through everything She acknowledged the gaps in AMD software stack She took our specific recommendations seriously She asked her team and us a lot of questions Many changes are in flight already! Excited to see improvements coming
Dylan Patel@dylan522p

Our 5-month journey conducting independent analysis & benchmarking of AMD MI300X vs Nvidia H100 + H200 Detailed, open source low-level benchmarks performance vs TCO Comprehensive public recommendations It’s not just immature software, they need to change how they do development

Georgia, USA 🇺🇸 English
84
195
3.5K
596.9K
Daniel Nishball retweetledi
SemiAnalysis
SemiAnalysis@SemiAnalysis_·
MI300X vs H100 vs H200 Benchmark Part 1: Training CUDA Moat Still Alive Our 5 month journey conducting independent analysis + benchmarking User Experience, Usability GEMM + attention InfiniBand vs Spectrum-X vs RoCEv2 Ethernet SHARP Total Cost of Ownership semianalysis.com/2024/12/22/mi3…
English
31
127
888
759.3K
Daniel Nishball
Daniel Nishball@dnishball·
4/5 - Blackwell's impact is another important factor. How many B100+B200s hit the Neocloud market? Will GB200 NVL36/72s be readily available in 2025 given liquid cooling tightness? How will Neocloud Giants set GB200 NVL72 debut rental price?
English
1
0
4
868
Daniel Nishball
Daniel Nishball@dnishball·
1/5 - H100 SXM on-demand rental prices cut by 30% in the last two months. Today, you can rent H100 SXMs on-demand for as low as $2.99 per hour. What's going on?
Daniel Nishball tweet media
English
1
6
40
11.2K
Daniel Nishball retweetledi
SemiAnalysis
SemiAnalysis@SemiAnalysis_·
1/4 - Chevron deference has just been struck down by the US Supreme Court. What is it and how does it affect semiconductor companies? ⬇️ Under that 40-yr-old legal doctrine, US federal agencies had the power to create their own rules & regulations when a law is ambiguous. In our industry, this is particularly relevant for technology export controls – agencies were in the driver’s seat and didn’t have to worry about being challenged in Court. This is now over, and the power has been handed back to the Court system after the Supreme Court’s ruling in Loper Bridge Enterprises v Raimondo.
SemiAnalysis tweet media
English
4
30
151
64.6K
Daniel Nishball
Daniel Nishball@dnishball·
@r_bjerg @FredaDuan Freda's math is correct. H100s can't just run on their own - they need to be in a server with a CPU and various networking chips which all take up energy. Back end networking for GPU clusters - switches, optical transceivers etc also add power requirements.
English
1
0
0
43
M Arden 🇵🇸
M Arden 🇵🇸@don_bjerg·
@FredaDuan One H100 has a peak power usage of 700W=0.7KW=0.0007MW. Therefore 600,000 H100 chips use around 600,000 x 0.0007 = 420MW, so not 1MW as you wrote.
English
2
0
1
417
Freda Duan
Freda Duan@FredaDuan·
The AI-driven energy bottleneck seems real: US data center power usage accounts for 22GW, or 4.5% of the nation's power consumption today. According to SemiAnalysis, it is projected to reach 100GW, or nearly 20% by 2030 due to AI buildout. Interesting data points from the interview: "Right now, the AI data centers are 50MW or 100MW. Then you get into building a data center that's…1GW, but I don't think it (1GW) will happen next year" To put into perspective, 100MW dc = ~60k H100 1MW dc = ~600k H100 Assuming each new generation of the AI model is ~10x more FLOPs (assuming a 3x increase in parameters and a 3x increase in training tokens). This scaling suggests that reaching a 1GW data center capacity may actually occur very soon (within one or two model gen, or 1-2 years). Potential Solutions and Implications: Nuclear Power Co-location: The most straightforward solution. One plant is ~1GW. It still doesn't solve the problem for subsequent model generations (potentially 10x power needs). Gas Plants: Viable option given abundant supply, but the lead time can be 4 years, with transmission capacity being a significant limitation. Rising Electricity Prices: According to Morgan Stanley, for every additional 1GW of load, power prices increase by $4/MWh. For ERCOT North ($45/MWh), a 1GW increase could raise prices by about 9%. Long-term solutions: Distributed cross-datacenter training? Nuclear fusion? Paradigm change in GPU energy efficiency?? h/t SemiAnalysis:
Freda Duan tweet media
Dwarkesh Patel@dwarkesh_sp

"1 GW - that's the size of a meaningful nuclear power plant, only going towards training a model. Over the last few years, there was this issue of GPU production. Now I think that's getting less. But I actually think before we hit that, you're gonna run into energy constraints. When you start getting into building a datacenter that's 300 MW or 500 MW or 1 GW, that's not gonna happen next year."

English
23
47
207
114.3K
Daniel Nishball
Daniel Nishball@dnishball·
@FredaDuan Other solutions could include seeking out stranded renewable resources - wind farms and such, repurposing inefficient old enterprise data centers > 1.6 PUE into AI DCs at 1.3 PUE and below.
English
0
0
0
126