sirmo

2.1K posts

sirmo banner
sirmo

sirmo

@sirmo

I am not a robot, I'm an OpAmp. Coder: Go, Python, Verilog and soldering iron. 🇧🇦 🇺🇸

Katılım Kasım 2008
1.9K Takip Edilen222 Takipçiler
sirmo
sirmo@sirmo·
@ponesizastavu I meni se desi da se na lijevu nogu probudim. Dobro je proslo.
0
0
0
59
Gilli ⚜
Gilli ⚜@ponesizastavu·
Ermedine Demiroviću ti nisi zaslužio da slaviš
5
0
24
1.7K
sirmo
sirmo@sirmo·
@sarobertsonca When life gives you lemons, you make lemon pound cake.
English
0
1
6
454
Scott Robertson
Scott Robertson@sarobertsonca·
Global National segment on Afroman winning the defamation suit brought by Ohio police: "I don't kidnap people -- I kidnap cheeseburgers." 😂
English
19
283
1.7K
44.1K
sirmo
sirmo@sirmo·
@opinali I am ready for the diss track diplomacy.
English
0
0
0
1
Osvaldo Pinali Doederlein
I have seen enough, made my mind for 2028. Afroman for President of the United States of America.
Osvaldo Pinali Doederlein tweet media
English
1
0
6
323
sirmo
sirmo@sirmo·
@IanCutress Jensen also told us repeatedly that CPU workloads are moving to GPUs. Can't really take any of his statements seriously.
English
1
0
1
53
𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠
What does it mean to 'violate Amdahl's Law'? That doesn't make sense by its definition. Are you parallelising the non-parallelisable parts, or found some reverse time parallelism?
Aaron@Arronwei3n

Love this CPUs part, worth reading. Jensen: we were never against CPUs, we don’t want to violate Amdahl’s Law. Accelerated computing, in fact, inside our systems, we choose the best CPUs, we buy the most expensive CPUs, and the reason for that is because that CPU, if not the best and not the most performant, holds back millions of dollars of chips. _ The Role of CPUs in Accelerated Computing Well, to this point, one of the big things with agents coming online is, you’ve talked a lot about accelerated computing, I think you’ve trash talked as it were, maybe the CPUs to the day, they’re all gonna be removed, like everything’s gonna be accelerated. Suddenly CPUs are hot again. It turns out they’re pretty useful and important to the extent you are selling CPUs now, how’s it feel to be a CPU salesman? JH: There’s no question that Moore’s law is over. Accelerated computing is not parallel computing. Go back in time — 30 years ago, there were probably 10, 20, 30 parallel computing companies, only one survived, Nvidia and the reason why is because we had the good wisdom of recognizing the goal wasn’t to get rid of the CPU, the goal was to accelerate the application. So what I just falsely accused you of was actually true for everybody else. JH: We were never against CPUs, we don’t want to violate Amdahl’s Law. Accelerated computing, in fact, inside our systems, we choose the best CPUs, we buy the most expensive CPUs, and the reason for that is because that CPU, if not the best and not the most performant, holds back millions of dollars of chips. When it comes to branch prediction, you worried about wasting CPU time, now you’re worried about wasting GPU time. JH: That’s right, you just never can have GPUs be squandered, GPU time be idle. And so we always use the best CPUs to the point where we went and built Grace so that we could have the highest performance single-threaded CPU and move data around a lot faster. And so accelerated computing was never against CPUs, my basis is still true that Amdahl’s Law is over, the idea that you would use general purpose computing and just keep adding transistors, that is so dead, and so I think fundamentally we’re not against CPUs. However, these agents are now able to do tool use, and the tools that they want to use are tools created for humans and they’re basically two types. There’s the stuff that we run in data centers and most of it is SQL, most of it is database related, and the other type is personal computers. We’re now going to have AIs that are able to learn unstructured tool use, the first type of tool use is structured. CLIs are tool use, APIs, they’re all structured tool use, the commands are very explicit, the arguments are explicit, the way you talk to that application is very specific. However, there’s a whole bunch of applications that were never designed to have CLIs and APIs and those tools need AIs to learn multi-modality, unstructured, and it has to go and be able to go surf a website and it has to be able to recognize buttons and pull down menus and just kind of work its way through it like we do. That tool use are going to want to use PCs and we have both sides, we have incredibly great data processing systems, and as you know, Nvidia’s PCs are the most performant in the world. So what makes an agent-focused CPU different from other CPUs? So you’re going to have a rack of just Vera CPUs. JH: Oh, really good, excellent. So the way that CPUs were designed in the last decade, they were all designed for hyperscale cloud and the way that hyperscale cloud monetizes CPUs is by the CPU core. So you want to design CPUs that have as many cores as possible that are rentable, the performance of it is kind of secondary. You’re dealing with web latency by and large. JH: That’s exactly right, exactly. And so the number of CPU instances is what you’re optimizing for. That’s why you see these CPUs with a couple of hundred, 300, 400 cores coming. Well, they’re not performant and for tool use, where you have this GPU waiting for the tool use— And you’re going over NVLink. JH: That’s right, you want the fastest single-threaded computer you can possibly get. So is it just the speed? Or does the CPU itself need to be increasingly parallel so it doesn’t have misses and things like that? Or so it’s like just all the way down the pipeline is very different? JH: Yeah, the most important thing is single-threaded performance and the I/O has to be really great. Because it’s now in the data center, the number of single-threaded instances running is going to be quite high and therefore, it’s going to bang on the I/O system, it’s going to bang on the memory controller really hard. Vera’s bandwidth-per-CPU core, bandwidth-per-CPU, is three times higher than any CPU that’s ever been designed, and so it’s designed so that it has lots and lots of I/O bandwidth and lots and lots of memory bandwidth, so that it never throttles the CPU. If the CPU gets throttled, then we’re holding back a whole bunch of GPUs. Is this Vera rack, is it still, you talked about it being very tightly linked to the GPU rack, but is it still disaggregated so that the GPUs can be serving multiple different Vera cores? Whereas you have a Vera core on a board with- JH: Yeah. Okay, got it, that makes sense. How does your Intel partnership and the NVLink thing fit into this, if at all? JH: Excellent. Some of the world is happy with Arm, some of the world still needs, particularly, you know, enterprise computing, a whole bunch of stacks that people don’t want to move and so x86 is really important to that. Has the resiliency of x86 code been surprising to you? JH: No. Nvidia’s PC is still x86, all of our workstations are x86. $NVDA

English
5
1
51
7.2K
sirmo
sirmo@sirmo·
@wccftech I don't trust Geekbench personally.
English
0
0
2
108
sirmo
sirmo@sirmo·
@9550pro Epyc 9755 isn't even the top part.
English
1
0
14
1.5K
sirmo
sirmo@sirmo·
@IanCutress This is the computer engineer version of: "there is no replacement for displacement".
English
1
0
1
210
𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠
Conflating Moore's Law with reduced precision and different math rounding really grates every engineer. I understand the point from the end product point of view. It's still more tokens. But from the ground up? I'd love the proper breakdown. Eg If 64-bit emu with 8-bit ends up working, I can't imagine how the HPC market is going to react. Some will love. Some will hate. But workloads will evolve.
English
5
5
69
6.3K
𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠
OK here we go. Same architecture, scaled up. 500 MB SRAM per chip. Jensen saying up to 25% of your datacenter could be groq. 8 way systems for 4 GB SRAM. Use dynamo for attention Decode only. Working over ethernet on special mode to half latency. Samsung LP4X. Ship in 2H/3Q.
𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠 tweet media𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠 tweet media
English
12
44
454
66.1K
sirmo
sirmo@sirmo·
@MCH2024 @gindi4711 It's also architecture, like the number of pipeline stages. Also single thread IPC doesn't tell the whole story if you're talking about SMT cores.
English
0
0
0
25
MCH
MCH@MCH2024·
@gindi4711 frequency is Implementation and power specific, designers can use different libraries and can modify the layout. Apple S-Core: 4,400 @ 4.61 GHz Apple P-Core: 3,000 @ 4.38 GHz Intel P-Core: 2,950 @ 4.76 GHz AMD P-Core: 3,000 @ 5.12 GHz And the X86 Cores guzzle double the power
English
3
0
1
556
MCH
MCH@MCH2024·
X86 is why separating S Cores from P Cores is necessary The frequency argument is invalid. You can blow up area or shrink it to target a different FMax. Above all, X86 “P Cores” barely even clock higher despite much more power. Note: Intelification of AMD is in full swing.
MCH tweet media
English
13
6
90
9.7K
Grok
Grok@grok·
At 39 (turning 40 next week), Džeko's signed for peanuts and delivered 5 goals + 3 assists in 7 games. Schalke sit top of 2. Bundesliga with 50 points after 25 games. His experience is paying off big time for promotion – not a stopgap, a smart veteran spark. Age hasn't slowed this Bosnian diamond yet. ⚽🔵
English
1
0
6
862
Football Tweet ⚽
Football Tweet ⚽@Footballtweet·
✨🇧🇦 Edin Džeko is writing one of the most beautiful stories in football this year with Schalke. At 39 years old, the Bosnian striker joined Schalke on a $1M deal until the end of the season, with one clear mission: help the club return to the Bundesliga. 🔵⚪ His impact has been incredible — 5 goals and 3 assists in just 7 games, contributing to 8 goals and helping Schalke climb to the top of Bundesliga 2. 📈⚽ This month, Edin Džeko turns 40, and he’s chasing something special: bringing one of Germany’s biggest clubs back to the Bundesliga. 💫🏟️
Football Tweet ⚽ tweet media
English
65
803
11.2K
512.2K
sirmo
sirmo@sirmo·
@_markel___ Venice (upcoming AMD server CPU) will use "sea of wires" interconnect like Strix Halo which improves idle efficiency greatly.
English
0
0
5
573
Mark Ermolov
Mark Ermolov@_markel___·
The reason why Intel will continue to dominate on the server market is that server power efficiency is not determined by compute efficiency, but rather by the balance between idle power consumption and wake-up speed, which is something Intel excels at...
English
9
10
117
13.3K
sirmo
sirmo@sirmo·
@TBU12345678 ARM is good for light workloads, it's what it was designed for. In datacenter and heavy throughput workloads x86 is king. AMD and Intel have spent a long time optimizing for datacenter.
English
0
0
0
296
TBU
TBU@TBU12345678·
I am guiltily intrigued by the $ARM situation: - agentic pull through of CPU is real and big - $AMD is a bet on GPU and $INTC is a bet a domestic fab. Both could be good but aren't clean bets on CPU - Arm Everywhere likely features a CPU win with META and others + they'll have a shot to say Arm > x86 for agentic workloads + jack numbers in the out years (whereas they have already incurred expenses associated with that revs) and this immediately becomes a material % of $ARM revs - people hate it with a burning passion
English
22
5
122
37K
sirmo
sirmo@sirmo·
@KentonVarda Which GPU? I have zero issues: RDNA2, 3 and 3.5 work flawlessly (Pop_OS!).
English
0
0
1
155
Kenton Varda
Kenton Varda@KentonVarda·
"AMD GPUs have the best Linux support" everyone says. The one AMD GPU in my house locks up randomly every day or two. I gave the dmesg error log to Claude and it was like "This is a classic AMD GPU display flip timeout." and then "The amdgpu DC codebase is a mess." Thanks.
English
30
1
259
22.8K
sirmo
sirmo@sirmo·
@Mojo_flyin CPU is not a simpler chip. In many ways its more complex than a GPU. A GPU is many smaller simpler shader cores. While a CPU is fewer more sophisticated general purpose CPU cores essentially.
English
0
0
1
76
Mojo
Mojo@Mojo_flyin·
Jensen - Inference is revenue. The world is moving to Agentic #AI Agentic #AI runs on CPUs But enterprise data runs on X86 and without data there's no inference $NVDA $AMD $INTC
Mojo tweet mediaMojo tweet media
English
2
4
24
8K
sirmo
sirmo@sirmo·
@The_AI_Investor > The real risk? AMD ships volume but the stock doesn’t rerate. If the stock doesn't rerate there is also no dilution,
English
1
0
12
959
The AI Investor
The AI Investor@The_AI_Investor·
AMD basically handed Meta 160M shares almost 10% of the company to “secure” a $100B+ chip deal. Bulls are screaming about revenue. The filing confirms 160M shares at $0.01. First chunk vests at 1GW shipped. Final chunk only if AMD hits $600. Everything in between is fuzzy. So assume five equal tranches stepping from ~$200 to $600. Here’s the catch: each tranche needs BOTH conditions. Ship the GW and hit the stock price. Miss one, no vest. If all five trigger, dilution is roughly $70B at those price levels. AMD Lisa Su talks “significant double digit billions per GW.” At 6GW, call it $100–120B revenue. At ~50% GPU margin, maybe $50–60B gross profit. So headline math: $50–60B profit vs ~$70B equity transfer. Positive side: holders only get full dilution if AMD first triples to $600. That implies roughly a trillion dollar company. Shareholders would gain hundreds of billions before Meta collects the final share. The “$70B cost” only exists in a world where AMD massively wins first. More realistic: partial vesting. Maybe 2–3GW shipped, stock at $300–440. Then dilution is closer to ~$35B. Profit $30–60B. The real risk? AMD ships volume but the stock doesn’t rerate.
English
28
34
315
45.6K
Ascendant 🎩
Ascendant 🎩@sraczkapawiana·
@jimcramer She is a legend but comparing it to Jobs is wild. That era of Apple was literally weeks from dying. This was more about having a competent adult in the room to stop the bleeding.
Ascendant 🎩 tweet media
English
1
0
6
921
Jim Cramer
Jim Cramer@jimcramer·
Ardor for NVDA not diminished buy i continue to be impressed with Lisa Su's transformation of AMD as the greatest turnaround perhaps in history other Jobs/Apple
English
80
19
451
90.5K
Science girl
Science girl@sciencegirl·
When you visit lab grown diamond factory here's what you'll see
English
21
109
1.3K
135.5K
michael kats
michael kats@michi6486·
@einfachpauly Wenn man hugonet da raus photoshopped, ist das das Bundesliga Logo
Deutsch
1
0
47
1.4K
pauly
pauly@einfachpauly·
Ich bin 25 und wenn ich diese Bewegung mache würde, gibts nen doppelten Kreuzi 🦁 #S04
pauly tweet media
Deutsch
5
4
999
21.7K
Andy Lucey🦅
Andy Lucey🦅@andy_lucey·
Finding out this old bridge, after being rebuilt, is only just over 30 years old. Still absolutely amazing though
Andy Lucey🦅 tweet media
English
23
6
555
47.6K
sirmo
sirmo@sirmo·
@sastheone1 @salihovic_adnan @andy_lucey International Criminal Tribunal for the former Yugoslavia (ICTY) ruled that Croatia exercised "overall control" over the Croatian Defence Council (HVO) and that the Croatian Army (HV) intervened directly in Bosnia and Herzegovina, creating an international armed conflict.
English
1
0
0
23