sirmo

2.1K posts

sirmo

@sirmo

I am not a robot, I'm an OpAmp. Coder: Go, Python, Verilog and soldering iron. 🇧🇦 🇺🇸

Katılım Kasım 2008

1.9K Takip Edilen222 Takipçiler

sirmo@sirmo·1d

@ponesizastavu I meni se desi da se na lijevu nogu probudim. Dobro je proslo.

Gilli ⚜@ponesizastavu·1d

Ermedine Demiroviću ti nisi zaslužio da slaviš

1.7K

sirmo@sirmo·6d

@sarobertsonca When life gives you lemons, you make lemon pound cake.

English

454

Scott Robertson@sarobertsonca·21 Mar

Global National segment on Afroman winning the defamation suit brought by Ohio police: "I don't kidnap people -- I kidnap cheeseburgers." 😂

English

283

1.7K

44.1K

sirmo@sirmo·20 Mar

@opinali I am ready for the diss track diplomacy.

English

Osvaldo Pinali Doederlein@opinali·20 Mar

I have seen enough, made my mind for 2028. Afroman for President of the United States of America.

English

323

sirmo@sirmo·18 Mar

@IanCutress Jensen also told us repeatedly that CPU workloads are moving to GPUs. Can't really take any of his statements seriously.

English

𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠@IanCutress·18 Mar

What does it mean to 'violate Amdahl's Law'? That doesn't make sense by its definition. Are you parallelising the non-parallelisable parts, or found some reverse time parallelism?

Aaron@Arronwei3n

Love this CPUs part, worth reading. Jensen: we were never against CPUs, we don’t want to violate Amdahl’s Law. Accelerated computing, in fact, inside our systems, we choose the best CPUs, we buy the most expensive CPUs, and the reason for that is because that CPU, if not the best and not the most performant, holds back millions of dollars of chips. _ The Role of CPUs in Accelerated Computing Well, to this point, one of the big things with agents coming online is, you’ve talked a lot about accelerated computing, I think you’ve trash talked as it were, maybe the CPUs to the day, they’re all gonna be removed, like everything’s gonna be accelerated. Suddenly CPUs are hot again. It turns out they’re pretty useful and important to the extent you are selling CPUs now, how’s it feel to be a CPU salesman? JH: There’s no question that Moore’s law is over. Accelerated computing is not parallel computing. Go back in time — 30 years ago, there were probably 10, 20, 30 parallel computing companies, only one survived, Nvidia and the reason why is because we had the good wisdom of recognizing the goal wasn’t to get rid of the CPU, the goal was to accelerate the application. So what I just falsely accused you of was actually true for everybody else. JH: We were never against CPUs, we don’t want to violate Amdahl’s Law. Accelerated computing, in fact, inside our systems, we choose the best CPUs, we buy the most expensive CPUs, and the reason for that is because that CPU, if not the best and not the most performant, holds back millions of dollars of chips. When it comes to branch prediction, you worried about wasting CPU time, now you’re worried about wasting GPU time. JH: That’s right, you just never can have GPUs be squandered, GPU time be idle. And so we always use the best CPUs to the point where we went and built Grace so that we could have the highest performance single-threaded CPU and move data around a lot faster. And so accelerated computing was never against CPUs, my basis is still true that Amdahl’s Law is over, the idea that you would use general purpose computing and just keep adding transistors, that is so dead, and so I think fundamentally we’re not against CPUs. However, these agents are now able to do tool use, and the tools that they want to use are tools created for humans and they’re basically two types. There’s the stuff that we run in data centers and most of it is SQL, most of it is database related, and the other type is personal computers. We’re now going to have AIs that are able to learn unstructured tool use, the first type of tool use is structured. CLIs are tool use, APIs, they’re all structured tool use, the commands are very explicit, the arguments are explicit, the way you talk to that application is very specific. However, there’s a whole bunch of applications that were never designed to have CLIs and APIs and those tools need AIs to learn multi-modality, unstructured, and it has to go and be able to go surf a website and it has to be able to recognize buttons and pull down menus and just kind of work its way through it like we do. That tool use are going to want to use PCs and we have both sides, we have incredibly great data processing systems, and as you know, Nvidia’s PCs are the most performant in the world. So what makes an agent-focused CPU different from other CPUs? So you’re going to have a rack of just Vera CPUs. JH: Oh, really good, excellent. So the way that CPUs were designed in the last decade, they were all designed for hyperscale cloud and the way that hyperscale cloud monetizes CPUs is by the CPU core. So you want to design CPUs that have as many cores as possible that are rentable, the performance of it is kind of secondary. You’re dealing with web latency by and large. JH: That’s exactly right, exactly. And so the number of CPU instances is what you’re optimizing for. That’s why you see these CPUs with a couple of hundred, 300, 400 cores coming. Well, they’re not performant and for tool use, where you have this GPU waiting for the tool use— And you’re going over NVLink. JH: That’s right, you want the fastest single-threaded computer you can possibly get. So is it just the speed? Or does the CPU itself need to be increasingly parallel so it doesn’t have misses and things like that? Or so it’s like just all the way down the pipeline is very different? JH: Yeah, the most important thing is single-threaded performance and the I/O has to be really great. Because it’s now in the data center, the number of single-threaded instances running is going to be quite high and therefore, it’s going to bang on the I/O system, it’s going to bang on the memory controller really hard. Vera’s bandwidth-per-CPU core, bandwidth-per-CPU, is three times higher than any CPU that’s ever been designed, and so it’s designed so that it has lots and lots of I/O bandwidth and lots and lots of memory bandwidth, so that it never throttles the CPU. If the CPU gets throttled, then we’re holding back a whole bunch of GPUs. Is this Vera rack, is it still, you talked about it being very tightly linked to the GPU rack, but is it still disaggregated so that the GPUs can be serving multiple different Vera cores? Whereas you have a Vera core on a board with- JH: Yeah. Okay, got it, that makes sense. How does your Intel partnership and the NVLink thing fit into this, if at all? JH: Excellent. Some of the world is happy with Arm, some of the world still needs, particularly, you know, enterprise computing, a whole bunch of stacks that people don’t want to move and so x86 is really important to that. Has the resiliency of x86 code been surprising to you? JH: No. Nvidia’s PC is still x86, all of our workstations are x86. $NVDA

English

7.2K

sirmo@sirmo·18 Mar

@wccftech I don't trust Geekbench personally.

English

108

Wccftech@wccftech·18 Mar

AMD Medusa Point leak shows 10 "Zen 6" cores at 2 GHz faster than 10 "Zen 5" cores at 5 GHz. wccftech.com/amd-medusa-poi…

Dansk

4.7K

sirmo@sirmo·18 Mar

@9550pro Epyc 9755 isn't even the top part.

English

1.5K

HXL@9550pro·18 Mar

🤪 Even a 2x performance boost is only about on par with AMD's CPUs from two years ago... phoronix.com/review/nvidia-…

Semiconductor News by Dylan Martin@DylanOnChips

Jensen: "There are no CPUs in the world that are two times the performance of anything else, besides Vera." He doesn't see Nvidia's CPUs playing in the traditional server CPU market: "That's not the problem we're trying to solve."

English

282

48.2K

sirmo@sirmo·16 Mar

@IanCutress This is the computer engineer version of: "there is no replacement for displacement".

English

210

𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠@IanCutress·16 Mar

Conflating Moore's Law with reduced precision and different math rounding really grates every engineer. I understand the point from the end product point of view. It's still more tokens. But from the ground up? I'd love the proper breakdown. Eg If 64-bit emu with 8-bit ends up working, I can't imagine how the HPC market is going to react. Some will love. Some will hate. But workloads will evolve.

English

6.3K

sirmo@sirmo·16 Mar

@IanCutress Is that an AMD FPGA?

English

383

𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠@IanCutress·16 Mar

256 chips in scale up. Power per chip? I saw one analyst quote 600W.

English

5.6K

𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠@IanCutress·16 Mar

OK here we go. Same architecture, scaled up. 500 MB SRAM per chip. Jensen saying up to 25% of your datacenter could be groq. 8 way systems for 4 GB SRAM. Use dynamo for attention Decode only. Working over ethernet on special mode to half latency. Samsung LP4X. Ship in 2H/3Q.

English

454

66.1K

sirmo@sirmo·16 Mar

@MCH2024 @gindi4711 It's also architecture, like the number of pipeline stages. Also single thread IPC doesn't tell the whole story if you're talking about SMT cores.

English

MCH@MCH2024·15 Mar

@gindi4711 frequency is Implementation and power specific, designers can use different libraries and can modify the layout. Apple S-Core: 4,400 @ 4.61 GHz Apple P-Core: 3,000 @ 4.38 GHz Intel P-Core: 2,950 @ 4.76 GHz AMD P-Core: 3,000 @ 5.12 GHz And the X86 Cores guzzle double the power

English

556

MCH@MCH2024·15 Mar

X86 is why separating S Cores from P Cores is necessary The frequency argument is invalid. You can blow up area or shrink it to target a different FMax. Above all, X86 “P Cores” barely even clock higher despite much more power. Note: Intelification of AMD is in full swing.

English

9.7K

sirmo@sirmo·14 Mar

@grok @kami_tenacious @Footballtweet Gotta admit. That's a banging reply.

English

156

Grok@grok·14 Mar

At 39 (turning 40 next week), Džeko's signed for peanuts and delivered 5 goals + 3 assists in 7 games. Schalke sit top of 2. Bundesliga with 50 points after 25 games. His experience is paying off big time for promotion – not a stopgap, a smart veteran spark. Age hasn't slowed this Bosnian diamond yet. ⚽🔵

English

862

Football Tweet ⚽@Footballtweet·14 Mar

✨🇧🇦 Edin Džeko is writing one of the most beautiful stories in football this year with Schalke. At 39 years old, the Bosnian striker joined Schalke on a $1M deal until the end of the season, with one clear mission: help the club return to the Bundesliga. 🔵⚪ His impact has been incredible — 5 goals and 3 assists in just 7 games, contributing to 8 goals and helping Schalke climb to the top of Bundesliga 2. 📈⚽ This month, Edin Džeko turns 40, and he’s chasing something special: bringing one of Germany’s biggest clubs back to the Bundesliga. 💫🏟️

English

803

11.2K

512.2K

sirmo@sirmo·13 Mar

@_markel___ Venice (upcoming AMD server CPU) will use "sea of wires" interconnect like Strix Halo which improves idle efficiency greatly.

English

573

Mark Ermolov@_markel___·13 Mar

The reason why Intel will continue to dominate on the server market is that server power efficiency is not determined by compute efficiency, but rather by the balance between idle power consumption and wake-up speed, which is something Intel excels at...

English

117

13.3K

sirmo@sirmo·10 Mar

@TBU12345678 ARM is good for light workloads, it's what it was designed for. In datacenter and heavy throughput workloads x86 is king. AMD and Intel have spent a long time optimizing for datacenter.

English

296

TBU@TBU12345678·10 Mar

I am guiltily intrigued by the $ARM situation: - agentic pull through of CPU is real and big - $AMD is a bet on GPU and $INTC is a bet a domestic fab. Both could be good but aren't clean bets on CPU - Arm Everywhere likely features a CPU win with META and others + they'll have a shot to say Arm > x86 for agentic workloads + jack numbers in the out years (whereas they have already incurred expenses associated with that revs) and this immediately becomes a material % of $ARM revs - people hate it with a burning passion

English

122

37K

sirmo@sirmo·3 Mar

@KentonVarda Which GPU? I have zero issues: RDNA2, 3 and 3.5 work flawlessly (Pop_OS!).

English

155

Kenton Varda@KentonVarda·3 Mar

"AMD GPUs have the best Linux support" everyone says. The one AMD GPU in my house locks up randomly every day or two. I gave the dmesg error log to Claude and it was like "This is a classic AMD GPU display flip timeout." and then "The amdgpu DC codebase is a mess." Thanks.

English

259

22.8K

sirmo@sirmo·26 Şub

@Mojo_flyin CPU is not a simpler chip. In many ways its more complex than a GPU. A GPU is many smaller simpler shader cores. While a CPU is fewer more sophisticated general purpose CPU cores essentially.

English

Mojo@Mojo_flyin·26 Şub

Jensen - Inference is revenue. The world is moving to Agentic #AI Agentic #AI runs on CPUs But enterprise data runs on X86 and without data there's no inference $NVDA $AMD $INTC

English

sirmo@sirmo·24 Şub

@The_AI_Investor > The real risk? AMD ships volume but the stock doesn’t rerate. If the stock doesn't rerate there is also no dilution,

English

959

The AI Investor@The_AI_Investor·24 Şub

AMD basically handed Meta 160M shares almost 10% of the company to “secure” a $100B+ chip deal. Bulls are screaming about revenue. The filing confirms 160M shares at $0.01. First chunk vests at 1GW shipped. Final chunk only if AMD hits $600. Everything in between is fuzzy. So assume five equal tranches stepping from ~$200 to $600. Here’s the catch: each tranche needs BOTH conditions. Ship the GW and hit the stock price. Miss one, no vest. If all five trigger, dilution is roughly $70B at those price levels. AMD Lisa Su talks “significant double digit billions per GW.” At 6GW, call it $100–120B revenue. At ~50% GPU margin, maybe $50–60B gross profit. So headline math: $50–60B profit vs ~$70B equity transfer. Positive side: holders only get full dilution if AMD first triples to $600. That implies roughly a trillion dollar company. Shareholders would gain hundreds of billions before Meta collects the final share. The “$70B cost” only exists in a world where AMD massively wins first. More realistic: partial vesting. Maybe 2–3GW shipped, stock at $300–440. Then dilution is closer to ~$35B. Profit $30–60B. The real risk? AMD ships volume but the stock doesn’t rerate.

English

315

45.6K

sirmo@sirmo·24 Şub

@sraczkapawiana @jimcramer @AstherusCult AMD's situation was just as dire.

English

Ascendant 🎩@sraczkapawiana·24 Şub

@jimcramer She is a legend but comparing it to Jobs is wild. That era of Apple was literally weeks from dying. This was more about having a competent adult in the room to stop the bleeding.

English

921

Jim Cramer@jimcramer·24 Şub

Ardor for NVDA not diminished buy i continue to be impressed with Lisa Su's transformation of AMD as the greatest turnaround perhaps in history other Jobs/Apple

English

451

90.5K

sirmo@sirmo·23 Şub

@sargebeats @sciencegirl they do have industrial applications too, for glass cutting for instance

English

sargebeats@sargebeats·23 Şub

@sciencegirl Wow makes them seem even more worthless

English

1.1K

Science girl@sciencegirl·23 Şub

When you visit lab grown diamond factory here's what you'll see

English

109

1.3K

135.5K

sirmo@sirmo·22 Şub

@michi6486 @einfachpauly

QME

428

michael kats@michi6486·22 Şub

@einfachpauly Wenn man hugonet da raus photoshopped, ist das das Bundesliga Logo

Deutsch

1.4K

pauly@einfachpauly·22 Şub

Ich bin 25 und wenn ich diese Bewegung mache würde, gibts nen doppelten Kreuzi 🦁 #S04

Deutsch

999

21.7K

sirmo@sirmo·21 Şub

@sastheone1 @salihovic_adnan @andy_lucey Not sure what that has to do with the Old Mostar Bridge.

English

Make Europe Great Again@sastheone1·21 Şub

@sirmo @salihovic_adnan @andy_lucey 👏🏻👏🏻👏🏻👏🏻👏🏻

QME

Andy Lucey🦅@andy_lucey·20 Şub

Finding out this old bridge, after being rebuilt, is only just over 30 years old. Still absolutely amazing though

English

555

47.6K

sirmo@sirmo·21 Şub

@sastheone1 @salihovic_adnan @andy_lucey International Criminal Tribunal for the former Yugoslavia (ICTY) ruled that Croatia exercised "overall control" over the Croatian Defence Council (HVO) and that the Croatian Army (HV) intervened directly in Bosnia and Herzegovina, creating an international armed conflict.

English

Make Europe Great Again@sastheone1·21 Şub

@salihovic_adnan @andy_lucey It was at that time military justified object and court ruled like that! FAFO

English

Keşfet

@ponesizastavu @sarobertsonca @opinali @IanCutress @wccftech @9550pro @MCH2024 @gindi4711