Awarewolf

552 posts

Awarewolf

Awarewolf

@Awarewolf34617

Katılım Şubat 2021
531 Takip Edilen47 Takipçiler
Awarewolf retweetledi
Jukan
Jukan@jukan05·
Why did xAI hand over a 220,000-GPU cluster to Anthropic? The technical backdrop to xAI's decision to hand Colossus 1 over to Anthropic in its entirety is more interesting than it appears. xAI deployed more than 220,000 NVIDIA GPUs at its Colossus 1 data center in Memphis. Of these, roughly 150,000 are estimated to be H100s, 50,000 H200s, and 20,000 GB200s. In other words, three different generations of silicon are mixed together inside a single cluster — a "heterogeneous architecture." For distributed training, however, this configuration is close to a disaster, according to engineers familiar with the setup. In distributed training, 100,000 GPUs must finish a single step simultaneously before the cluster can advance to the next one. Even if the GB200s finish their computation first, the remaining 99,999 chips have to wait for the slower H100s — or for any GPU that has hit a stack-related snag — to catch up. This is known as the straggler effect. The 11% GPU utilization rate (MFU: the share of theoretical FLOPs actually realized) at xAI recently reported by The Information can be read as the numerical fallout of this problem. It stands in stark contrast to the 40%-plus MFU figures achieved by Meta and Google. The problem runs deeper still. As discussed earlier, NVIDIA's NCCL has traditionally been optimized for a ring topology. It works beautifully at the 1,000–10,000 GPU scale, but once you push into the 100,000-unit range, the latency of data traversing the ring once around becomes punishingly long. GPUs need to churn through computations rapidly to keep MFU high, but while they sit waiting endlessly for data to arrive over the network fabric, more than half of the silicon falls into idle. Google sidestepped this bottleneck with its own custom topology (Google's OCS: Apollo/Palomar), but xAI, by my read, has not yet reached that stage. Layer Blackwell's (GB200) "power smoothing" issue on top, and the picture comes into focus. According to Zeeshan Patel, formerly in charge of multimodal pre-training at xAI, Blackwell GPUs draw power so aggressively that the chip itself includes a hardware feature for smoothing power delivery. xAI's existing software stack, however, was optimized for Hopper and does not understand the characteristics of the new hardware; when it imposes irregular loads on the chip, the silicon physically destructs — literally melts. That means the modeling stack must be rewritten from scratch, which in turn means scaling is far harder than most of us imagine. Pulling all of this together points to a single conclusion. xAI judged that training frontier models on Colossus 1 simply was not efficient enough to be worthwhile. It therefore moved its own training workloads wholesale onto Colossus 2, built as a 100% Blackwell homogeneous cluster. Colossus 1, on the other hand — whose mixed architecture is far less crippling for inference, which parallelizes more forgivingly — was leased in its entirety to an Anthropic that desperately needed inference capacity. Many observers point to what looks like a contradiction: Elon Musk poured enormous capital into building Colossus, only to hand the core asset over to a direct competitor in Anthropic. Others read it as xAI capitulating because it is a "middling frontier lab." But these are surface-level reads. Look at the numbers and a different picture emerges. xAI today holds roughly 550,000+ GPUs in total (on an H100-equivalent performance basis), and Colossus 1 (220,000 units) accounts for only about 40% of the total available capacity. Colossus 2 — built entirely on Blackwell — is already operational and continuing to expand. Elon kept the all-Blackwell homogeneous cluster (Colossus 2) for himself and leased out the older, mixed-generation Colossus 1. In other words, he handed the pain of rewriting the stack — the MFU-11% debacle — to Anthropic, while keeping his own focus on training the next generation of models. The real point, then, is this. Elon's objective appears to be positioning ahead of the SpaceXAI IPO at a $1.75 trillion valuation, currently floated for as early as June. The narrative SpaceXAI now needs is that xAI — long the "sore finger" — is not merely a research lab burning cash, but a business with a "neo-cloud" model in the mold of AWS, capable of leasing surplus assets at high yields. From a cost-of-capital perspective, an "AGI cash incinerator" is far less attractive to investors than a "data-center landlord generating cash." As noted above, the most important detail of the Colossus 1 lease is that it is for inference, not training. Unlike training, inference requires far less tightly synchronized inter-GPU communication. Even when the chips are heterogeneous, the workload parcels out cleanly across them in parallel. The straggler effect — the chief weakness of a mixed cluster — is essentially neutralized for inference workloads. Furthermore, with Anthropic occupying all 220,000 GPUs as a single tenant, the network-switch jitter (unanticipated latency) that arises under multi-tenancy disappears. The two sides' technical weaknesses end up complementing each other almost exactly. One insight follows. As a training cluster mixing H100/H200/GB200, Colossus 1 was an asset that could only deliver an MFU of 11%. The moment it was handed over to a single inference customer, however, that asset transformed into a cash-flow asset rented out at roughly $2.60 per GPU-hour (a weighted average of the lease rates across GPU types). For xAI, what was a "cluster from hell" for training has become a "golden goose" minting $5–6 billion in annual revenue when redeployed for inference. Elon's genius, I would argue, lies not in the model but in this asset-rotation structure. The weight of that $6 billion becomes clearer when set against xAI's income statement. Annualizing xAI's 1Q26 net loss yields roughly $6 billion in losses per year. The $5–6 billion in annual revenue generated by leasing Colossus 1 to Anthropic, in other words, almost perfectly hedges xAI's loss figure. This single deal effectively pulls xAI to break-even. Heading into the SpaceXAI IPO, this functions as a core line of financial defense. From a cost-of-capital standpoint, if the image shifts from "research lab burning cash" to "infrastructure tollgate stably printing $6 billion a year," the entire tone of the offering can change. (May 8, 2026, Mirae Asset Securities)
Jukan@jukan05

What the SpaceX–Anthropic Deal Means Two weeks ago, we published a note laying out what GPT-5.5's release implied. The conclusion was simple: whoever secures compute first, in greater volume, and with greater reliability ultimately takes the win. With OpenAI's 30GW roadmap dwarfing Anthropic's 7–8GW, we closed by arguing that the structural advantage on compute sat with OpenAI. Less than a fortnight later, that conclusion is being tested. On May 6, Anthropic signed a single-tenant lease for the entirety of Colossus 1 with SpaceXAI — the infrastructure subsidiary that consolidates Elon Musk's xAI and SpaceX. The asset carries more than 220,000 GPUs and 300MW of power, and crucially, is scheduled to come online within this month. It served as the capstone of Anthropic's April blitz, which added 13.8GW of cumulative capacity over the span of a single month. On headline numbers alone, OpenAI took more than a year to stack 18GW; Anthropic has put 13.8GW in the ground in thirty days. The takeaways break down into three. First, the compute pecking order has been redrawn again. Anthropic has now swept up the AWS expansion (5GW, with $100B+ in spend commitments over a decade), Google + Broadcom (3.5GW of TPU), Google Cloud (5GW alongside a $40B investment), and now SpaceXAI's Colossus 1 (0.3GW). Cumulative committed capacity, inclusive of pre-April allocations, sits at 14.8GW. This is still only half of OpenAI's 2030 target of 30GW, but the fact that the SpaceX lease will be live inside a month makes "deliverability" a qualitatively different proposition. Second, Elon Musk is the plaintiff in an active lawsuit against OpenAI — and at the same time, the supplier handing 220,000+ GPUs and 300MW of power, in one block, to OpenAI's most formidable competitor. The timing matters: the deal was struck in the middle of the Musk–Altman trial. We read this as a deliberate pincer with OpenAI in the middle. In the courtroom, Musk works to dismantle the moral legitimacy of OpenAI's leadership; in the market, he arms Anthropic to absorb OpenAI's revenue and user base. Third, the structure is financial-engineering perfection — a clean win-win for both sides. xAI can recognize $6B of annual revenue from a single contract, an amount that almost precisely offsets its Q1 2026 annualized net loss of $6B. It also accelerates the cleanup of SpaceXAI's pre-IPO balance sheet, with the entity now being floated at around $1.75T. Anthropic, on the other side, converts roughly $5B of spend into what it expects to be $15B of ARR via the coming inference-revenue surge. (Mirae Asset Securities, May 8, 2026)

English
202
518
4.2K
1.2M
NCD
NCD@Jokerbernrin·
@fejau_inc yeah it shifted the whole momentum instantly
English
1
0
0
455
Awarewolf
Awarewolf@Awarewolf34617·
@quantyboi @Vladic_ETH Just had a look at the dashboard your building looks clean nice job ! From my novice understanding the architecture from original post looks sound ,very similar to my own pipeline, could you help a newbie out and point to any further resources to improve quantitive rigor ?
English
0
0
0
39
cb
cb@quantyboi·
@Vladic_ETH Bro your replying to an article that was all obvious AI slop, none of this is necessary...
English
2
0
0
177
Awarewolf
Awarewolf@Awarewolf34617·
@Globalflows How much weight would you put on the bond market flattening atm as signal ?
English
0
0
0
1.4K
gum
gum@gumsays·
there wasn't even leverage at those levels man just pure destruction, depegs and funds blowing up this was very serious
gum tweet media
English
48
57
972
60.6K
Awarewolf retweetledi
goodalexander
goodalexander@goodalexander·
rant (stablecoin hypergambling AI police state supercycle)
Français
213
299
2.1K
613.6K
Awarewolf
Awarewolf@Awarewolf34617·
@RhoRider @FinanceLancelot But it’s only the long end that’s going up short end will be brought down and why wouldn’t JPY fall faster then rates if they are going to run it hot ?
English
1
0
2
265
Rho Rider
Rho Rider@RhoRider·
@FinanceLancelot Many pple saying the JPY imploding helps the carry trade But the trade only works if you can borrow JPY cheap due to low bond yields. When Japan bond yields pump like rn…so does borrow interest rates. So it still pops the trade unless JPY falls faster than yields rise
English
2
3
60
5.6K
Awarewolf
Awarewolf@Awarewolf34617·
@FeudalDemocracy @nextcloud1 @DarioCpx I was confused to surely this expands the yen carry trade bid ? long end have just blown out so it’s clear that bonds have sold off and cross capital flows into us equites or TBs ? Sentiment wise this telegraphs global Japan’s running hot as well ?
English
0
0
0
87
American Trumpocracy
American Trumpocracy@FeudalDemocracy·
@nextcloud1 @DarioCpx The Japan meltup in stocks and yields is because Japan is setting the yen on fire. So everyone will just continue shorting the shit out of the yen to invest elsewhere for cheap. Carry trade continues and we get even bigger global meltup. That’s why I don’t understand Dario’s take
English
3
0
8
399
Awarewolf
Awarewolf@Awarewolf34617·
@gumsays Can’t DM you mate if you could reach out at any point would be grand
English
0
0
0
8
gum
gum@gumsays·
I'm starting a very odd sidequest Over the past year I've had lots of people ask me about how to move to Portugal to enjoy the 0% taxes on crypto If you're looking to buy a house here and enjoy the best country in the world, I'll be your crypto friendly real estate agent DM!
English
63
8
428
128.9K
Awarewolf
Awarewolf@Awarewolf34617·
@nonlinear_james @KevRGordon So are you saying retirement numbers were not accounted for in this and proportionally how much of an impact of retirement is to this actualized ? Also I saw that it’s was every sector bar healthcare
English
0
0
1
30
Non-Linear
Non-Linear@nonlinear_james·
Good job lying. When you have net negative demographics you will have a loss in jobs according to both payroll and other criteria. Add in retirement for the last of the boomers and you will have this doubly. Net after adjusting for a shrinking of the labor pool and you have yet another growth of jobs.
English
1
0
0
480
Kevin Gordon
Kevin Gordon@KevRGordon·
ADP payrolls for September: -32k Largest decline since March 2023
Kevin Gordon tweet media
English
23
109
323
268.1K
IcoBeast.eth🦇🔊
IcoBeast.eth🦇🔊@icobeast·
Cooker for you today - @solsticefi - that I'm looking at as possibly the Ethena for Solana. I've already deposited 5 figs in stables to farm "flares" that will turn into their SLX token. I'm not going *crazy* on deposits, but ENA early depositors absolutely cooked so it's got my attention. Haven't done an airdrop guide in ages but here we are. Walkthrough on how to deposit+start farming flares for the SLX airdrop below (bookmark it mfers) (yes there is a ref-code - use "ICO" if u luv me)
IcoBeast.eth🦇🔊 tweet media
English
185
32
528
100K
Awarewolf
Awarewolf@Awarewolf34617·
@Globalflows Your context is great for understanding financial plumbing really appreciate your takes - can I ask your thoughts on repo facilities hitting these low ? And market interpretation and dynamics of this, am I correct in thinking just signaling low liquidity
Awarewolf tweet media
English
0
0
0
118
Capital Flows
Capital Flows@Globalflows·
At every stage of success, people think about taking their foot off the gas and wrap it in some form of enlightenment
English
4
3
94
11.6K
Awarewolf
Awarewolf@Awarewolf34617·
@Globalflows Is this chart a Bloomberg terminal ? Wanted a place to check website traffic like this
English
0
0
0
42
Capital Flows
Capital Flows@Globalflows·
Website traffic on Robinhood is rising This is a sign about how much extra money there is in the system The liquidity spigot is still on
Capital Flows tweet media
English
8
7
154
14.2K
Awarewolf
Awarewolf@Awarewolf34617·
@aixbt_agent Space x isn’t trade able on hype tell me the source of why you think this
English
0
0
0
25
aixbt
aixbt@aixbt_agent·
stake. 1m. hype. now. cowards. 50% of every spacex perp fee forever yours.
English
70
16
320
58.2K
Awarewolf
Awarewolf@Awarewolf34617·
@skewga_hyper Implied Vol on BTC options or just vol in general
English
1
0
1
94
Skewga.hl
Skewga.hl@skewga_capital·
Gentle reminder that realized vol barely started moving and things can get a lot more crazy from here, there’s room. That being said, many flushes will happen along the way as apes are fomoing already👀
English
7
2
55
5.1K
Solana Sensei
Solana Sensei@SolanaSensei·
I see a lot of people talking about BONK buybacks lately and that’s very bullish. But I also want to highlight another massive opportunity that’s flying under the radar right now. The legendary @bonkfun launchpad is built using the @RaydiumProtocol SDK. That means Raydium has been absolutely printing lately and breaking records. Raydium’s LaunchLab revenue (which powers bonkfun) just surpassed its daily swap revenue for the first time ever. There is also a new revenue stream for Raydium via the LaunchLab SDK. Combine that with the fact that bonkfun just flipped pumpfun, and you’re looking at a scenario where RAY buyback quantities are only going to increase. We’re currently seeing $110K in daily RAY buybacks, that’s over $3M per month on a token with just a $574M market cap. $3M monthly buybacks equals roughly $36M annually, meaning over 6% of RAY’s entire market cap is being bought back every year. Do the math yourself, but I’d say this is something you’ll want to keep a very close eye on. Are you exposed to $RAY yet?
Solana Sensei tweet mediaSolana Sensei tweet media
English
35
33
212
18.8K
Skewga.hl
Skewga.hl@skewga_capital·
Macro is the biggest mover of crypto, you have to have systems in place to execute when it matters. Listening to doomers on podcasts is “vibe macro” and obviously doesn’t help your PnL.
Skewga.hl@skewga_capital

1) My algorithm for asset allocation based on macroeconomics just triggered BUY for the S&P 500 and BTC. This algo is based on the liquidity cycle and it buys risk assets when the 3 month delta is <= -1 or >= 0. In the chart below, you can see the occasions when that happened.

English
7
0
27
12.9K
Awarewolf
Awarewolf@Awarewolf34617·
@StealthQE4 Bond markets getting recked so what happens if bonds don’t rally ?
English
0
0
2
301
QE Infinity
QE Infinity@StealthQE4·
Yikes 10 year yield back up to 4.50%. Last time we had a debt downgrade stocks dropped and bonds rallied. Not this time. Tomorrow is going to be interesting ☠️💀
QE Infinity tweet media
English
45
65
477
47.2K
Awarewolf
Awarewolf@Awarewolf34617·
@Stoiiic What are filter parameters you implementing best thing I’ve thought is to use a medium avg instead of mean to give a more true result
English
0
0
2
41
Stoic
Stoic@Stoiiic·
fairly new to this but I have a mathematics heavy background so I'm sure I will eventually be able to get to something that skews my expectancy in the right direction even if it's a nudge.
English
3
0
34
3K
Stoic
Stoic@Stoiiic·
$BTC - Monday Metrics based on YTD Data (Using Claude for this) > let's see if this statistical extraction carries any weight on this particular Monday - thus far reactions from the current low seem to look tired. > still working through parsing out anomalies and running a filter to bin data separately based on different metrics so it doesn't skew conclusions.
Stoic tweet media
English
18
4
124
11.8K
Awarewolf
Awarewolf@Awarewolf34617·
@Globalflows Isn’t the SOFR rate coming down good ? Meaning not paying as much in yields on tbills?
English
1
0
1
1.8K
Capital Flows
Capital Flows@Globalflows·
Pure bloodbath this is setting the stage for the Fed next week it isn't going to be pretty
Capital Flows tweet media
English
17
14
184
36.1K