Rishab Verma

723 posts

Rishab Verma
@rishabv90

Google AI Compute Infra. A fellow AI-pilled engineer. And of course, thoughts here are my own 🫰

Seattle, WA · Joined May 2017
282 Following · 121 Followers
Belion @Gilbert_Belion
Reliability aside, this car just hits differently. The Range Rover SV is an experience Toyota can't replicate, and it knows it.
[image]
Rishab Verma @rishabv90
@quxiaoyin +1, Xiaoyin. Although edge AI is picking up pretty fast...
Xiaoyin Qu @quxiaoyin
Local is overrated. Cloud sandboxes are the future of all AI agents. Long AWS, Google Cloud, and cloud providers.
Hemingway Capital @lfg_cap
H100s are worth more per hour today than 18 months ago. Yet per-token prices have collapsed >90%. GPU economics are improving at every level. Don’t let @michaeljburry break your brain.
rachel 🪷 @racheleizner
at least Elizabeth Holmes was doing fraud with something fun. SOC 2 is boring af
ein @stylishdawg
@rishabv90 😭😭😭😭😭😭
[image]
Husker @Existencial33
@lfg_cap @michaeljburry No they’re not. 18 months ago H100s were at $2.50/h. Now they’re at $1.70.
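The two tweets track two different prices: the hourly rental rate (which, on Husker's numbers, fell about 32%) and the per-token price, which can still collapse >90% because throughput per GPU rose much faster than the rental rate moved. A minimal sketch of that arithmetic in Python (the $/hour figures come from the tweets above; the tokens-per-second numbers are hypothetical assumptions, for illustration only):

```python
# Cost per token = GPU rental cost / tokens served in that time.
# $/hour figures are from the thread; throughput figures are assumed.

def cost_per_million_tokens(gpu_usd_per_hour: float, tokens_per_second: float) -> float:
    """USD per 1M output tokens for one GPU at full utilization."""
    tokens_per_hour = tokens_per_second * 3600
    return gpu_usd_per_hour / tokens_per_hour * 1_000_000

# 18 months ago: $2.50/h (per the reply above), assuming 500 tok/s per H100.
then = cost_per_million_tokens(2.50, 500)
# Today: $1.70/h, assuming 5,000 tok/s after batching/kernel/quantization gains.
now = cost_per_million_tokens(1.70, 5_000)

print(f"then: ${then:.2f}/M tokens, now: ${now:.3f}/M tokens")
print(f"hourly price change: {(1.70 - 2.50) / 2.50:+.0%}")    # -32%
print(f"per-token price change: {(now - then) / then:+.0%}")  # roughly -93%
```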
Shivani Patel @shivanijpatel
data center on Alcatraz - who's building this?
Brian Roemmele @BrianRoemmele
My project for the last few hours: Convert the Nvidia RTX 5090 to 128 GB of unified memory and a 28% speed increase! This board beats the L40s with a $10,000 savings! I got the process down with Mr. @Grok supervising. He said “Garage AI builders beat corporate bloat”.
[image]
Yuchen Jin @Yuchenj_UW
This is how you handle a PR crisis: Own the mistake. Acknowledge your partners. Commit to fixing it. That’s how you earn trust. Glad to see the Cursor cofounders got it right. Also love the trend: more companies building on top of open-source models. GPUs go brrr.
[image]
Rishab Verma @rishabv90
@thegenioo After speaking with a couple of Mistral engineers for 45 mins at GTC, I can confirm this comparison is not accurate 🙏 But sure...
Hamza @thegenioo
Mistral is the xAI of open-source labs
Artificial Analysis @ArtificialAnlys

Mistral has released Mistral Small 4, an open weights model with hybrid reasoning and image input, scoring 27 on the Artificial Analysis Intelligence Index.

@MistralAI's Small 4 is a 119B mixture-of-experts model with 6.5B active parameters per token, supporting both reasoning and non-reasoning modes. In reasoning mode, Mistral Small 4 scores 27 on the Artificial Analysis Intelligence Index, a 12-point improvement from Small 3.2 (15) and now among the most intelligent models Mistral has released, surpassing Mistral Large 3 (23) and matching the proprietary Magistral Medium 1.2 (27). However, it lags open weights peers with similar total parameter counts such as gpt-oss-120B (high, 33), NVIDIA Nemotron 3 Super 120B A12B (Reasoning, 36), and Qwen3.5 122B A10B (Reasoning, 42).

Key takeaways:

➤ Reasoning and non-reasoning modes in a single model: Mistral Small 4 supports configurable hybrid reasoning with reasoning and non-reasoning modes, rather than the separate reasoning variants Mistral has released previously with their Magistral models. In reasoning mode, the model scores 27 on the Artificial Analysis Intelligence Index. In non-reasoning mode, the model scores 19, a 4-point improvement from its predecessor Mistral Small 3.2 (15).

➤ More token efficient than peers of similar size: At ~52M output tokens, Mistral Small 4 (Reasoning) uses fewer tokens to run the Artificial Analysis Intelligence Index compared to reasoning models such as gpt-oss-120B (high, ~78M), NVIDIA Nemotron 3 Super 120B A12B (Reasoning, ~110M), and Qwen3.5 122B A10B (Reasoning, ~91M). In non-reasoning mode, the model uses ~4M output tokens.

➤ Native support for image input: Mistral Small 4 is a multimodal model, accepting image input as well as text. On our multimodal evaluation, MMMU-Pro, Mistral Small 4 (Reasoning) scores 57%, ahead of Mistral Large 3 (56%) but behind Qwen3.5 122B A10B (Reasoning, 75%). Neither gpt-oss-120B nor NVIDIA Nemotron 3 Super 120B A12B support image input. All models support text output only.

➤ Improvement in real-world agentic tasks: Mistral Small 4 scores an Elo of 871 on GDPval-AA, our evaluation based on OpenAI's GDPval dataset that tests models on real-world tasks across 44 occupations and 9 major industries, with models producing deliverables such as documents, spreadsheets, and diagrams in an agentic loop. This is more than double the Elo of Small 3.2 (339) and close to Mistral Large 3 (880), but behind gpt-oss-120B (high, 962), NVIDIA Nemotron 3 Super 120B A12B (Reasoning, 1021), and Qwen3.5 122B A10B (Reasoning, 1130).

➤ Lower hallucination rate than peer models of similar size: Mistral Small 4 scores -30 on AA-Omniscience, our evaluation of knowledge reliability and hallucination, where scores range from -100 to 100 (higher is better) and a negative score indicates more incorrect than correct answers. Mistral Small 4 scores ahead of gpt-oss-120B (high, -50), Qwen3.5 122B A10B (Reasoning, -40), and NVIDIA Nemotron 3 Super 120B A12B (Reasoning, -42).

Key model details:

➤ Context window: 256K tokens (up from 128K on Small 3.2)
➤ Pricing: $0.15/$0.60 per 1M input/output tokens
➤ Availability: Mistral first-party API only. At native FP8 precision, Mistral Small 4's 119B parameters require ~119GB to self-host the weights (more than the 80GB of HBM3 memory on a single NVIDIA H100)
➤ Modality: Image and text input with text output only
➤ Licensing: Apache 2.0 license
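Two of the quoted figures are easy to sanity-check with back-of-envelope arithmetic: the ~119GB self-hosting footprint follows from 119B parameters at FP8 (one byte each), and the cost of the index's output tokens follows from the quoted pricing. A minimal sketch (all inputs are taken from the quote above; the cost line ignores input tokens, for which no count is given):

```python
# Back-of-envelope checks on the quoted Mistral Small 4 details.
PARAMS = 119e9        # total parameters (quoted)
BYTES_PER_PARAM = 1   # native FP8: 1 byte per parameter

weights_gb = PARAMS * BYTES_PER_PARAM / 1e9
print(f"weights alone: ~{weights_gb:.0f} GB")  # ~119 GB, over an H100's 80 GB HBM3

# Rough cost of the ~52M output tokens the reasoning-mode index run used,
# at the quoted $0.60 per 1M output tokens (input tokens excluded).
output_tokens = 52e6
print(f"output-token cost: ~${output_tokens / 1e6 * 0.60:.0f}")  # ~$31
```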

Rishab Verma @rishabv90
@amanrsanger Congratulations 👏🎉 Super impressive, and I got to meet your super awesome team at GTC!