Abhi Venigalla

941 posts

@ml_hardware

Researcher @Databricks. Former @MosaicML, @CerebrasSystems. Addicted to all things compute.

San Francisco, CA · Joined October 2018
1.5K Following · 8.1K Followers
Abhi Venigalla retweeted
Tanishq Kumar @tanishqkumar07
I've been working on a new LLM inference algorithm. It's called Speculative Speculative Decoding (SSD) and it's up to 2x faster than the strongest inference engines in the world. Collab w/ @tri_dao @avnermay. Details in thread.
135
456
4.1K
608.6K
Abhi Venigalla retweeted
Davis Blalock @davisblalock
🚀 Today we’re releasing FlashOptim: better implementations of Adam, SGD, etc, that compute the same updates but save tons of memory. You can use it right now via `pip install flashoptim`. 🚀 arxiv.org/abs/2602.23349 A bunch of cool ideas make this possible: [1/n]
30
228
1.6K
216.6K
Abhi Venigalla retweeted
ken @aquariusacquah
2 weeks ago, we rebuilt our entire product. "Browser automation" fell short of our mission to eliminate all repetitive knowledge work. The new Kaizen is the ultimate digital employee: always on, extremely capable, continually learning. Sign up for access in the tweet below.
17
11
68
7.1K
Abhi Venigalla retweeted
SemiAnalysis @SemiAnalysis_
InferenceX v2: NVIDIA Blackwell Vs AMD vs Hopper - Formerly InferenceMAX, GB300 NVL72, MI355X, B200, H100, Disaggregated Serving, Wide Expert Parallelism, Large Mixture of Experts, SGLang, vLLM, TRTLLM semianalysis.substack.com/p/inferencex-v…
18
44
243
230.1K
Abhi Venigalla retweeted
Cerebras @cerebras
OpenAI Codex-Spark powered by Cerebras You can now just build things faster—at 1,000 tokens/s.
60
141
2K
286.7K
Abhi Venigalla retweeted
Rohan Kodialam @KodialamRo
The world’s most powerful data agent releases today. Sphinx 1.0 is here to power elite data teams.
70
177
1.9K
1.7M
tae kim @firstadopter
After ten years, can someone give me an estimate of TPU 2025 revenue for external customers? Bueller?
12
2
80
13.4K
Abhi Venigalla retweeted
Cody Blakeney @code_star
I've got something new for everyone. My first substack article! Not the one I planned to do first, but a fun one! I have made a handy calculator based on the DeepSeek v1 coefficients for finding optimal LR and batch sizes for dense LLMs.
16
17
169
40.8K
Abhi Venigalla retweeted
Sphinx AI @getsphinx
🚀 Thrilled to announce our $9.5M funding round led by @buckymoore at @lightspeedvp, alongside an incredible group of investors from the Valley and New York.

✨ With this announcement, we’re also moving Sphinx Copilot -- the state-of-the-art AI agent for data science -- out of closed beta. It’s now available at sphinx.ai (with a generous free tier!). Our early partners have gone from raw data ➝ commercial insights in minutes instead of days. We can’t wait to see what the data community builds with Sphinx.

🌱 This is just the beginning for Sphinx. We’re redefining how AI works with data, from copilots to fully autonomous researchers and analysts. We're excited to keep building best-in-class machine intelligence for a new generation of data-driven innovation.

sphinx.ai/blog/sphinx-la…
7
12
58
32.6K
Abhi Venigalla retweeted
typedfemale @typedfemale
alexander wang with yann lecun
16
52
1.9K
129.2K
Abhi Venigalla retweeted
Sasha Doubov @sashadoubov
memory-bound gf, compute-bound bf
3
6
113
11.7K
Abhi Venigalla retweeted
typedfemale @typedfemale
presenting: big jeff's trainium hell
114
566
4.7K
670.5K
Abhi Venigalla retweeted
Cerebras @cerebras
Cerebras just beat NVIDIA Blackwell
Last week: Blackwell hit 1,000 t/s on Llama 4.
Today: Cerebras hit 2,500 t/s on the same model, same benchmarks by @ArtificialAnlys
Blackwell smoked Groq, AMD, Google – everyone. Only Cerebras stands – and we smoked Blackwell.
35
56
486
143.5K
Mihir Patel @mvpatel2000
A great way to tell if an org has good ML eng is by backing out their MFU and checking if it's actually good when they brag about their training stack. Super useful to know 1) all the numbers (memorize hardware stats!) and 2) how to drive the math
Horace He @cHHillee

The fundamental question here (computing MFU) is a very reasonable question to ask in an interview (and I'd recommend learning it if you don't know how). However, the real interview question I would like to ask is this: "I see 3 assumptions in this question that range from somewhat misleading to kinda unusual to flat out wrong. What are they?"

1
5
75
9.8K
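
For readers unfamiliar with the MFU (model FLOPs utilization) math that the two posts above refer to, here is a minimal Python sketch. It assumes the common rule of thumb of roughly 6 FLOPs per parameter per training token for a dense transformer, which ignores attention FLOPs and other overheads. The function name, its arguments, and the example numbers are hypothetical and are not taken from either post or from any real hardware spec sheet.

```python
def estimate_mfu(n_params, tokens_per_second, n_accelerators, peak_flops_per_accelerator):
    """Rough MFU estimate for dense-transformer training.

    Assumption: training costs about 6 FLOPs per parameter per token
    (forward + backward), with attention FLOPs ignored.
    """
    if n_accelerators <= 0 or peak_flops_per_accelerator <= 0:
        raise ValueError("accelerator count and peak FLOP/s must be positive")
    # FLOP/s the model actually needs at the observed throughput.
    achieved_flops_per_second = 6.0 * n_params * tokens_per_second
    # FLOP/s the cluster could deliver at its advertised peak.
    peak_flops_per_second = n_accelerators * peak_flops_per_accelerator
    return achieved_flops_per_second / peak_flops_per_second


# Hypothetical example: a 7B-parameter model training at 50,000 tokens/s
# on 8 accelerators, each with an assumed peak of 1e15 FLOP/s.
example_mfu = estimate_mfu(
    n_params=7e9,
    tokens_per_second=50_000,
    n_accelerators=8,
    peak_flops_per_accelerator=1e15,
)
print(f"Estimated MFU: {example_mfu:.2%}")
```

To back out MFU from a public claim, plug in the reported parameter count, throughput, and accelerator count, and use the vendor's peak FLOP/s for the precision actually used in training.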