Venkat Raman — inference/acc

2.6K posts

@venkat_systems

distributed systems, low latency, inference | 🦀 | hobbies: ⛷️ 🏊🏽‍♂️ 📷

µs, ns, 80% speed-of-light

Joined January 2013

1.9K Following · 546 Followers

Venkat Raman — inference/acc@venkat_systems·9h

@cognition shouldn’t kevin be managing devins?

Cognition@cognition·10h

Devin can now manage a team of Devins. Devin will break down large tasks and delegate them to parallel Devins that each run in their own VM. Over time, Devin gets better at breaking down and managing tasks for your codebase. Available now for all users.

Venkat Raman — inference/acc@venkat_systems·9h

@jasonlk @camillenvargas i like no bs honest vcs ! i would too !

Jason ✨👾SaaStr.Ai✨ Lemkin@jasonlk·10h

@venkat_systems @camillenvargas If I had a billion dollar position? Absolutely

camille@camillenvargas·23h

I don’t understand how they’re not embarrassed

Tarek Mansour@mansourtarek_

This is how Kalshi Q1 board meeting ended. cc @Alfred_Lin @matthuang @aleximm @luanalopeslara @_charlienoyes @abhishekm1636

Venkat Raman — inference/acc@venkat_systems·10h

@jasonlk @camillenvargas would you?

Jason ✨👾SaaStr.Ai✨ Lemkin@jasonlk·11h

@camillenvargas If you have a billion dollar position in a startup you can take some embarrassment

Venkat Raman — inference/acc@venkat_systems·10h

@himanshustwts @SkyLi0n are they eligible for claude-code oss credits?

himanshu@himanshustwts·13h

last commit btw LMFAO

OpenAI Newsroom@OpenAINewsroom

We've reached an agreement to acquire Astral. After we close, OpenAI plans for @astral_sh to join our Codex team, with a continued focus on building great tools and advancing the shared mission of making developers more productive. openai.com/index/openai-t…

Venkat Raman — inference/acc@venkat_systems·12h

@kochyhere @jorandirkgreef @TigerBeetleDB whoa ! don’t tell me TigerBeetle is also puffn w/ object storage now ! @Sirupsen should share cool puffer swag tips for beetle 😉

Koustav Chowdhury@kochyhere·14h

.@TigerBeetleDB's contribution to my growth as an engineer is understated. what a gem of a video - youtube.com/watch?v=y2_Bqk…

Venkat Raman — inference/acc@venkat_systems·12h

@jarredsumner how hard did u guys try to get them ? 😉

Jarred Sumner@jarredsumner·12h

Congrats!!

Charlie Marsh@charliermarsh

We've entered into an agreement to join OpenAI as part of the Codex team. I'm incredibly proud of the work we've done so far, incredibly grateful to everyone that's supported us, and incredibly excited to keep building tools that make programming feel different.

Venkat Raman — inference/acc@venkat_systems·12h

i never understood the hype behind gemini 3 n code-red at openai bcos of it
gemini does coherent image gen well, that’s all
coding using their agent, n reliable web search w/o outrageous hallucination is an absolute shit show.
youtube transaction used to be good but that is also restricted
total 🤡 show

kache@yacineMTB·14h

it is laughable how bad gemini is

Venkat Raman — inference/acc@venkat_systems·12h

@reach_vb @jxnlco openai & codex comeback must be studied team is on 🔥

Vaibhav (VB) Srivastav@reach_vb·14h

LETS GOOOO! So so psyched for this!! OpenAI really securing the future of Open Source! 🔥

OpenAI Newsroom@OpenAINewsroom

Venkat Raman — inference/acc@venkat_systems·13h

@OpenAINewsroom @youyuxi @astral_sh Congrats @charliermarsh & @astral_sh team ! was expecting this 🚀🚀

OpenAI Newsroom@OpenAINewsroom·14h

Venkat Raman — inference/acc@venkat_systems·14h

@arpit_bhayani Congrats Arpit ! I was hoping you’ll get into early stage startups again. Pls check DM :)

Arpit Bhayani@arpit_bhayani·1d

Joined Razorpay as Principal Engineer II :) From being a long-time customer to now building parts of the system - it's a full circle. Fintech is a new territory for me - time to get under the hood of how money actually moves. New domain, same guarantees - availability, correctness, performance - just with real money on the line.

Venkat Raman — inference/acc@venkat_systems·16h

oh no ! @Replit @Lovable n @emergentlabs are so cooked now

Scott Stevenson@scottastevenson

Google disrupting Figma is unexpected

Venkat Raman — inference/acc@venkat_systems·18h

@sundeep
- is groq static scheduling going to be part of oss dynamo ?
- is there going to be groq-cuda libs n sdk
would love to make some oss contributions
i’ve been grinding thread-per-core, share nothing, message passing libs for extreme low latency & high throughput systems

sunny madra@sundeep·21h

Today you need access to an AI factory: 7 chips 5 distinct purpose built rack scale systems.

Venkat Raman — inference/acc@venkat_systems·18h

@dylan522p is it okay to say out loud - i felt secondhand pride for you n the team 🚀

Dylan Patel@dylan522p·22h

Jensen name-dropped me in the keynote and posed with our belt. He has a physical belt too but they just showed the pic
Initially I made fun of the 35X perf improvement being bogus, I thought it was an exaggeration of performance
Turns out he was sandbagging, and perf is 50x

Venkat Raman — inference/acc@venkat_systems·1d

@garrytan workday is just a glorified database n excalidraw charts

Garry Tan@garrytan·1d

Recent earnings call, Aneel Bhusri of Workday says startups with AI agents are "parasites" This is what system of record incumbents really think of startups. The war is just beginning. The facts: the user data belongs to the users, not the incumbent software vendor.

Venkat Raman — inference/acc@venkat_systems·1d

@levelsio if they never sold, there might not be a tsmc as we know today

@levelsio@levelsio·1d

The biggest fumble in business ever might be Philips spinning off ASML, TSMC and NXP
Philips co-founded ASML in 1984, then co-founded TSMC in 1987, then they founded NXP
They sold each of them for short term profits in the 2000s
ASML is now worth $545B
TSMC is worth $1.76T
NXP is worth $50B
Philips today is worth just $27B
If they'd never sold, Philips would be the largest company in the EU today, worth $650B
Philips CEO Cor Boonstra called it "making money with the success of the past" 🤡

Venkat Raman — inference/acc reposted

Peter Steinberger 🦞@steipete·1d

@dantelex No, it's again crypto folks spamming and hurting the project.

Venkat Raman — inference/acc@venkat_systems·1d

@levelsio u mean like foundations n NGOs.. most of the money is spent on operations, expenses n ppls salaries than the actual beneficiaries ?

@levelsio@levelsio·1d

/r/mildlyinteresting
In Portugal you pay up to €7.50 when you buy a laptop called a "copyright levy"
You pay €4/TB of storage in the computer, so for a MacBook Neo 13" with 512GB that's €2.05
It's regulation made in 1998 to compensate artists for you illegally sharing MP3 files which nowadays of course doesn't make sense anymore since we have Spotify and YouTube
Much of the money doesn't even arrive with artists btw, 30% is taken by the organization collecting the tax and lot of it remains unclaimed and some of that goes again to the organization collecting the tax as "operational costs" 🤡

Venkat Raman — inference/acc@venkat_systems·1d

@HotAisle @ssskryl @thegeomaster @insane_analyst i’ve too tried and failed to find TCO of Groq, Cerebras n SambaNova very difficult to calculate tokens/watt & sustained goodput per tco for a given model serving config

Hot Aisle@HotAisle·2d

@ssskryl @thegeomaster @insane_analyst You're trying to polish two turds. One no longer exists in its current form and the other is a dead man walking. This will be the fate of many ASICs over the years.

Hot Aisle@HotAisle·2d

This guy is a master manipulator. 20 wafers is $3m*20… or more… at what concurrency? For $60m… I can buy a cluster of approximately 768 mi355x and have 288GB*768 amount of memory and also use less DC space and power and serve who knows how many users… too lazy to do the math. He is clearly afraid and disappointed at not getting the $20b deal from N.

Andrew Feldman@andrewdfeldman

NVIDIA's biggest GTC announcement was a $20 billion bet on the same problem we solved 6 years ago.
Their next-gen inference chip - not available yet - has 140x less memory bandwidth than @cerebras.
To run a single 2 trillion parameter model, you need 2,000+ Groq chips. On Cerebras, that's just over 20 wafers.
Even paired with GPUs, Groq maxes out at ~1,000 tokens per second. We run at thousands of tokens per second today. And every day. In production now.
Why? When you connect 2,000 chips together, every interconnect has latency. Every cable has overhead. It doesn't matter what your memory bandwidth is on paper if you're bottlenecked by the wiring between thousands of tiny chips.
We solved this with wafer scale. One integrated system. Little interconnect tax.
Jensen told the world that fast inference is where the value is. He’s right - it’s why the world’s leading AI companies and hyperscalers are choosing Cerebras.
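
The arithmetic Hot Aisle leaves undone ("too lazy to do the math") can be checked with a quick sketch. It uses only the numbers quoted in that post; the $3M per-wafer price is the post's own estimate, not a confirmed figure, and the function names are made up for illustration.

```python
# Back-of-the-envelope check of the figures quoted in the Hot Aisle post.
# All inputs come from the post itself; the per-wafer price is the poster's
# estimate ("$3m*20... or more"), not a published price.
WAFER_PRICE_USD = 3_000_000   # assumed per-wafer price from the post
WAFER_COUNT = 20              # "20 wafers"
GPU_COUNT = 768               # "approximately 768 mi355x"
GPU_MEMORY_GB = 288           # "288GB" per GPU, as stated in the post


def wafer_cost_usd(count: int = WAFER_COUNT, price: int = WAFER_PRICE_USD) -> int:
    """Total cost of the wafers at the quoted unit price."""
    return count * price


def cluster_memory_gb(gpus: int = GPU_COUNT, mem_gb: int = GPU_MEMORY_GB) -> int:
    """Aggregate memory of the GPU cluster, in GB."""
    return gpus * mem_gb


if __name__ == "__main__":
    print(f"wafer cost: ${wafer_cost_usd():,}")
    total_gb = cluster_memory_gb()
    print(f"cluster memory: {total_gb:,} GB (~{total_gb / 1000:.0f} TB)")
```

This only reproduces the totals implied by the post ($60M and roughly 221 TB); it says nothing about concurrency, power, or tokens per second, which the post also leaves open.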

Venkat Raman — inference/acc@venkat_systems·2d

@msharmavikram @marksaroufim @GPU_MODE @NVIDIAGTC the legend in the flesh, Stephen Jones ! Vikram u should get him on twitter :) i binged all his gtc sessions in 2024, truly life changing
