Ravikant Dewangan

3.2K posts

Ravikant Dewangan banner
Ravikant Dewangan

Ravikant Dewangan

@ronitkd

Founder @Autosterea_ | Full-stack SaaS dev (5k+ rev apps) | @Boeing Integration Lead 15+ yrs | MS S&C | @CrossFit L3 Trainer | @Persistence_Ath Owner.

Seattle, WA Katılım Ekim 2009
466 Takip Edilen211 Takipçiler
Sabitlenmiş Tweet
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
Last weekend I hosted Greg Glassman at my gym for a 2-day @MetFixByBSI seminar. What I learned broke my brain. I went down a rabbit hole I couldn't escape and when I finally came up for air, I felt like I'd been unplugged from the Matrix. Now I'm building a course to teach what I learned. 🧵👇
English
1
1
5
683
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@tunguz Old hardware running new models is the most underrated path. Pulled a 2018 box off the shelf last month, dropped qwen on it, now it handles overnight batch jobs while my main rig codes. Cheapest agent infra I have built.
English
0
0
0
8
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@mckbrando Cross-modal handoff is the unlock. Built a coaching ops agent last month that reads athlete PDFs, queries Postgres history, jumps to terminal to push programs, then back to browser to schedule. Pure click loops fall apart on anything real.
English
0
0
0
4
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@tszzl Token use tracks the ceiling of what you tried. I burn 10M a week now and ship 4x what I shipped a year ago. The metric is undercounted because most builders still write code like it is 2024.
English
0
0
1
24
roon
roon@tszzl·
token use gets too much hate as a metric - in times of technological transition peoples default will be to underuse and underestimate the new tech. “steam power used” would have been a good KPI for pre industrial civilization just as kardashev scaling remains for ours
English
26
9
130
4.8K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
The hollow position is the most skipped gymnastics fundamental in CrossFit. A 30-second active hold separates athletes who can kip safely from those who break. We screen for it at Persistence Athletics before greenlighting kipping work. Most pull-up shoulder issues trace back to weak hollow control.
English
0
0
0
16
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
Watching builders chase context length numbers. 1M, 2M, 10M tokens. The real bottleneck isn't context window size, it's retrieval relevance. 200K of useful context beats 2M of noise every time. Most evals never measure this.
English
0
0
0
17
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@comradCornelius @AdamKellyFA @vandy_62 S&C catches heat for injuries the position coach and training staff actually own. Pitching volume, mechanics, sleep, off-season programming, split across 4 to 5 people with no integrated plan. CrossFit drills aren't the problem. Lack of a single accountable owner is.
English
0
0
0
15
Comrade Cornelius
Comrade Cornelius@comradCornelius·
@AdamKellyFA @vandy_62 Why did so many pitchers get injured? Is it the S&C coaches job to mitigate that? Does Corbin have a say in who that coach is? Does that coach basically run them through CrossFit drills all year?
English
1
0
0
16
Vandy Sixty Two (62)
Tim Corbin has no one to blame but himself for missing the NCAA Tournament. It all came down to pitching: 12th in the SEC in ERA with a 5.23 2nd in doubles given up 2nd in triples given up Missed on portal additions Relied too much on freshman pitchers
English
7
0
23
4.5K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@bindureddy Long-running complex tasks is where eval discipline shows. Most builders never set baselines so they can't tell if a new model is better or just newer. I run the same agent loop with 3 different models per change. The wins are usually obvious only in retrospect.
English
0
0
0
131
Bindu Reddy
Bindu Reddy@bindureddy·
GPT 5.5 xHigh is exceptionally good at very long running complex tasks You can create extremely complex apps with a single prompt It's strangely underrated given how good the model is....
English
30
8
188
8.9K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@PeterDiamandis Running 3 to 5 agents in parallel for coding is my practical limit. Beyond that, context coordination overhead kills the gains. 1000 concurrent agents needs a managerial layer most builders don't have wired up yet.
English
0
0
0
42
Peter H. Diamandis, MD
Peter H. Diamandis, MD@PeterDiamandis·
Power users will soon want 1,000 concurrent agents. Engineers, architects, designers... all orchestrating swarms. Compare that demand to the current supply. It's peanuts.
English
83
31
478
21.8K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@npew Native settings UI is the unsexy boss fight for computer use. macOS hides toggles three submenus deep, names change every release. Two clicks vs seven is real IT helpdesk savings. The browser is the easy demo. The OS is where the value lives.
English
0
0
1
89
Peter Welinder
Peter Welinder@npew·
Using Codex Computer Use to navigate the Settings app is incredible. It knows MacOS so much better than me. Similar experience to it using the command line. Superhuman.
English
10
3
83
5.7K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@serenaa_ge DeepSWE is useful but still a snapshot. The signal that matters is what happens at run 87 of an unattended loop: read a stack trace, edit the file, rerun the test, six tools chained without supervision. Harness quality is the moat, not the score.
English
0
0
0
275
Serena Ge (Datacurve)
Serena Ge (Datacurve)@serenaa_ge·
Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks. On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.
Serena Ge (Datacurve) tweet media
English
337
487
4.1K
962.2K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@kimmonismus Ramp's enterprise data: Anthropic 34.4%, OpenAI 32.3%. Gemini does not register in serious paid-seat tracking. Default app swaps inflate the topline. Distribution is real, but it is one OS setting away from being someone else's.
English
0
0
0
35
Chubby♨️
Chubby♨️@kimmonismus·
I'm not sure if Google is winning the AI race. However, I think they're winning the AI distribution race, which is a different thing. 900M Gemini users is impressive on a slide. But a huge chunk of that is Android users who got a default app swap and Search users who got AI Overviews without opting in. But that doesnt mean its a bad thing. 9.7 trillion tokens/month two years ago. 480 trillion last year. 3.2 quadrillion now. That's a 7x jump in twelve months. To keep that going, Google plans to spend $190 billion on infrastructure this year. OpenAI has been trying to reach the 1b user milestone for some time now. For Google, on the other hand, it's a simpler game. Why? With billions of Android devices, and combined with Google and its AI mode, they have the ability to introduce everyone to AI, specifically Gemini, for free. How do they do it? TPUs! Google not only laid the foundation for modern LLMs with their 2017 paper "Attention is all you need," but also made a far-sighted decision back in 2012 to invest in TPUs - their own in-house chips that are particularly well-suited for machine learning tasks. Now in its eighth iteration, they even have two chips: one particularly good for inference, and one particularly good for training. This makes them more independent. Furthermore, they have a solid foundation that generates strong revenue and good profits, allowing them to subsidize AI usage for free, and without ads, unlike OpenAI (this is not a judgment, just a statement of fact). TPUs Therefore, Google has a very good chance of winning the game thanks to this outstanding starting position and free distribution. But to be fair: the game *is* far *from over*. However, the starting position is outstanding for Google. Image: The Economist article
Chubby♨️ tweet media
English
68
28
520
35.6K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
Two-stage VLM content scoring. Cheap model gates which frames the expensive one looks at. 12 cents per clip vs 90 cents running GPT-4o on every frame. Same accuracy on the cohort. Eval harness took longer to wire than the model. Keeps happening.
English
0
0
0
39
Harry Stebbings
Harry Stebbings@HarryStebbings·
I just interviewed a CEO who said three things that blew my mind: 1. We replaced our $600K Salesforce contract with a vibe-coded CRM, built within 3 weeks. 2. We will get rid of 80% of the SaaS we use internally. 3. If Anthropic doubled pricing, we would not change usage in any way.
English
337
63
927
383.7K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@emollick Builders are running their own n=1 experiments. The data lives in private commit graphs and Stripe dashboards. I ship about 3x what I did in early 2025 on EDSO and MetFix. Most of the signal never enters formal research channels.
English
0
0
0
63
Ethan Mollick
Ethan Mollick@emollick·
We have, as far as I can tell, no good tests of the productivity impact of the autonomous coding tools that appeared starting in December 2025. Every paper out there is from prior to the Claude Code/Codex revolution. A huge gap in our knowledge about what is happening in coding.
English
78
47
697
41.6K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@karrisaarinen Solo builders are absorbing it. Shipped EDSO and MetFix in months it would have taken me a year. Big orgs still carry the same approval, review, and meeting overhead, so the gains get eaten by the system around the code.
English
0
0
0
22
Karri Saarinen
Karri Saarinen@karrisaarinen·
We keep hearing about 10x or 100x productivity gains in engineering and knowledge work. But outside the model labs, I haven’t seen the corresponding 10-100x revenue growth across the market or increase in quality. So where is the productivity going?
English
423
105
2.6K
315.6K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@lennysan @danshipper Point 1 is real. Last six months I've been shipping EDSO and MetFix from inside Claude Code, not my IDE. The agent holding context across sessions is what changed the loop for me.
English
0
0
0
44
Lenny Rachitsky
Lenny Rachitsky@lennysan·
My biggest takeaways from @danshipper: 1. The future of work will happen inside Codex or Claude Code. Instead of putting AI into your SaaS tool, you’ll use your SaaS tools inside your favorite AI agents' in-app browser. Dan spends all his time in Codex now—writing documents, managing email, doing research, everything. He's using Google Docs, PostHog, and everything he needs within the agent's in-app browser. The agent can see what he’s doing, and has all of his context, so he and his agent collaborate quickly and super effectively. 2. Automation is a lie—every automation needs a human. Dan's company doubled in size this year despite being incredibly AI-forward. Why? Because in order to make automation work well, you need humans making sure everything keeps working. This is why benchmarks are misleading—they measure AI on problems we’ve already framed and can score, but there’s always a higher frame. 3. PMs will win the AI era. Marcus, a former PM who previously ran Axios’s writing product, joined Every after getting super AI-pilled. Now he runs their product Spiral, and ships faster than anyone on the team. He pairs technical knowledge with spiky product sense, deep user empathy, and an eye for what matters. Dan thinks any PM who gets really AI-native will be incredibly dangerous because the building is done for you—what matters is figuring out what to build and if it’s great. 4. Full-stack designers are becoming superheroes. Designers used to make beautiful interactions that engineers didn’t want to build or couldn’t execute properly. Now designers don’t need to hand things off; they can build it themselves. Designers are naturally creative people, and AI is the perfect tool for them because it lets them bring their vision to life without the traditional bottlenecks. 5. SaaS is not dead. In fact, Dan is bullish on SaaS stocks. When users bring their own AI (via Codex or Claude Code) to use SaaS products, the user—not the SaaS company—pays for tokens. This saves SaaS company’s margins. Since the agents need their own seats, Dan predicts that agents will create massive new demand for SaaS because there will be tons of agents using these products at high volume. 6. Every company will have one “super-agent” inside their Slack that every employee will use. Dan initially thought every employee would have their personal work agent, like a shadow AI org chart, but he’s completely flipped his view. He realized agents need humans who care about them. When someone gets tired of maintaining their personal agent, it becomes useless. The winning model is one forward-deployed engineer or AI-savvy person who maintains a company-wide agent (like Shopify’s River or Viktor), and then it trickles down to more specialized team agents as models improve and become less fiddly. 7. The AI job apocalypse is not happening, but you do need to evolve to stay relevant. Models make yesterday’s human competence cheap. But because everyone uses the same models, it all looks the same if you use it the default way; it becomes commoditized slop. Humans then take that frozen competence and use it to make something new and interesting for their specific situation. The key: “ride the models”—use them for everything you do, try new models when they drop, keep turning over rocks. 8. We will read way more AI-generated writing, and we will like it. Human writing is incredibly important for things that matter, but for internal docs, planning, and email, AI-generated is often better because most people are bad at writing strategy documents. 9. Build software for humans and agents to use together. The current model is building a CLI that an agent uses independently. Instead, you and your agent should be using the app together. This creates new design challenges—agents can make a billion requests in three seconds, so you need approval flows, inboxes that summarize what happened, logs, and easy rollback. 10. Forward-deployed engineers are the new most essential role. The big model companies have teams of people managing their internal agents, and those teams aren’t going away. It’s different from traditional software building, and certain engineers love it. As models get better, this role will evolve—you’ll be managing more agents doing more things.
Lenny Rachitsky@lennysan

Automation is a lie. CLIs are over. The SaaSpocalypse is dumb. A year ago @danshipper came on the podcast to predict where AI was heading. He was remarkably right—including the call that everyone was sleeping on Claude Code. Dan has a unique lens into where things are going because his team at @every is possibly the most AI-pilled group of people in tech. I always learn a ton talking to Dan. So I brought him back for round two. We'll score these in exactly a year: 🔸 Every company will have one “super-agent” in Slack. 🔸 Codex and Claude Code will become the new operating system for knowledge work. 🔸 The AI job apocalypse is not happening. 🔸 PMs and designers will thrive. 🔸 We will read way more AI-generated writing and we will like it. 🔸 "I would buy SaaS stocks right now." Listen now 👇 youtube.com/watch?v=4D3hDm…

English
144
234
2K
728.4K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
Memorial Day Murph at Persistence today. 4 of 12 athletes went unbroken on pull-ups, and all four run our 3x weekly farmer carries. Carries are the most underprogrammed accessory in most CrossFit gyms.
English
0
0
0
28
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@yunta_tsai This is exactly how I run agent work now. Last week Claude Code spawned three sub-agents in parallel to audit token counts in my Cowork plugin. Cut the review from 25 minutes to 6. The orchestrator pattern is the bottleneck killer.
English
0
0
0
237
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@CrossFit Forward rolls into wall walks is the trap. Heart rate spikes from the rolls, then you need shoulder stability inverted. Going to test it Tuesday on six athletes at Persistence and see how the order holds up.
English
0
0
0
74
CrossFit
CrossFit@CrossFit·
Workout of the Day Tuesday 260526 Adrian 7 rounds for time of: 3 forward rolls 5 wall walks 7 toes-to-bars 9 box jumps ♀ 24-inch box ♂ 30-inch box Compare to 120923. Post time to comments. 📍CrossFit Drummond in Quebec, Canada #CrossFit #WorkoutoftheDay
CrossFit tweet media
English
4
9
47
7.3K
Ravikant Dewangan
Ravikant Dewangan@ronitkd·
@scion_x_ The agent loop test is what matters. Claude Code rewrites a failing pytest, re-runs, and ships in one turn for me. Curious how Grok Build handles that same read-error-fix-rerun cycle.
English
0
0
1
165
Laurent
Laurent@scion_x_·
Just spent a full coding session with Grok Build and honestly? It's right there with Claude. The model is sharp, the agentic flow holds up on complex tasks, and it has actual personality. Few rough edges on the UX side (sent detailed feedback to the team), but the core is genuinely impressive. @xai cooked 🔥
Laurent tweet media
English
268
1K
2.9K
837.1K