Chinmay Nivargi

819 posts

Chinmay Nivargi

@chinmayn

Building AI agents at Intuit. Previously led evals for Rufus at Amazon. ex AWS and Applied Materials. PhD @Stanford, @IITB .

San Francisco Bay Area Katılım Kasım 2008

1.7K Takip Edilen379 Takipçiler

Chinmay Nivargi@chinmayn·27 Nis

Find myself preferring Opus 4.6 over 4.7 for PM artifacts. 4.7 feels like a smarter ChatGPT o1 or o3 - very smart, but not really readable. Anyone else have the same experience?

English

Chinmay Nivargi@chinmayn·4 Nis

Of course I had to try to create the Mt. Rushmore of tech companies discussed by @benthompson and @gruber. Nano Banana and ChatGPT were quite good - and Grok and Meta AI came nowhere close. Same prompt for each. Guess which is which.

English

Chinmay Nivargi@chinmayn·13 Mar

@andrewchen A coworker has agency not only to break a goal down into steps to achieve them, but also to set them in the first place. Agents are getting good at memory and integration, but not yet on true agency. Agents working together, challenging each other are paths to RL our way there...

English

andrew chen@andrewchen·12 Mar

ai is shifting from “chatbot UX” to “coworker UX.” it’s not just about having smartest model, they’ll have the best handoff between human judgment and autonomous execution I think this is what we see in explosive hype around openclaw. What's magical about it is that it can be proactive, it can self-improve, it can link into your accounts so that it’s triggered. These are the things you need to be a coworker, chief of staff, colleague, etc rather than just something that is a smarter google search. What people underestimate is that the interface paradigm itself is changing: - Chatbots assume every task begins with a prompt - Coworkers don’t wait for prompts They watch the environment, notice patterns, and surface things before you ask. The best AI systems will feel less like “ask me anything” and more like “I’m already working on it.” that requires three things that chatbot systems historically lack: memory, agency, and integration. Memory so the system understands your projects and preferences over long running periods of time. agency so it can break goals into steps and execute them. integration so it can touch real systems, like email, docs, repos, finances, calendars, APIs. Once those exist together, the model stops being a tool and becomes a participant in the workflow. we are sooooo close to having all this, but not yet... the claws show a glimmer of the future. So the question is, which agentic systems will know what you’re trying to do? Which ones can take partial direction and move the ball forward? Which ones learn your style and anticipate the next step? Feels like we're almost there, and likely to figure this out in 2026. am very very pumped this is about to happen.

English

311

31.5K

Chinmay Nivargi@chinmayn·26 Şub

Now I know how @Spotify characterizes my music taste 😅

English

Chinmay Nivargi@chinmayn·10 Şub

@lennysan @amankhan and @talraviv have reinvented the technical tutorial here - and Cursor seems to be a perfect substrate for it. This is way more approachable in the age of coding agents, and improves upon Jupyter notebooks. Although, maybe we are trading off some comprehension for personalization and vibes!

Lenny Rachitsky@lennysan

Texts from friends this morning: - "I won't be surprised if this goes viral because it's the most approachable content I've seen yet for people to get started with AI." - "I finally have a place to point people to when they ask me 'How do I get started with AI?'" - "By far the most useful how-to I’ve seen yet for people to get started with Cursor." Most AI content is designed to induce FOMO and make you feel behind, not to actually teach you anything. Today's post is the opposite. @talraviv and @amankhan spent 100+ hours building an interactive experience that teaches you the most essential AI concepts—from inside @cursor_ai itself. I’ve never seen anything like this before and I’m excited to bring it to you. If you've ever nodded along when someone says "context engineering" or "RAG" in a meeting—while hoping no one asks you to use them in a sentence—this post is for you. Don't miss this one: lennysnewsletter.com/p/how-to-build…

English

Chinmay Nivargi@chinmayn·4 Şub

@matt_elevenlabs 11B

Matthieu ❙❙ ElevenLabs@matt_elevenlabs·4 Şub

We just raised $500M at an 11B valuation 🎉 To celebrate, we’re giving away 1,000 free credits so you can test our platform. For the next 6 hours, comment “11B” below and we’ll DM you the credits (must follow) 👇

ElevenLabs@ElevenLabs

We raised $500M at an $11B valuation to transform how people interact with technology.

English

16.3K

742

13.3K

1.9M

Chinmay Nivargi@chinmayn·4 Şub

A retcon that fixes everything! You’re welcome.

English

Chinmay Nivargi@chinmayn·4 Oca

@TrungTPhan @RampCapitalLLC Perhaps it’s possible to set up an AI workflow that asks Grok daily and populates this list

English

Trung Phan@TrungTPhan·4 Oca

@RampCapitalLLC last few years stopped by third week of jan, need to finish the spreadsheet this year!

English

3.5K

Trung Phan@TrungTPhan·3 Oca

started the X Main Character tracker for 2026

English

112

239

7.7K

203K

Chinmay Nivargi@chinmayn·20 Ara

@pitdesi Sad ending - it looked so good!

English

Sheel Mohnot@pitdesi·19 Ara

I really wanted to love this as a minivan enthusiast and cute vehicle lover, but nobody wants a van with less than 300 miles of range All of the bad EVs (most of them) are getting flushed out of the system with the end of the EV tax benefit

Sawyer Merritt@SawyerMerritt

NEWS: Volkswagen has announced that it is cancelling the all-electric ID. Buzz in the U.S. after just a year on the market. “Following a careful assessment of current EV market conditions, we have made the strategic decision not to move forward with model year 2026 ID. Buzz production for the U.S. market.” The ID. Buzz started at $60k with a 234 mile range and a 91 kWh battery.

English

274

32.6K

Chinmay Nivargi@chinmayn·18 Ara

@devangbaheti @patrickc @stripe I didn’t read it as loans rejected for business that accepted them. It is loans not offered at all, which may not have as much of an impact on affinity and volumes.

English

Patrick Collison@patrickc·18 Ara

I'm very excited that we now have results from an almost two-year randomized controlled trial that we ran across thousands of businesses on Stripe: those that accepted a loan from @Stripe Capital grew annual revenue around 27% faster. (Two years isn't that long in the scheme of things, but it is when you're waiting for the results of an experiment you're interested in.) We had two kinds of controls: businesses to whom we offered loans but didn't accept them, and a holdout group of businesses to whom we randomly did not offer loans but which were otherwise identical to those to which we did. As such, we feel confident in the causal nature of this conclusion. While this might not sound like news ("capital increases growth"), I think the finding is a good reminder that many businesses are still quite capital-starved (this effect is on top of all of the other sources of capital that businesses have access to), and it is consistent with what we hear directly from businesses in surveys. Beyond Stripe, inefficient capital allocation at economy-wide scale is likely a major bottleneck to growth around the world. We have an ambitious roadmap planned and we're very much looking forward to expanding worldwide access to growth capital.

English

112

1.4K

228.4K

Chinmay Nivargi@chinmayn·23 Eki

@carlvellotti @shobhitchugh CC

Chinmay Nivargi retweetledi

Carl Vellotti 🥞@carlvellotti·21 Eki

Every PM should be using Claude Code. So I built a HUGE course for you to learn Claude Code... IN Claude Code! 🔹 Complete guide 🔹 Make PRDs, analyze data, create decks Soon, I'll sell it for $149. For the next 24h: FREE! Follow + RT + comment "CC" & I'll DM it.

English

2.2K

1.5K

2.4K

460.9K

Chinmay Nivargi@chinmayn·8 Eyl

@lulumeservey For those wondering- benning.army.mil/infantry/199th…

English

Lulu Cheng Meservey@lulumeservey·8 Eyl

Everyone working at a startup should read A Message to Garcia

English

390

88.3K

Chinmay Nivargi@chinmayn·3 Eyl

@ValerieTetu A/B testing will still continue to answer the question - On average, did the dynamically adapted, personalized feature lead to better conversion / engagement / revenue? And by how much?

English

Valerie Tetu@ValerieTetu·2 Ağu

I question the future utility of A/B testing for products and websites. Why would we optimise for the "best average for all" in a world in which interfaces, content, and features can dynamically adapt to each individual user?

English

Chinmay Nivargi@chinmayn·3 Eyl

@ankrgyl IMO Evals ≠ Experiments. They serve different layers. Pre-launch evals = input metrics → ensure the AI feature itself is robust. Experimentation = output metrics → measure actual customer impact. Evals test the model in isolation; A/B tests test the whole system in the wild.

English

1.2K

Ankur Goyal@ankrgyl·3 Eyl

This is the exact opposite conclusion to draw from the acquisition. AI enables you to build much more dynamic products that evolve faster (and even automatically). The foundation underlying that is good evals. A/B tests are officially the way of the past.

ben hylak@benhylak

wow, openai just bought @statsig evals are dead. a/b tests are the future of building AI products.

English

174

63.4K

Chinmay Nivargi@chinmayn·20 Ağu

@thomyorke channeling his inner @bjork here open.spotify.com/track/4cug7RkM…

English

Chinmay Nivargi@chinmayn·6 Ağu

@sariazout For me the key is to actively inject a lot of my own thoughts in the prompts and followups vs. giving into the vibes. It still may be a distraction - but I find that LLMs help me get over the hump of creating the mental space and focus to sit with my own thoughts.

English

119

sari azout@sariazout·6 Ağu

I’ve been spending 4+ hrs a day talking to LLMs and I love it, it feels fun. But I’ve been having this feeling lately that it may just be another distraction and maybe if I just sat with my own thoughts longer, I’d get somewhere better AND faster. Not sure. Curious if others feel this too.

English

159

12.2K

Chinmay Nivargi@chinmayn·28 Tem

@jspujji 🚀

QME

Jesse Pujji@jspujji·28 Tem

The fastest way to waste a decade is to optimize for goals you never chose. Every week, I meet operators smashing revenue targets yet privately admitting they have no personal North Star. That mis-alignment bleeds into hiring, roadmap…even family time. My fix is a one-page Dream Future State prompt. 15 mins with it drags your ACTUAL ambitions into daylight. Comment “🚀” and I’ll DM it.

English

Keşfet

@benthompson @gruber @andrewchen @Spotify @lennysan @amankhan @talraviv @matt_elevenlabs