Krishna
15.3K posts

Krishna
@ntkris
Co-founder @with_ako: building the AI teammate that just gets it and completes tasks. Ex-@deliveroo.
London Katılım Mart 2009
1K Takip Edilen3.8K Takipçiler

The other day the AI Agent for a leading AI startup told us we had to pay 4x more than we had been paying to keep using it
It tuns out we didn't have to -- but the AI Agent hadn't been properly trained on its own product
You have to, have to, have to train all your own AI Agents too
Or out comes garbage
English

POV: You didn't give up.
For most of AgentMail's life, growth was flat. No one understood it but we kept pushing.. until one person did. The next 10 followed. Then, the rest.
Keep going, make that bet on your vision for the future.
Make something agents (should) want.

Garry Tan@garrytan
Sometimes startups feel like: Make people want something that they should want.
English

@ivanburazin At this point, I can’t think of why you would hamstring an agent by giving it just a code execution box. Maybe if the scope was super limited or something.
English

Even 6 months ago, the market was generally skeptical of sandboxes as a primitive.
For most workloads, you could just use code exec boxes as they're cheap and fast.
But they were designed for code that behaved predictably with expected inputs/outputs, and were stateless by default.
When autonomous agents arrived, they needed environments to write, preview, and run code + full computer access to browse and execute tasks.
Each task required a persistent, isolated environment that could spin up in milliseconds and run for hours.
Now multiply that by thousands of parallel agents serving millions of daily active users.
With compute becoming infinitely forkable, sandboxes are now the substrate for the agentic economy.
English

@thorstenball I don’t think all of it goes away. Some of these markdown files are general and others are specific to a company (like an SOP). The latter stays
English

@amrishrau Slightly different perspective. PMs can now become executors, meaning actually get the work done themselves. More that technical, I think becoming great at reviewing AI’s output and systems (like evals) to improve the AI system will be extremely valuable.
English

@TaraViswanathan I think this intentional as they are going after businesses?
English

This won’t be OpenClaw until they give it a personality.
I don’t feel a connection to “Cowork”.
But I am obsessed with Kevin, my OpenClaw minion, who runs around doing all kinds of things for me while saying hilarious things in group chats with friends and making me laugh pretty much every day.
Felix Rieseberg@felixrieseberg
We're shipping a new feature in Claude Cowork as a research preview that I'm excited about: Dispatch! One persistent conversation with Claude that runs on your computer. Message it from your phone. Come back to finished work. To try it out, download Claude Desktop, then pair your phone.
English

@petergostev Exhausting. Reviewing AI output to improve quality is absolutely required but also exhausting
English

@pestctrlguy Depends on whether you’ve finished building your thing! I think the bigger cost is always going to be maintaining the internally built CRM
English

@boringmarketer Would love to have you use our product @with_ako for your service if you’re open to it. I’m using it internally for our own GTM so I can share why it works well
English

i'm going to take on one pilot customer for a new managed ai "go to market" service
yes AI, agents, and automation is deeply involved, and myself/team are the humans in the loop who will turn the dials and interface with your team
ideal fit is a b2b company (software or service) that is doing at least $2m/year, you want to leverage ai for marketing and you know you could be moving faster at a higher quality
hit me with a dm if this sounds interesting and I can share more details
English
Krishna retweetledi

@thdxr We did some preliminary code mode evals in a JS sandbox when we wrote our SQL vs. Bash eval, and JS was about 2.5x more token efficient than bash to achieve the same result.
English

You're all sleeping on Haiku.
Haiku 4.5 is a new beast - it isn't the 'glorified autocomplete' we've come to think of it as - it's a capable, strong agentic model that's crazy fast and dirt cheap.
Here's Haiku doing deep research with exa, generating reports, making infographics, integrating those infographics into the report, all around five cents, in under two minutes.
Flash is a strong contender but it's more expensive, slower, cannot do interleaved toolcalls - but it does have the 1m context.
Once you have a good harness - like this internal chat agent we use, or hankweave - the model matters less than it did before. You only need Opus when you're surrounded by twenty confusing skills that turn on at random times, with confusing compressed context.
Use haiku more - it'll highlight some fixable problems. Most hanks (AI Programs) on hankweave move towards sonnet from opus and then to haiku as they mature - which means faster and cheaper executions overall, and more breathing room for experimentation.
English

@antinertia I have a feeling AI changes these numbers dramatically. Channels get flooded, CAC becomes very high etc
English

you all see companies going from $0 → $100m in months…
but here’s the real math you should actually care about
for plg saas / prosumer:
- ltv:cac = 3:1 to 5:1
lower = not efficient
higher = underinvesting
- cac payback < 6–9 months
how fast you get your money back after acquiring a user
- nrr > 100%
how much your existing customers grow your revenue (after churn)
these are a few of the key metrics you need to track as a founder, and what your investors will ask for
others include:
- expansion revenue drivers
what makes users pay more over time (credits top-ups, etc.)
- one-and-done rate
% of users who use once and never come back (this is very high in ai right now…)
this is different for sales-led products (e.g. enterprise)
just wanted to write this because twitter feels like an echo chamber, and no one talks about the real math behind those tech companies
English

@ivanburazin 100%. Think there’s a big diff between standards / processes (slack to communicate) and CRUD interfaces. Embedding into existing processes helps with adoption
English

Agents won't vibe-code a new Slack every time they need to communicate.
They'll use the same Slack because the other agent's team also uses Slack.
Traditional SaaS will survive because standards don't get disrupted by probabilistic code generation.
Network effects and standardization still matter.
English

@jessethanley Good way to put it. Do you feel this is particularly acute given the use case Bento serves?
English

I try to avoid being a doomer, but I do feel immense pressure to have my product conform to the new way of working in a way I haven’t felt before.
Organic traffic is no longer a distribution strategy, customers don’t want to login anymore, you need to have a CLI/MCP or customers will go elsewhere, our AI features need to be updated weekly and one-shot the most random requests…
Your company is at risk if you don’t move fast enough.
I’m embracing it but I understand why people are anxious.
It’s never ending.
English






