Herald

562 posts

Herald

@Herald_Dev

Live in minutes, Herald investigates novel incidents without runbooks, runs securely from a CLI and is completely free. Formerly RunLLM. Try now: https://t.co/KY0CFypldv

San Mateo, CA Katılım Nisan 2022

14 Takip Edilen809 Takipçiler

Herald@Herald_Dev·9h

Herald has been named to the The InfraRed 100, recognizing the top private companies defining the future of cloud infrastructure. Thank you to the @Redpoint team for the recognition. Want to try Herald? We now offer Herald CLI — completely free, full-featured, and up-and-running securely from your terminal in minutes. Join the waitlist: lnkd.in/ge37zaaU See the full list of 2026 InfraRed 100 honorees and accompanying industry report: redpoint.com/infrared/repor…

English

Herald@Herald_Dev·9h

RunLLM is now Herald. There's a story behind it, and it starts with a moment every engineer knows. It's 3am, an alert fired, and you're looking at something you've never seen before, a novel incident. Your runbooks don't cover it. Your tools tell you something is wrong, but none can say why or how, or what to do about it. For decades, that's been the deal. Something breaks, you fix it fast. Companies are pretty good at it. But it comes with real costs — alert fatigue, engineer burnout, and the moments when customers tell you something is down before you know it yourself. So we asked a different question: what if you didn't have to wait for something to break? What if your systems could tell you what's about to go wrong, before alerts fire, before customers notice? To herald something is to signal that it's about to happen. And that's the shift we're bringing to observability and reliability: from t₊₁ to t₋₁, where t is the moment something breaks. To deliver on this promise, we're offering Herald CLI — a full-featured, completely free agent that runs securely on your laptop and gets up and running in minutes. Try it on your own stack to see how Herald moves you from always being behind problems to getting ahead of them. 👉 Sign up for Herald CLI early access here: tinyurl.com/2fyp4wnj 👉 Read more about the Herald brand from our CEO @vsreekanti: tinyurl.com/mvk7mnsh

English

Herald retweetledi

Vikram Sreekanti@vsreekanti·12h

Lots of exciting news to share today! 1. @RunLLM is now @Herald_Dev. The new name reflects the fact that our AI SRE is the only product on the market that operates autonomously — teaching itself about your product & infra, detecting early warning signs of incidents, and investigating without runbooks. Read more: herald.dev/blog/heralding… 2. Herald was named to the InfraRed 100, an annual list recognizing the most promising private companies defining the future of cloud infrastructure. Thanks to Redpoint for the recognition! 3. We're releasing the beta of the Herald CLI — an agent that runs securely on your laptop and gets up and running in minutes. Sign up for early access here: herald.dev/cli

Redpoint@Redpoint

The Redpoint InfraRed 100 is now live. These are the companies building the infrastructure that powers everything happening in AI right now, from world models and agent runtimes to the sandboxes, databases, and security tools agents depend on. Congratulations to this year's honorees! Read the full 2026 InfraRed Report: our state of the union on AI and cloud infrastructure 👉 redpoint.com/reports/the-in…

English

17.2K

Herald retweetledi

Chenggang Wu@cgwu0530·20 May

Why do on-call engineers often ignore expensive AI tools during incidents? I wrote about what's broken and what it takes to fix it: tinyurl.com/y9e3k93n

English

200

Herald retweetledi

Vikram Sreekanti@vsreekanti·14 May

Finding product-market fit has always been the holy grail for every startup. In AI, it might not be the "we've made it" moment it once was. The traditional advice once you find PMF is to operationalize. Codify the ICP. Build the playbooks. Deepen the product. The point is consistency — $N in, $M out. In AI, consistency is a liability. Customer preferences are being rebuilt every week. The demo they saw last night is the new benchmark. If that signal takes three weeks to travel from a sales call back to a roadmap decision, you're already behind. The companies that win aren't going to be the ones that find PMF first. They're going to be the ones that keep replacing their own product while the market is still figuring itself out: open.substack.com/pub/frontierai…

English

389

Herald retweetledi

Vikram Sreekanti@vsreekanti·8 May

If customers had been willing to write us $250K checks on day one, we would have built the wrong product. With RunLLM, we set out to build the same AI SRE agent everyone else was building: an RCA agent triggered by alerts, driven by customer-maintained runbooks. It was the obvious answer. Humans use runbooks, so the agent should too. Except alert thresholds are noisy. Nobody actually maintains their runbooks. And the agent inherits every gap. We didn't figure that out because we were smarter than anyone else. We figured it out because the market gave us time. Enterprise SRE buyers don't move fast. They have committees. They want weeks to evaluate. They ask hard questions about what happens when something breaks at 3am. That slowness is put us on the right track. In a fast market, the competitive pressure forces you to ship the obvious solution and iterate from there. You don't get time to ask whether you're solving the right problem — you just have to start solving something. In a slow market, you're forced to keep asking. And for hard problems, the obvious solution is rarely the right one. The interesting question in AI SRE isn't "how do we automate the runbook." It's "how do we detect early warning signs, validate them, and find root cause before any threshold alert fires?" We didn't get to that question by moving fast. We got to it because the market wouldn't let us. I see a lot of founders right now benchmarking themselves against Cursor's growth curve and feeling like something is wrong. For most infrastructure problems worth solving, that curve was never going to apply. And the slowness you're frustrated by is probably the thing that's going to make your product impossible to copy in three years. Friction is information. Don't optimize it away too early: open.substack.com/pub/frontierai…

English

539

Herald retweetledi

Vikram Sreekanti@vsreekanti·30 Nis

We spent $63 on a single investigation last month. That number stuck with me, because it's the cleanest illustration I've seen of where AI economics are actually heading. Per-token costs are plateauing. But per-request token consumption is going up — fast. Every time we add another LLM call to pre-read data, rerank results, or evaluate relevance, the bill goes up. And we keep adding them, because that's how you actually get good answers. The honest truth: we have a dozen more places we'd love to throw an LLM at the problem. We're held back by cost, latency, and evals — not by ideas. Most teams are reaching for fine-tuning or RL to fix this. I'd push back. The hard part of post-training isn't the algorithm. It's having the right data in the right shape, and most teams don't. The boring lever almost no one pulls hard enough: matching model size to task difficulty. Gating questions, filtering documents, synthesizing logs — none of these need a frontier model. A smaller model handles them fine, at a fraction of the cost. We default to GPT-4.1 Mini for a lot of these, and it's been one of the highest-leverage decisions we've made. There's no clean rule for when to use what. It's still more art than science. But if you're not actively making that call, you're paying for it. Wrote more about how we think about managing token demand here: open.substack.com/pub/frontierai…

English

442

Herald retweetledi

Vikram Sreekanti@vsreekanti·23 Nis

Agents can't choose between structure and flexibility. We learned this the hard way. In the early days of RunLLM, we built the way most AI SRE vendors still build: have customers write runbooks, encode them as workflows, let the agent execute them in response to alerts. It worked in demos. It fell apart in production. The moment an alert looked different from anything we'd seen before, the agent was useless. The moment a customer's architecture changed, the runbook was stale. We were shipping a glorified lookup table and calling it an agent. The instinct is to flip the other way. Let the model figure it out. Give it good context, a capable loop, and get out of the way. That works until you try to run it at scale. Context windows fill up and something has to decide what to keep. Costs balloon and something has to route cheaper tasks to cheaper models. Multiple agents need to coordinate and something has to orchestrate them. Each of those is an engineering decision that can't be solved by asking the model nicely. The teams building serious agents have all landed in the same place, independently: structure where it has to be enforced, flexibility where reasoning matters, and a deliberate architecture deciding which is which. Picking a side is how you avoid doing that work. New post on the AI Frontier this week on why the Python vs. Markdown debate is the wrong debate: open.substack.com/pub/frontierai…

English

457

Herald@Herald_Dev·28 Nis

x.com/i/article/2049…

ZXX

Herald retweetledi

Vikram Sreekanti@vsreekanti·16 Nis

A VP of Construction Engineering. That's who our AI-powered SDR was emailing last week. We're a developer tools company. Construction engineering is not in our ICP. But the agent saw "VP of Construction Engineering" and decided it was close enough to "VP of Engineering." A human would catch that instantly. The agent couldn't, because it was built the way most agents are built today: take a human workflow, write down the steps, and hand each one to an LLM. That works when everything fits the expected pattern. It falls apart the moment anything requires judgment. I keep seeing the same mistake across the industry. Agents that try to generate a finished slide deck from a prompt. Agents that try to write and send entire email sequences autonomously. Agents that present themselves as replacements for the human rather than tools that make the human better. The best agent products I've used don't work that way. They keep the scope narrow, the feedback loops fast, and the human in the loop on the decisions that require taste. When the cost of generating work is zero, taste is what stands out. The agents that win are the ones designed to let humans apply it. We wrote about this — and what we think "agent-native" actually means — in the first post of our new series: frontierai.substack.com/p/build-agents…

English

546

Herald@Herald_Dev·14 Nis

x.com/i/article/2044…

ZXX

1.1K

Herald retweetledi

Vikram Sreekanti@vsreekanti·9 Nis

Ask Claude to build you a financial model in Excel. You'll get back reasonable structure, plausible assumptions, formulas that link together correctly. Now you have to check it. Do you open every cell and inspect every formula? If you do that, you might as well have built it yourself. If you don't, you're trusting a junior employee who works at superhuman speed but might have encoded some very strange assumptions that didn't stand out at first glance. Validating agent-generated work is the problem nobody is talking about. Agents have made creation cheap. They haven't made it any easier to know whether what was created is actually right. The bottleneck used to be writing the code, building the model, drafting the document. Now it's checking the output. And our tools — spreadsheets, code review, document editors — were all designed for a world where humans did the creating. None of them are built for the volume or the speed agents produce at. @profjoeyg and I wrote about this, and what we think validation actually has to look like going forward: open.substack.com/pub/frontierai…

English

13.8K

Herald@Herald_Dev·7 Nis

x.com/i/article/2041…

ZXX

531

Herald retweetledi

Vikram Sreekanti@vsreekanti·2 Nis

AI agents shouldn't have a job title. The entire AI industry is racing to build "AI SDRs," "AI SREs," and "AI SOC analysts." You can't walk through SF without seeing a billboard for one. We get why — customers search for these terms, and if your site doesn't speak their language, you lose the SEO battle before you make your pitch. But here's the problem: when you name your agent after a job title, you're promising it can do everything that person does. Including the stuff that never made it into the job description. The result is mismatched expectations, eroded trust, and products that underdeliver on their own marketing. Meanwhile, the agent category with the deepest adoption, the strongest data flywheels, and the most widespread quality? Coding agents. And none of them called themselves an "AI software engineer." That's not a coincidence. The full post explains why job title thinking constrains what an agent can actually do: open.substack.com/pub/frontierai…

English

484

Herald@Herald_Dev·2 Nis

x.com/i/article/2039…

ZXX

145

Herald retweetledi

Vikram Sreekanti@vsreekanti·26 Mar

We all know we live in the AI bubble, but the bubble is smaller than you might think. Even at some of the most innovative companies in the world, AI adoption is a big hurdle. Right now, you might be tempted to focus on the people who are excited to adopt — but that might not be a sustainable long-term strategy. Here's why you're going to have to break out of the bubble 👇

English

675

Herald retweetledi

Vikram Sreekanti@vsreekanti·19 Mar

The idea that a startup will build an agent to help understand all your enterprise data is really appealing — unfortunately, it's incredibly difficult to defend. Enterprises have data everywhere, and a single front door that helps you find and analyze what you need at the right time is the holy grail. With LLMs, many startups are promising this future. The reality, however, is that these products are indefensible. The frontier model labs are desperately competing for enterprise attention, and they have all the advantages. The full post breaks down why 👇

English

776

Herald retweetledi

Vikram Sreekanti@vsreekanti·12 Mar

Predicting the doom of SaaS companies is all the rage right now, but... are they actually going to die? Maybe! Some SaaS companies are very likely to be disrupted. Others have more defensibility. Where do you fall on the spectrum? @profjoeyg and I put together a SaaS extinction test 👇

English

125

Keşfet

@Redpoint @vsreekanti @RunLLM @profjoeyg @elonmusk @BarackObama @taylorswift13 @cristiano