
Vítor Balocco
3.6K posts

Vítor Balocco
@vitorbal
Builder and Applied AI Engineer. Cofounder of @runlayer. Prev. AI lead for @Zapier Agents, @Stedi, ESLint. Carioca 🇧🇷 living in Madrid 🇪🇸


We get stuck every month or two on complex problems, usually around complex concurrency problems across multiple services. After solving it manually I always stash the git sha (before/after) in a running list. We now have some very useful eval sets for when new models come out. Most of them still unsolvable without hindsight steering from many many context windows worth of investigation and reproduction


Thank god MCP is dead Just as useless of an idea as LLMs.txt was It's all dumb abstractions that AI doesn't need because AI's are as smart as humans so they can just use what was already there which is APIs

Thank god MCP is dead Just as useless of an idea as LLMs.txt was It's all dumb abstractions that AI doesn't need because AI's are as smart as humans so they can just use what was already there which is APIs

the overindexing on CLIs is kind of insane to me it's building a primitive that's not portable, properly discoverable, has no good approval flow DCR / CIMD to APIs would go so much further but CLI is just the current hype thing

Thank god MCP is dead Just as useless of an idea as LLMs.txt was It's all dumb abstractions that AI doesn't need because AI's are as smart as humans so they can just use what was already there which is APIs

Code Review optimizes for depth and may be more expensive than other solutions, like our open source GitHub Action. Reviews generally average $15–25, billed on token usage, and they scale based on PR complexity.


The core of this system is MCP elicitation When a destructive action like `await tools.vercel.dns.removeRecord` is called, it triggers an elicitation from the client to approve it More harness should bring support for this one, is an incredibly useful primitive


this is the Final Boss of Agentic Engineering: killing the Code Review at this point multiple people are already weighing how to remove the human code review bottleneck from agents becoming fully productive. @ankitxg was brave enough to map out how he sees SDLC being turned on its head. i'm not personally there yet, but I tend to be 3-6 months behind these people and yeah its definitely coming.

Happy Friday, rebels. The Entire CLI now has experimental support available for the @cursor_ai IDE and CLI. Plus, faster Checkpoints, public repos on Entire, and more in this week’s Dispatch. 🤖 entire.io/blog/entire-di…











