Eric Barroca

7.6K posts

Eric Barroca

Eric Barroca

@ebarroca

founder at @vertesiahq, platform for AI-driven knowledge work.

Tokyo-to, Japan Katılım Ağustos 2008
508 Takip Edilen1.1K Takipçiler
Eric Barroca
Eric Barroca@ebarroca·
Context engineering is not prompt assembly. It's the durable infrastructure that prepares, scopes, ranks, governs, and audits what the model sees per turn. The window is the output. The layer is the work
English
0
0
0
3
Eric Barroca
Eric Barroca@ebarroca·
@mitchellh Very good point :) the issue with having to agent loop is that it influence itself as well. Curious: with the right architecture framing from the start or mid loop would have done it right?
English
0
0
0
46
Mitchell Hashimoto
Mitchell Hashimoto@mitchellh·
I've got an agent in a loop optimizing a renderer with the goal to minimize frame times (and tests to measure). It got times down from 88ms to 2ms and allocations down from ~150K to 500. Sounds good, right? Wrong. This is exactly why agent psychosis is a big fucking problem. As an experiment, I rewrote the Ghostty core render state in Go, with access to identically laid out data structures as Ghostty and the exact same validation tests. I made a purposely naive renderer (simple, correct, but slow). 88ms per frame with 150,000 allocations (horrendous, lol)! I then kickstarted a Ralph loop to bring the frame times down. I told it it can't modify input data structures or the public API or tests (they're correct), but it can do anything else it wants. It got to work. It has worked for about 4 hours. I've spent around $350 on this experiment so far. The results? 88ms => 1.5ms 150K allocs => ~500 allocs Incredible right? Nope. My hand-written renderer I ported has frame times (same benchmark) of ~20us (0.020ms) and 0 allocations in the update path. This is the problem with psychosis and lacking systems understanding. If you don't understand the system, you're going to accept that this is an incredible result. If you understand the system, you'll see better solutions immediately and can do roughly 75x better on throughput. The people who blindly trust agent output are in the former camp. They're sheeple, overdrinking from a fountain of mediocrity. Standard disclaimer: I use AI all the time. I like AI. The point I'm making is to not blindly accept results. Think. Analyze. Learn.
English
291
923
8.4K
713.5K
Eric Barroca
Eric Barroca@ebarroca·
Yes, and the shape of that data matters. Mainframes had records. Cloud had databases. Agents need documents and data, together, prepared and addressable. Otherwise the model is reasoning on bag-of-words, not on the truth
MongoDB@MongoDB

In every tech transformation, something changes. But one thing has stayed constant. Our President & CEO Chirantan @cj_mongodb joined @HarryStebbings of @20vcFund to discuss why data remains the constant across every major technology shift — from mainframes, to cloud, to AI. Watch the full conversation: mongodb.social/6017B8VcEb

English
0
0
0
9
Eric Barroca
Eric Barroca@ebarroca·
Vector store ≠ context layer. The vector store loses what makes documents documents: structure, version, identity, governance. RAG is a useful component inside the context layer. It is not the layer itself
English
0
0
0
3
Eric Barroca
Eric Barroca@ebarroca·
Most attempts at agent governance reach for the leash: block this, restrict that, add a gate. A leash stops one action. A contract defines a whole space of allowed action — and refuses everything outside it. Process engines built for code did not need contracts. Process engines built for agents do. That is the architecture.
English
0
0
0
4
Eric Barroca
Eric Barroca@ebarroca·
@petarivanovv9 AI makes a lot of the other part a lot faster too: debugging, reviewing, testing (and making them), deciding, etc.
English
0
0
0
53
Petar Ivanov
Petar Ivanov@petarivanovv9·
Writing code was never the bottleneck. In a typical sprint, an engineer spends maybe 20% of their time on net-new code. The rest is reading, debugging, reviewing, deciding, communicating, and operating. AI makes the 20% piece ten times faster and barely touches the other 80%, except by adding more code to review, which makes that part slower.
English
53
34
396
36K
Eric Barroca
Eric Barroca@ebarroca·
Documents are often the truth. Systems of record are representations. When the two disagree – at audit, in a claim, in a contract dispute – the document is what gets asked for. An agent that only reads the system row is reading a derivative, not the truth
English
0
0
1
20
Eric Barroca
Eric Barroca@ebarroca·
Yes. Access scope evolves with capability is the right shape. The harder problem is doing it at runtime: which scope this agent gets for this task, on whose behalf, with what credentials, and audited as whom. Sandboxing limits the blast radius. The contract decides what's allowed in the first place
Anthropic@AnthropicAI

New on the Engineering Blog: The access and permissions we grant agents should evolve with their capabilities. In our own products, we set these parameters through sandboxing, which limits the scope of any potentially destructive actions. Read more: anthropic.com/engineering/ho…

English
0
0
0
17
Eric Barroca
Eric Barroca@ebarroca·
Finally close to touch my limit on codex 5.5 this week - and can’t see how to buy more capacity - so went back to Claude opus 5.7 as main model to see. Feels like such a downgrade. Can do things just as well but feels so slow and chatty. I want a machine not a friend
English
1
0
0
155
Eric Barroca
Eric Barroca@ebarroca·
Stop dropping PDFs into a chat window and calling it AI. Real enterprise content is not chat input. It is contracts, claims, policies, statements, case files, slides, tables — and each one needs intake, structured extraction, embeddings, lineage, permissions, versioning. The repository owns all of that. The chat window owns none of it. One is a tool, the other is infrastructure.
English
0
0
0
15
Eric Barroca
Eric Barroca@ebarroca·
Intelligence Is Contextual. Designing the enterprise context layer. The context window is not the context layer. The window is the output. The layer is the work — durable infrastructure around documents and data. ebarroca.substack.com/p/intelligence…
English
0
0
0
13
Eric Barroca
Eric Barroca@ebarroca·
Yes, and the artifact library matters as much as the UI. If an agent can generate a spreadsheet, report, diagram, or app, the system has to keep those artifacts addressable, versioned, and attached to the work that produced them
Tomasz Tunguz@ttunguz

Software's future: a harness to control AI-generated UIs + a context library of artifacts. The interface isn't going away — it's become malleable to whatever the user needs, when they need it. Read more: tomtunguz.com/plastic-user-i…

English
0
0
0
10
Eric Barroca
Eric Barroca@ebarroca·
The frontier keeps moving. The best reasoning model this quarter is not the best one next quarter. Same for visual, extraction, long-context. Each is a different leaderboard, each with a different winner. A serious agent platform should route by task, cost, latency, risk, and quality. Not by one vendor's model family.
English
0
0
0
6
Eric Barroca
Eric Barroca@ebarroca·
Agent. Tool. Memory. Planner. Executor. Router. Skill. Chain. Graph. Worker. Crew. Every concept gets a name. Every name gets an abstraction. At some point, you are not building a system anymore. You are assembling a vocabulary. The problem is not that the ideas are wrong. It is that they become framework objects instead of system responsibilities. So you get: hidden behavior implicit control flow hard-to-debug systems layers you have to read source code to understand The better framing is simpler: reasoning execution state tools boundaries The hard part is not naming things. The hard part is making the system predictable.
English
1
0
0
8