The harness is not incidental. It IS the control substrate.
If you want an LLM to actually control systems, you need what classical control theory assumes implicitly but agent frameworks skip entirely:
State — not chat history. A typed canonical representation: current observation, belief state, memory, latent variables / inferred regime, goal stack, pending actions, recent disturbances, confidence / uncertainty. Without this, there's no feedback. Just open-loop prompting.
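A minimal sketch of what such a typed state could look like (all field names are illustrative, not from any particular framework):

```python
from dataclasses import dataclass, field

@dataclass
class AgentState:
    """Canonical typed state -- the feedback substrate, not a chat log."""
    observation: dict                                  # current observation
    beliefs: dict                                      # inferred latent variables / regime
    memory: list = field(default_factory=list)         # durable facts
    goal_stack: list = field(default_factory=list)     # goals, innermost last
    pending_actions: list = field(default_factory=list)
    disturbances: list = field(default_factory=list)   # recent anomalies
    confidence: float = 1.0                            # scalar uncertainty estimate

state = AgentState(observation={"cpu": 0.42}, beliefs={"regime": "nominal"})
state.goal_stack.append("keep cpu < 0.8")
```

The point is that every field is addressable and comparable across steps, which is what makes feedback computable at all.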
Action space — not "free-form do anything." Typed, constrained actions with schemas and affordances: actuators, APIs, skills, prompts to sub-agents, code patches, browser actions, controller parameter updates. Constraints are what make control possible.
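One way to sketch a constrained action space: a registry of schemas, where an action is admissible only if it validates. The action names and checks below are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Action:
    """A typed action; free-form strings never reach the actuator."""
    name: str
    params: dict

# Registry of allowed actions and their parameter constraints (illustrative).
SCHEMAS = {
    "set_gain": lambda p: 0.0 <= p.get("value", -1.0) <= 1.0,
    "restart_service": lambda p: p.get("service") in {"web", "worker"},
}

def validate(action: Action) -> bool:
    """Admissible only if the action matches a known schema and its constraint."""
    check = SCHEMAS.get(action.name)
    return check is not None and check(action.params)

assert validate(Action("set_gain", {"value": 0.3}))
assert not validate(Action("rm_rf", {"path": "/"}))  # unknown action rejected
```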
Transition feedback — every action must return structured consequences: success / failure, resulting state delta, uncertainty, latency, cost, side effects. No structured feedback = no closed loop.
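A sketch of structured feedback: every executor returns a typed record instead of a bare string. The `Transition` fields mirror the list above; the toy executor is an assumption.

```python
from dataclasses import dataclass
import time

@dataclass
class Transition:
    """Structured consequence of an action -- the closed-loop signal."""
    ok: bool
    state_delta: dict      # what actually changed
    uncertainty: float     # how sure we are the delta is real
    latency_s: float
    cost: float
    side_effects: list

def execute(action_name: str) -> Transition:
    """Toy executor: always returns a Transition, never raw text."""
    t0 = time.monotonic()
    # ... perform the real action here ...
    return Transition(ok=True, state_delta={action_name: "applied"},
                      uncertainty=0.05, latency_s=time.monotonic() - t0,
                      cost=0.01, side_effects=[])

fb = execute("set_gain")
```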
Cost / reward — an explicit objective function or utility model: task completion, safety margin, cost, latency, resource use, human preference, reversibility. Without this, the agent optimizes for nothing.
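The simplest explicit form of such an objective is a weighted sum over named terms; the weights below are made up for illustration.

```python
def utility(outcome: dict, weights: dict) -> float:
    """Explicit scalar objective: weighted sum over named outcome terms.
    Terms absent from `weights` contribute nothing -- the objective is
    exactly what is written down, not an implicit vibe."""
    return sum(w * outcome.get(k, 0.0) for k, w in weights.items())

WEIGHTS = {"completion": 1.0, "safety_margin": 0.5,
           "cost": -0.2, "latency": -0.1, "irreversibility": -2.0}

u = utility({"completion": 1.0, "cost": 0.3, "irreversibility": 0.0}, WEIGHTS)
# 1.0*1.0 + (-0.2)*0.3 = 0.94
```

Note the heavy negative weight on irreversibility: destructive actions need to buy a lot of completion to be worth taking.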
Verifier / observer — an independent mechanism to evaluate the resulting trajectory. The controller cannot be its own judge. You need a separate evaluation loop.
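The separation can be sketched as two functions that never share internals: the controller proposes, the verifier only sees outcomes. Everything here is illustrative.

```python
def controller_propose(state) -> list:
    """The controller proposes a trajectory; it does not grade itself."""
    return [("set_gain", 0.3), ("set_gain", 0.9)]

def verifier(trajectory) -> bool:
    """Independent observer: checks invariants over the whole trajectory,
    with no access to the controller's reasoning -- only to its outcomes."""
    return all(0.0 <= value <= 0.8 for _, value in trajectory)

traj = controller_propose(None)
accepted = verifier(traj)   # rejected: 0.9 violates the invariant
```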
Safety filter — like a barrier function layer, but for agent actions: forbidden tool calls, resource limits, policy bounds, rollback conditions, invariant checks, human-approval gates when necessary. This is the runtime shield.
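A runtime shield can be as small as a function that sits between policy and actuator and returns allow / block / escalate. The forbidden set, budget, and gate list below are assumptions.

```python
FORBIDDEN = {"delete_db", "disable_monitoring"}   # never allowed
BUDGET = {"api_calls": 100}                       # hard resource limit
NEEDS_HUMAN = {"deploy_prod"}                     # human-approval gate

def shield(action: str, spent: dict, approved: set) -> str:
    """Applied after the policy, before the actuator.
    Returns 'allow', 'block', or 'escalate'."""
    if action in FORBIDDEN:
        return "block"
    if spent.get("api_calls", 0) >= BUDGET["api_calls"]:
        return "block"                            # budget exhausted
    if action in NEEDS_HUMAN and action not in approved:
        return "escalate"                         # wait for a human
    return "allow"

assert shield("fetch_logs", {"api_calls": 3}, set()) == "allow"
assert shield("delete_db", {}, set()) == "block"
assert shield("deploy_prod", {}, set()) == "escalate"
```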
This is why "prompt + tools" feels brittle. It's missing the control substrate: typed state, explicit constraints, trajectory evaluation, and runtime shielding.
The recursive self-improving loop needs this foundation. The harness IS the controller.
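The pieces above compose into a single loop. A toy sketch, with every component passed in separately so the LLM policy lives only in `propose` (the instantiation driving `x` toward 5 is purely illustrative):

```python
def control_loop(state, propose, shield, execute, verify, max_steps=10):
    """Closed loop: propose -> shield -> execute -> fold feedback -> verify."""
    for _ in range(max_steps):
        action = propose(state)
        if action is None:
            break                                  # goal reached
        if not shield(action):
            continue                               # shield drops the action
        state = {**state, **execute(action, state)}  # fold structured delta
        if not verify(state):
            break                                  # independent observer halts
    return state

final = control_loop(
    state={"x": 0},
    propose=lambda s: ("inc", min(2, 5 - s["x"])) if s["x"] < 5 else None,
    shield=lambda a: 0 < a[1] <= 2,                # step size capped at 2
    execute=lambda a, s: {"x": s["x"] + a[1]},     # structured state delta
    verify=lambda s: s["x"] <= 5,                  # trajectory invariant
)
assert final == {"x": 5}
```

The controller never touches the actuator directly; every action passes through the shield, and every consequence flows back through typed state.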




