
Alex Carmichael
219 posts

Alex Carmichael
@ai_carmichael
VP Ops who built an AI Chief of Staff. Writing about agents, harnesses, and operator workflows — for builders who aren't engineers.
USA Katılım Nisan 2026
80 Takip Edilen19 Takipçiler
Sabitlenmiş Tweet

Here is a consolidated list of articles on my persistent, LLM agnostic and telegram enabled personal agent setup:
alexcarmichael.com/halsey-setup
English

@dhh Do you run persistent harnesses with each model (toggle between them)?
English

@ericzakariasson Curious how "blocks files over 1k lines" is enforced. hard reject at PR time, or does the agent propose the split and you approve the decomposition?
English

@jxnlco Does the persona extraction step uses clustering on message embeddings or just free-form identify patterns? the latter may hallucinate consistency that isn't there.
English

@dair_ai at diversity threshold does the compiled model start degrading? how narrow does the workflow scope need to be to hold that quality bar?
English

NEW paper worth reading.
A full agentic workflow can be distilled into model weights and run at roughly 100x lower inference cost while preserving near-frontier task quality.
The workflow includes multi-step LLM calls, tool invocations, intermediate scratchpads, and decision structure.
Instead of expressing all of that at runtime through a framework, the paper amortizes the behavior into a compiled model through targeted distillation.
This is the strongest economic argument for agent compilation so far. Runtime loops are flexible, but expensive. Compiled workflows trade some flexibility for a massive inference-cost reduction.
Paper: arxiv.org/abs/2605.22502
Learn to build effective AI agents in our academy: academy.dair.ai

English

@gregisenberg Makes sense in theory, but what if each customer has minor variations in the workflow? Your process is exactly what I’ve followed building custom agents in my businesss, but it’s not obvious that they could be lifted and replaced to another company with a different tech stack
English

How to build a vertical AI agent cash-flowing startup:
find painful workflow in a boring industry → talk to 10 people who do that workflow every day → map every step, every tool, every spreadsheet, every phone call →
do the workflow manually first → be the agent before you build the agent → find the edge cases that break everything → document them in obsidian as structured markdown →
set up your agent stack → hermes for the harness → obsidian vault as the knowledge base → composio for authentication across apps → build your first 1-3 skills that solve the core pain →
use claude code or codex to build the product → use agents to set up other agents → use perplexity MCP and context7 for up-to-date docs → let the agent handle the scaffolding while you focus on the workflow logic →
ship the agent to your first 5 customers for free → watch what they actually use it for → they will surprise you → the thing you built for isn't always the thing they need most →
build content around the niche → not "building in public" content → useful content → the tips, the shortcuts, the pain points that only someone who does this workflow would know → become the person for that niche →
charge per outcome not per seat → per lease renewed, per claim processed, per candidate sourced → the ROI conversation takes 10 seconds when it's tied to a result →
set up watchdogs and alerts → your agent emails you when a cron job breaks or a skill fails → the customer should never have to tell you something is broken →
connect to open router → see exact costs per model per task → use GPT 5.5 for tool calls → use open source for lightweight tasks → route the right model to the right job → watch your margins double →
let hermes write to its own memory after every task → the agent compounds → the longer it runs the better it gets → that accumulated memory becomes your moat → a competitor can clone your product but they can't clone 6 months of context →
expand the workflow → you started with one step → add the next → then the next → now you own the entire workflow end to end → you went from a tool to the operating system for that vertical →
stack the agents → one agent is a side project → five agents across five customers is a business → each one runs in its own environment → you check in once a day →
raise only if you need capital not credibility → most agent businesses should never raise → the margins are too good to give away equity → stay lean → stay profitable → repeat
i'm rooting for you
English

@jxnlco Persistent task state across sessions and a structured decision log — right now every context reset throws away implicit reasoning that's hard to reconstruct.
English

@SahilBloom In my experience, people miss a lot of chances to keep their mouths shut
English

@jxnlco Does Codex pull from that repo at inference time, or is it baked into the system prompt context on session start?
English
Alex Carmichael retweetledi

@dair_ai Does the memory model itself require retraining when the domain shifts, or is the update mechanism truly online?
English

// Memory as a Model //
The paper augments any LLM with a separate trained memory model that stores, retrieves, and integrates facts on its behalf.
It decouples memory updates from base-model weight updates. It achieves continual-learning robustness without catastrophic forgetting, which is a property that RAG fails to deliver.
A vector store is a database with a learned encoder bolted on. MeMo is a learned subsystem with explicit interfaces. That distinction matters, as agents need to be able to ingest fresh knowledge weekly without retraining or vector-DB churn.
At its core, the position here is that memory in agents should be modular, learned, and gated, not a context-window hack.
Paper: arxiv.org/abs/2605.15156
Learn to build effective AI agents in our academy: academy.dair.ai

English

@SahilBloom My problem is that the excitement "collides" with the exhaustion and sleep quality lacks as I lay awake thinking about how excited I am for the next day. Curious how others handle this.
English

Build a life where you're energized in the morning and exhausted at night. Energized in the morning means you're excited about the things you get to work on and the people you get to work on them with. Exhausted at night means you gave your all to those things and people. Your best life is built in that collision.
English

the future is bright, lets get to work
Andrej Karpathy@karpathy
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
English

@thejustinwelsh I know many who've built a business who would disagree with you. It's all tradeoffs
English
Alex Carmichael retweetledi







