Alex Carmichael

219 posts

@ai_carmichael

VP Ops who built an AI Chief of Staff. Writing about agents, harnesses, and operator workflows — for builders who aren't engineers.

USA Joined April 2026

80 Following 19 Followers

Pinned Tweet

Alex Carmichael@ai_carmichael·4 May

Here is a consolidated list of articles on my persistent, LLM-agnostic, Telegram-enabled personal agent setup: alexcarmichael.com/halsey-setup

Alex Carmichael@ai_carmichael·1d

@AndrewYNg Makes 0 sense

Andrew Ng@AndrewYNg·4d

The new White House policy requiring green card applicants to apply from outside the US is a capricious attack on legal immigration. It will hurt families, leave us with fewer doctors, teachers and scientists, and hurt American competitiveness in AI.

2.8K

1.6K

12.1K

1.4M

Alex Carmichael@ai_carmichael·1d

@dhh Do you run persistent harnesses with each model (toggle between them)?

DHH@dhh·4d

For complicated agent work, it's amazing how much GPT5.5 has improved. I found 5.2 to be very far behind Opus. Now using Opus 4.7 after 5.5 feels like a big step backwards. Gotta love this level of competition! Strong comeback for OpenAI.

203

213

5.2K

518.5K

Alex Carmichael@ai_carmichael·1d

@ericzakariasson Curious how "blocks files over 1k lines" is enforced. Hard reject at PR time, or does the agent propose the split and you approve the decomposition?

eric zakariasson@ericzakariasson·5d

the most used skill internally at cursor right now /thermo-nuclear-code-quality-review - deletes complexity instead of moving it - blocks files over 1k lines - flags thin wrappers and leaked logic - rejects PRs that work but make code messier

135

258

5.4K

456.2K

Alex Carmichael@ai_carmichael·1d

@jxnlco Does the persona extraction step use clustering on message embeddings, or just a free-form "identify patterns" prompt? The latter may hallucinate consistency that isn't there.

jason@jxnlco·2d

codex prompt tip 13 "read my past 400 slack messages, identify my personas and make a skill on how to message people for each one, then do the same thing for emails and twitter so you know how to write in my voice"

356

24.3K

Alex Carmichael@ai_carmichael·1d

@dair_ai At what diversity threshold does the compiled model start degrading? How narrow does the workflow scope need to be to hold that quality bar?

DAIR.AI@dair_ai·5d

NEW paper worth reading. A full agentic workflow can be distilled into model weights and run at roughly 100x lower inference cost while preserving near-frontier task quality. The workflow includes multi-step LLM calls, tool invocations, intermediate scratchpads, and decision structure. Instead of expressing all of that at runtime through a framework, the paper amortizes the behavior into a compiled model through targeted distillation. This is the strongest economic argument for agent compilation so far. Runtime loops are flexible, but expensive. Compiled workflows trade some flexibility for a massive inference-cost reduction. Paper: arxiv.org/abs/2605.22502 Learn to build effective AI agents in our academy: academy.dair.ai

279

18.2K

Alex Carmichael@ai_carmichael·1d

@thejustinwelsh I have no opinion on this

Justin Welsh@thejustinwelsh·5d

An easy way to be at peace is to not have an opinion on everything in the world.

103

351

12.4K

Alex Carmichael@ai_carmichael·1d

@gregisenberg Makes sense in theory, but what if each customer has minor variations in the workflow? Your process is exactly what I’ve followed building custom agents in my business, but it’s not obvious that they could be lifted and dropped into another company with a different tech stack.

102

GREG ISENBERG@gregisenberg·2d

How to build a vertical AI agent cash-flowing startup: find painful workflow in a boring industry → talk to 10 people who do that workflow every day → map every step, every tool, every spreadsheet, every phone call → do the workflow manually first → be the agent before you build the agent → find the edge cases that break everything → document them in obsidian as structured markdown → set up your agent stack → hermes for the harness → obsidian vault as the knowledge base → composio for authentication across apps → build your first 1-3 skills that solve the core pain → use claude code or codex to build the product → use agents to set up other agents → use perplexity MCP and context7 for up-to-date docs → let the agent handle the scaffolding while you focus on the workflow logic → ship the agent to your first 5 customers for free → watch what they actually use it for → they will surprise you → the thing you built for isn't always the thing they need most → build content around the niche → not "building in public" content → useful content → the tips, the shortcuts, the pain points that only someone who does this workflow would know → become the person for that niche → charge per outcome not per seat → per lease renewed, per claim processed, per candidate sourced → the ROI conversation takes 10 seconds when it's tied to a result → set up watchdogs and alerts → your agent emails you when a cron job breaks or a skill fails → the customer should never have to tell you something is broken → connect to open router → see exact costs per model per task → use GPT 5.5 for tool calls → use open source for lightweight tasks → route the right model to the right job → watch your margins double → let hermes write to its own memory after every task → the agent compounds → the longer it runs the better it gets → that accumulated memory becomes your moat → a competitor can clone your product but they can't clone 6 months of context → expand the workflow → you started with one step → add the next → then the next → now you own the entire workflow end to end → you went from a tool to the operating system for that vertical → stack the agents → one agent is a side project → five agents across five customers is a business → each one runs in its own environment → you check in once a day → raise only if you need capital not credibility → most agent businesses should never raise → the margins are too good to give away equity → stay lean → stay profitable → repeat i'm rooting for you

148

116

1.3K

106.2K

Alex Carmichael@ai_carmichael·1d

@jxnlco Persistent task state across sessions and a structured decision log — right now every context reset throws away implicit reasoning that's hard to reconstruct.

jason@jxnlco·5d

If you're using codex desktop app today, what features do you feel like are still missing? Let me know and I’ll summarize all the feedback and share internally.

936

575

73K

Alex Carmichael@ai_carmichael·1d

@SahilBloom In my experience, people miss a lot of chances to keep their mouths shut

Sahil Bloom@SahilBloom·2d

I’m convinced that knowing when to keep your mouth shut is the ultimate life hack.

436

2.4K

15.8K

296.7K

Alex Carmichael@ai_carmichael·4d

No, after you, I insist.

Alexa Web3 (e/acc)@alexabelonix

Two Amazon robots got stuck in an aisle, spending what appears to be an eternity shuffling back and forth because neither one could figure out who should move first.

Alex Carmichael@ai_carmichael·5d

@jxnlco Does Codex pull from that repo at inference time, or is it baked into the system prompt context on session start?

1.9K

jason@jxnlco·5d

If you're using Codex, you might not know about this repo `openai/skills` if codex is open right now just ask codex `take a look at skills installer, what are some skills I should install` and it'll know how to scan this repo for skills and install the ones you need

1.4K

208.8K

Alex Carmichael retweeted

Mads@europemaxxed·6d

europeans after enabling their out of office auto reply until september

549

70K

11.5M

Alex Carmichael@ai_carmichael·5d

@dair_ai Does the memory model itself require retraining when the domain shifts, or is the update mechanism truly online?

443

DAIR.AI@dair_ai·6d

// Memory as a Model // The paper augments any LLM with a separate trained memory model that stores, retrieves, and integrates facts on its behalf. It decouples memory updates from base-model weight updates. It achieves continual-learning robustness without catastrophic forgetting, which is a property that RAG fails to deliver. A vector store is a database with a learned encoder bolted on. MeMo is a learned subsystem with explicit interfaces. That distinction matters, as agents need to be able to ingest fresh knowledge weekly without retraining or vector-DB churn. At its core, the position here is that memory in agents should be modular, learned, and gated, not a context-window hack. Paper: arxiv.org/abs/2605.15156 Learn to build effective AI agents in our academy: academy.dair.ai

114

602

65.3K

Alex Carmichael@ai_carmichael·5d

@SahilBloom My problem is that the excitement "collides" with the exhaustion, and my sleep quality suffers as I lie awake thinking about how excited I am for the next day. Curious how others handle this.

532

Sahil Bloom@SahilBloom·6d

Build a life where you're energized in the morning and exhausted at night. Energized in the morning means you're excited about the things you get to work on and the people you get to work on them with. Exhausted at night means you gave your all to those things and people. Your best life is built in that collision.

149

134

1.5K

63.6K

Alex Carmichael@ai_carmichael·5d

@trq212 I think you meant to say "let's get this bread"

Thariq@trq212·19 May

the future is bright, lets get to work

Andrej Karpathy@karpathy

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

2.1K

85.2K

Alex Carmichael@ai_carmichael·5d

@thejustinwelsh I know many who've built a business who would disagree with you. It's all tradeoffs

129

Justin Welsh@thejustinwelsh·6d

Once you've built your own business, the idea of being told what to do by another adult seems ridiculous.

182

152

1.4K

38.1K

Alex Carmichael@ai_carmichael·20 May

Pure gold

Tatum Turn Up@tatumturnup

This is the greatest video I’ve ever seen. No notes. The lifeless clanker carcass just laying there. No crowd reaction, anything. Just Billie Jean. Until its lifeless shell is shamefully dragged off. Purely amazing.

Alex Carmichael retweeted

Tatum Turn Up@tatumturnup·20 May

1.4K

8.4K

91.9K

5.5M

Alex Carmichael@ai_carmichael·20 May

I've completely stopped using the Notion UI. It still exists as a backend database for my agents to use, but my workflow is 95% messaging my agents on Telegram to do things. It happened gradually, then all at once.

Alex Carmichael@ai_carmichael·19 May

@danshipper Something good

Dan Shipper 📧@danshipper·19 May

what did karpathy see

239

1.6K

85.5K
