
Derek Matzen
478 posts

Derek Matzen
@derekmatzen
CEO // High Tide building Luna (personal AI) + Ra (infra AI)





i was going through the hermes agent architecture and codebase and one thing that really stood out to me is that hermes is taking a much more explicit route to self-improvement than most agent systems usually imply. like it is not doing some offline trajectory mining where you collect lots of traces, run some separate extraction pipeline, cluster behaviors and then distill them into skills later. instead i think hermes feels much more like agent-mediated procedural distillation: the model itself notices that a workflow is reusable and writes it out into a durable artifact through the skill interface. in fact there is no separate skill-extraction model, no embedding-based clustering pass and no dedicated replay-style learning loop in the main design (hermes is doing act, notice, write, reuse all in one loop). also the interesting part is that the same runtime that acts is also the runtime that writes down its own reusable procedures.






















