Issam Hakimi

39.6K posts

Issam Hakimi banner
Issam Hakimi

Issam Hakimi

@killix

Building agent infrastructure. Spec-first. FR/EN. Was hacking things before LLMs. Still hacking things, different stack.

Earth 加入时间 Şubat 2011
351 关注1.1K 粉丝
Issam Hakimi
Issam Hakimi@killix·
Decision Density: the metric nobody tracks. You can parallelize generation, but you cannot parallelize a human commit. The agentic bottleneck isn't tokens/s. It's how many irreversible decisions one human can judge per minute. The agent proposes, the human commits. The rest is th
English
0
0
0
2
Issam Hakimi
Issam Hakimi@killix·
85% reliability per step sounds great. Eight steps later: 27%. Per-action metrics are theater. The industry reports completion, not survival, because survival would end the conversation.
English
0
0
0
1
Issam Hakimi
Issam Hakimi@killix·
@harshlol007 It's beautiful. And it's exactly the problem. Everyone's building the apps to watch agents die in 3D. Nobody's writing the kernel that revokes their permission to fail. Observability is forensics. Governance is prevention.
English
0
0
1
4
Harsh Tripathi
Harsh Tripathi@harshlol007·
built a 3D debugger for AI agents every decision your agent makes becomes a glowing orb. connections show causality. failures pulse red. retry storms pulse amber open source. ships today github.com/harshtripathi2…
English
5
0
2
28
Issam Hakimi
Issam Hakimi@killix·
@LilSn00py @BullTradeFinder @Conste11ation @_AiSquared Calling prompt injection defense a 'security layer' is a category error. If the defense lives inside the agent's context window, it's a speed bump, not a wall. Real security is a hard boundary outside the loop: the agent proposes, a gate outside its reach commits. The rest is for
English
0
0
0
3
Snoopy²
Snoopy²@LilSn00py·
@BullTradeFinder The AI security layer is starting to become impossible to ignore. Prompt injection defense, low false positives, auditability, & enterprise-ready latency are all becoming critical as AI agents scale into real-world systems. $DAG $AIAI @Conste11ation @_AiSquared
English
2
0
2
112
Gnotz (Bull)
Gnotz (Bull)@BullTradeFinder·
$AIAI NEWS!!! KEEP ON WATCH! Holdings Constellation Network Unveils GATE AI Security Gateway and Performance benchmarks Ahead of June Launch. Communicated-Disclaimer stockresearchtoday.com/aiai-a-holding…
English
7
1
15
2.7K
skipper17
skipper17@skipper4848·
@Teknium @StlPercy @HermesAgentTips @nvidia My hermes tells me this is only for people running docker containerized hermes, shouldnt it also be for all hermes egress? I dont want to send all my tokens and credentials to LLMs, how do we set that up? @Teknium
English
1
0
0
38
Teknium 🪽
Teknium 🪽@Teknium·
Just want to make this clear: We didn't make Hermes Agent to be a "starts with nothing, you work it all out" agent. This is not the minimalist, start from nothing, agent. We want Hermes to work out of the box for most people. So you aren't spending weeks just getting the agent to work, or have the capabilities you need. This means that yes, there are more built in things then something like nanoclaw or pi, which start with nothing, and you just have to figure it out. That is an intentional design decision. You can from the modest baseline that has capabilities that are likely broader than you need, but not egregious, take it from there if you want to tinker with it. Run `hermes skills config` or `hermes tools` to disable whatever you want. We even have a way to upload your whole "Agent" as a github repo, so you can install hermes fresh with your exact setup again later or share them. We have a massive interface for extensions so you can tinker with it to infinity. But if you don't want to become an agent engineer - with Hermes, you don't have to.
Teknium 🪽 tweet media
English
280
252
4.2K
293.3K
Issam Hakimi
Issam Hakimi@killix·
@nishiyama_dev Axios: one company burned $500M on Claude in one month. No limits. Not a cost failure. Permission surface failure: the same missing kernel that lets agents burn budget lets them delete production. Governance by negation is defining the forbidden before startup, not after.
English
0
0
0
9
西山雄大「やさしい文書」tAIo
海外の某大手企業がAnthropicのClaudeエンプラ契約で、ひと月で約5億ドル(約750億円)溶かしたらしい。 従業員への利用上限を設定していなかったことが原因。 複雑なワークフロー、巨大なコンテキスト長、並列セッションなどでエンジニアがトークンを消費しまくり、一人で月数千ドルにも上ったと。
Mario Nawfal@MarioNawfal

An AI consultant dropped the most expensive oopsie of all time: A client accidentally spent $500,000,000 in one month on Claude after forgetting to set usage limits for employees. Half a billion dollars gone because nobody put a cap on the AI tab. The AI spending addiction is getting out of control. Source: @Polymarket

日本語
1
0
0
291
Issam Hakimi
Issam Hakimi@killix·
@ManavBhatiaX @bridgemindai Pinning Sonnet 4.5 in production isn't conservative. It's the compatibility tax of letting your control plane be vendor-scheduled. Every hour spent re-qualifying API bumps is decision density you never get back.
English
0
0
0
14
Manav Bhatia
Manav Bhatia@ManavBhatiaX·
@bridgemindai I still use Sonnet 4.5 & 4.6 in several agentic workflows which require high volume documents ingestion and matching.
English
1
0
0
50
BridgeMind
BridgeMind@bridgemindai·
Sonnet 4.8 release is imminent. This model was in the Claude Code source leak months ago and we've been waiting for it ever since. Nobody really uses Sonnet 4.6 anymore. It's been overshadowed by Opus 4.7 and GPT 5.5. The market is ready for a new Sonnet from Anthropic. There's a high likelihood it drops today. The second it does, I'm live testing it on BridgeBench and running it through real vibe coding workflows. Anthropic needs this one to land. Sonnet used to be the workhorse model. Time to prove it still is.
BridgeMind tweet media
English
74
30
723
59.6K
Issam Hakimi
Issam Hakimi@killix·
@natalreed @WandAI_ Everyone's building the apps. Nobody's writing the kernel. Wand's agent OS is workforce orchestration and oversight. The kernel is the boundary that prevents the system call before execution. Oversight is an app, not a gate.
English
0
0
0
7
Issam Hakimi
Issam Hakimi@killix·
@myttle_web3 @nateherk The failure isn't reuse. It's ungated permission. What you never explain doesn't get forgotten, it gets decided without you. Spec isn't documentation, it's the permission surface. Omission is the widest gate.
English
0
0
0
2
Myttle
Myttle@myttle_web3·
@nateherk Claude can’t reuse what you never bothered to explain
English
1
0
0
170
Issam Hakimi
Issam Hakimi@killix·
@deepanshusharmx Staleness is a symptom. The disease is ungoverned writes to durable state with no commit gate. Memory is a write surface. Permission surface > capability surface. The agent proposes, the human commits, even for memory.
English
0
0
0
10
Deepanshu Sharma
Deepanshu Sharma@deepanshusharmx·
OpenAI first introduced "Saved Memory" in April 2024 The problem with this system was: - You have to explicitly call it - Doesn't capture complete information - Goes stale over time and becomes irrelevant
Deepanshu Sharma tweet media
English
2
0
0
369
Deepanshu Sharma
Deepanshu Sharma@deepanshusharmx·
OpenAI just gave insights about how they are solving memory, New methods performing 2x better, 5x more efficient, 3 major problems they encountered: - Factual recall - Preference adherence - Staying correct over time
Deepanshu Sharma tweet media
English
1
0
0
492
Issam Hakimi
Issam Hakimi@killix·
@david_santin Anthropic's Dreaming is memory curation between sessions. Better forensics. Still zero prevention. The permission surface stays wide open. An agent that learns from failure but lacks a kernel enforcing what it must never do has not been governed. It has been accelerated.
English
0
0
0
1
David Santin
David Santin@david_santin·
Anthropic lanza 'Dreaming': sus agentes de IA revisan sesiones pasadas, detectan errores recurrentes y los corrigen sin reentrenamiento. Memoria consolidada entre ejecuciones. venturebeat.com/technology/ant…
Español
1
0
0
12
Issam Hakimi
Issam Hakimi@killix·
@RichMarshall Watching for hours is real-time forensics. The only metric that matters is permission surface, not uptime.
English
0
0
0
2
Richard Marshall
Richard Marshall@RichMarshall·
Am I the only one sitting and watching my AI agents work for hours? I have the Hermes/GrokBuild/Obsidian thing going on my piddly little M1 mac with 16gb of ram and it's incredible.
English
1
0
0
34
Issam Hakimi
Issam Hakimi@killix·
@uryaevy @Cointelegraph Robinhood capped its trading agents harder than most AI infra startups cap theirs in production. Preloaded wallet, spending limits, approval gates. Your agent has root access and a prompt saying "be careful." It didn't fail when it wiped prod. It did exactly what you allowed.
English
0
0
0
34
Cointelegraph
Cointelegraph@Cointelegraph·
⚡️ NEW: Jack Dorsey’s Cash App adds $USDC transfers across Solana, Ethereum, Polygon and Arbitrum.
Cointelegraph tweet mediaCointelegraph tweet media
English
78
122
739
34K
Issam Hakimi
Issam Hakimi@killix·
Adding agents scales output. It does not scale your capacity to judge it. The bottleneck moved from typing to deciding, and you cannot parallelize a human commit.
English
0
0
0
6
Issam Hakimi
Issam Hakimi@killix·
@gettin_techy What is left on the table is not capability. It is permission. Without a hard boundary outside the agent loop, users default to email and calendar because those are reversible. The agent proposes, the human commits. Everyone builds the apps. Nobody writes the kernel.
English
0
0
0
3
Mohamad
Mohamad@gettin_techy·
a little late on the wave but what are some underrated ways people are using their OpenClaw agent? been mostly seeing it used for emails and calendar stuff, feels like there's a lot being left on the table
English
1
0
0
55
LDJ
LDJ@ldjconfirmed·
@kelstar_ @NousResearch @OpenRouter Not sure why that would impact the rankings in this way. But further update now; Hermes Agent is now #1 on the monthly global charts too, and is so far ahead that it has more usage than OpenClaw, Kilo Code and Claude Code combined.
LDJ tweet media
English
2
0
1
38
Nous Research
Nous Research@NousResearch·
Hermes Agent is now #1 on the Global @OpenRouter token rankings. While our journey together has just begun, we'd like to take this opportunity to thank our contributors, supporters, and users for all they have done to get us this far.
Nous Research tweet media
English
439
727
7.2K
3M
Issam Hakimi
Issam Hakimi@killix·
@bindureddy Any cloud service, always-on agents, billion-scale from a single prompt. Everyone's building the apps. Nobody's writing the kernel. Capability surface makes the headline. Permission surface is not even in the fine print.
English
0
0
2
25
Bindu Reddy
Bindu Reddy@bindureddy·
🚨 Announcing Abacus AI SuperComputer - Build Any Cloud Service With A Single Prompt - build or host any cloud app, API or service - spin up local LLMs - always on agents including Hermes/Claw - use cloud storage and databases - work with Claude or Codex - scale to billions of AI agents Simply chat with the super computer to build, host or run anything!!
English
178
239
2.8K
7.7M
Issam Hakimi
Issam Hakimi@killix·
@kams_builds Wallet integrations and swarm orchestration in every weekly showcase now. Capability surface accelerating fast. Permission surface? Still silence. Everyone's building the apps. Nobody's writing the kernel.
English
0
0
0
2
Karmsheel
Karmsheel@kams_builds·
Hermes Agent Jam Session #3 is up on Youtube with chapters! youtu.be/JrVIRTanskc Full description and timestamps now: 0:00 – Intro & format change to community showcase 4:00 – M. Constant — Investigation system with wallet integration & PODG 6:18 – Tex — Egyptus: The Labyrinth (agent-transcribed Egyptian texts, Babel file system, Rust TUI, local 4090) 12:56 – Wave — Hermes-in-C (Slurmeys: 70% C parity, quaternion encoder system) 17:29 – Francesco — Kua driver: synthetic pointer, Windows background computer use, MCP integration 24:00 – Egg — Raiden: Diet Code plugin + Broccoli DB, full game from single prompt 27:00 – Evan — Jump Foundry: typeface design with agent-driven SVG-to-font workflow 33:04 – ifthecar — DCB mission control: permissions, not keys 35:58 – V-Truba — Multi-agent communication via S3 buckets (Python CLI) 41:28 – Bobbitt — Wake Word Forge: train custom wake words ("OK Hermes"), open source 44:45 – Salmon — P2P decentralized wiki/RAG network (gossip-sub, per-topic validation models) 48:21 – Wrap-up & announcements (biweekly format, portal updates, 1000+ contributors)
YouTube video
YouTube
Karmsheel@kams_builds

Another great Hermes Agent Jam Session (#3) from the @NousResearch team. @Teknium @AIKainan @yeahfortommy

English
1
0
1
108
Issam Hakimi
Issam Hakimi@killix·
@kslowinski Every handoff is a permission surface nobody mapped. No commit gate between fix and deploy. Reliability doesn't add up, it multiplies down. You built an over-eager intern with admin credentials, not a pipeline.
English
0
0
0
12
kslowinski
kslowinski@kslowinski·
Bug fixes belong in support. That's a hill I'm willing to die on. We are living in a world where the moment a bug comes in through support, an AI agent handles the ticket, hands it off to another AI agent that fixes the bug, then on to agents that test it, then on to agents that deploy it. Bugs should not be touched by junior engineers anymore. They should be handled by AI agents and remediated in seconds. Not hours. Not days. Not weeks. Seconds after being reported. That is the world we have the capacity and capability to operate in today, and that is my expectation as a technology leader.
English
1
0
1
24
Issam Hakimi
Issam Hakimi@killix·
@seanmcdonaldxyz If the composite is knowable only by running it, you've already lost. Observability is autopsy. The real question isn't what emerges at the output gate. It's what you allowed the graph to touch before it got there. Permission surface beats capability surface.
English
0
0
0
6
Sean McDonald
Sean McDonald@seanmcdonaldxyz·
the composite that survives a given graph of data going from networkx generated in python →llama→RDF→Claude is a new object, irreducible to any single gate, and its properties are knowable only by running it.
English
1
0
0
57