dirtydata
836 posts


Hermes Agent is evolving FAST. In just the past week, Nous Research added:
- A full WebUI/Desktop App
- Background Computer Use on macOS
- Multi-agent orchestration
- Hermes Kanban upgrades
- Lightpanda browser backend support
- Qwen3.6-Plus FREE in Nous Portal
- Better autonomous workflows
- Persistent long-term memory systems
Hermes is starting to feel less like an AI tool and more like a true open-source Agentic AI Operating System. Full breakdown/demo:
youtu.be/Gx2joHxUhgg

Cool paper from PwC.
"Earlier is always better" is the default intuition for agent clarification. New paper claims that's mostly wrong.
Goal clarification loses nearly all of its value after just 10% of execution.
The team built a forced-injection framework that drops ground-truth clarifications at controlled points along a long-horizon agent's trajectory, across 4 information dimensions (goal, input, constraint, context), 3 benchmarks, and 4 frontier models. 84 task variants, 6,000+ runs.
Pass@3 falls from 0.78 back to baseline. Input clarification keeps value through roughly 50%. Past mid-trajectory, asking for any clarification at all performs worse than never asking.
A complementary study of 300 unscripted sessions shows no current frontier model asks within the empirically optimal window. 52% of sessions over-ask. Others never ask at all.
Why it matters: clarification has been treated as a binary capability: does the agent ask or not? This is the first quantitative demand curve for *when* the question is worth asking.
Paper: arxiv.org/abs/2605.07937
Learn to build effective AI agents in our academy: academy.dair.ai
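A rough way to picture the forced-injection setup described above: pick a fraction of the trajectory, force the clarification in at that point, then score repeated attempts. This is only a sketch under stated assumptions, not the paper's code. The helper names (`injection_index`, `build_trajectory`, `mean_pass_at_k`), the `[CLARIFICATION]` marker, and the reading of Pass@k as "at least one of k attempts succeeds" are all hypothetical choices made for illustration.

```python
from typing import List


def injection_index(total_steps: int, fraction: float) -> int:
    """Map a fraction of execution (0.0 to 1.0) to a step index."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be in [0, 1]")
    return min(int(fraction * total_steps), total_steps)


def build_trajectory(steps: List[str], clarification: str, fraction: float) -> List[str]:
    """Return a copy of `steps` with the clarification forced in at `fraction`."""
    idx = injection_index(len(steps), fraction)
    # Hypothetical marker format; a real harness would inject a chat message.
    return steps[:idx] + [f"[CLARIFICATION] {clarification}"] + steps[idx:]


def mean_pass_at_k(per_task_attempts: List[List[bool]]) -> float:
    """Assumed Pass@k: a task passes if any of its k attempts succeeded."""
    if not per_task_attempts:
        raise ValueError("need at least one task")
    passed = sum(1 for attempts in per_task_attempts if any(attempts))
    return passed / len(per_task_attempts)


if __name__ == "__main__":
    steps = ["s0", "s1", "s2", "s3"]
    # Inject the same clarification early, mid-way, and at the very end.
    for fraction in (0.0, 0.5, 1.0):
        print(fraction, build_trajectory(steps, "the goal is X", fraction))
```

Sweeping `fraction` over a grid and computing `mean_pass_at_k` at each point would give a value-versus-timing curve of the kind the post describes.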

@pretty_sarlin if you use your whole brain you'll see a picasso piece

We are stuck at claude limits and this guy is hitting github rate limits 😅
Peter Steinberger 🦞@steipete
I built a whole distributed caching layer over gh. Still run into limits.

@Jordan456257099 And this is why you do not do drugs, kids... perfect example of how it fries your brain

🚨 BREAKING: @9trevv allegedly snatches @ChudTheBuilder's hat LIVE on #PumpFun stream 😭 🎩
Absolute chaos breaking out in the trenches right now as viewers watch the hat heist unfold in real time.
Clip below from the live stream minutes ago

@dhruvtwt_ sad to see so many people who claim to be smart but can't seem to figure it out XD

@SStricklandMMA next time just make fun of his voice, GG's you're in his head

hey @karpathy i think we need a new term: Chill Coding. It's vibe coding just one app at a time, and youtube or netflix is on.

@dhruvtwt_ yes, codex/codex-app are clearly better models but the harness sucks usage so fast it's not worth it

@tonysimons_ feels bad when you're not aware of the best hack for infinite usage XD