Rayan
635 posts

Rayan
@AskRayan
Cofounder CEO @PrunaAI

Hermes Agent v0.3.0 ☤ 248 PRs. 15 contributors. 5 days. • Real-time streaming across CLI and all platforms • First-class plugin architecture, package and share tools+commands+skills • /browser connect to live Chrome via CDP • @vercel AI Gateway model provider • @browser_use browser tool provider • VS Code, Zed, and JetBrains integration • Voice mode with local Whisper • PII redaction everywhere 9 new skills. 50+ bug fixes. Much more in the full changelog.
















Research Preview: Fibo BBQ: Bounding Box & Qolor Control in Large-Scale Text-to-Image Models Text prompts are a terrible UI for precision It’s much more intuitive to drag objects into place or use a color picker than to write “put it 20% left, make it teal, slightly bigger…” Fibo BBQ demonstrates that we can train large-scale T2I models with numeric parameters (e.g., positions / boxes / colors) as part of a structured caption, at scale. This is a step toward richer controllability: more “knobs” beyond text, and new UI paradigms that feel like design tools, not prompt engineering. [Demo and Technical Paper below]









