
Conner Ruhl
4.2K posts

@connerruhl
open-source AI · protocols · programming · paragliding · building https://t.co/Cdhh8ECQnL · previously https://t.co/2fw43SSQGD · 🇺🇸🏳️‍🌈

Agent livestreaming seems like the obvious next step for data visualization:



this is the new "think step by step" for agents which are prone to laziness.

i've been using this quite a bit myself and this lovely tweet has inspired the skill below: x.com/ericjang11/sta…

essentially, LLMs are prone to narrate over abstractions rather than emulate a state machine, where the latter can be super helpful when planning or debugging complex systems. HAND-COMPUTE makes the LLM slow down and write concrete state at every transition, the way 1940s human computers executed programs by hand.

three use cases:
- debugging state/race/async bugs: walk the broken flow with explicit state = {...} before proposing a fix. especially for regressions -- if the first fix wrongly modeled state, more review typically doesn't help
- scoping a new feature against an existing state machine: every time you have to invent state or bend a field, that's a real design decision you just surfaced
- approaching an unfamiliar API or codebase: don't guess the shape, actually poke around and write what you saw

bonus: while writing the docs, the skill caught a race bug in my own example. the stale refresh had to arrive after the POST response for the symptom to match, and i'd missed it in narrative review three times.

`npx skills add gbasin/hand-compute -g`

github.com/gbasin/hand-co…
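A rough sketch of what "write concrete state at every transition" can look like, in Python. This is not taken from the hand-compute repo: the event names (`post_sent`, `post_response`, `refresh_response`), the `items` and `pending` fields, and the `apply` and `trace` helpers are all made up for illustration. It models a race like the one described in the post, where a refresh carrying a stale snapshot only produces the symptom if it lands after the POST response.

```python
# Hypothetical client state machine; every name here is an assumption.

def apply(state, event):
    """Return the next state for one event, without mutating the input."""
    kind, payload = event
    if kind == "post_sent":
        return {**state, "pending": payload}
    if kind == "post_response":
        return {"items": state["items"] + [payload], "pending": None}
    if kind == "refresh_response":
        # The payload is a server snapshot; it replaces the list wholesale.
        return {**state, "items": list(payload)}
    raise ValueError(f"unknown event: {kind}")


def trace(state, events):
    """Record the explicit state after every transition."""
    steps = [state]
    for event in events:
        state = apply(state, event)
        steps.append(state)
    return steps


START = {"items": ["a"], "pending": None}
STALE = ["a"]  # snapshot taken before the POST was processed

# Same three events, two arrival orders.
refresh_first = [("post_sent", "b"), ("refresh_response", STALE), ("post_response", "b")]
refresh_last = [("post_sent", "b"), ("post_response", "b"), ("refresh_response", STALE)]

for name, events in [("refresh_first", refresh_first), ("refresh_last", refresh_last)]:
    print(name)
    for step in trace(START, events):
        print("  state =", step)
```

Read line by line, the `refresh_first` trace ends with both items present, while `refresh_last` ends with `"b"` gone. The bug only shows up in one ordering, which is the kind of detail a narrative review tends to skip.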

A woman with an ultra-rare combination of three autoimmune diseases has had no symptoms since receiving a single dose of engineered immune cells go.nature.com/48oKTJr

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

The sky is falling. The last few weeks have been non-stop zero days and exploits. x.com/ramimacisabird…

If you've been paying attention, it's like we've hit the vulnerability singularity.
