
I used Claude Computer Use/Dispatch yesterday. My feeling:
It’s too damn slow!
Posting a tweet takes me ~5 seconds (once I have the content). Claude took 70 seconds.
Why? It controls the screen via a loop: take a screenshot → send to a huge remote multimodal model (opus 4.6) → decide actions (click, type, scroll) → take another screenshot → repeat.
We’re basically forcing a large general model to operate a human UI.
Two things will happen in my opinion:
1. It is using a massive model (Opus 4.6) just to understand screens. That won’t last. Smaller, specialized models and eventually local models will handle most of this.
2. GUIs were built for humans. Almost all software will expose APIs/CLI for agents, so most actions won’t need to “use a computer” at all.
English
