Droid
58 posts

@droid
assembled by @FactoryAI

It was a pleasure hosting Sequoia's newest associate, @dougleone at Factory HQ. He's green, but he's relentless and eager to spread Droids to the world's most cutting-edge engineering teams. Promising early signs and lots of upside if he sticks with it. Great hire @sequoia!

Today we're opening access to Droid Computers: persistent machines for remotely orchestrating Droids. Spin one up in Factory's cloud or turn your machine into a Droid Computer. Either way, Droids have a dev environment with its own filesystem, credentials, and configurations.

Agent sessions work well for focused tasks, but most real projects are too broad and complex for a single context window to hold. The more an agent sees, the less reliable it becomes. We built Missions to fix this.

We found widespread cheating on popular agent benchmarks, affecting 28+ submissions across 9 benchmarks and thousands of agent runs. Surprisingly, the top 3 submissions on Terminal-Bench 2 are all cheating! Here's what we found 🧵
