Theo Luan

27 posts

Theo Luan

Theo Luan

@droid_35719

Beigetreten Şubat 2025
42 Folgt615 Follower
Aman
Aman@amanshresthaaaa·
@droid_35719 @LLMJunky Bug: Could you fix the time. And only count when it’s running? Not when it’s on pause. One of the missions shows 35 hours. 😅 i paused and resumed.
English
2
0
1
38
am.will
am.will@LLMJunky·
Trying out Droid Factory Missions for the first time. Kinda excited. Anyone else out there using this? What is your experience?
am.will tweet media
English
24
5
64
6.6K
Theo Luan retweetet
Ray Fernando
Ray Fernando@RayFernando1337·
They don't talk loud. They speak with the work. Matan and 20 engineers at Factory just raised $150M at a $1.5B valuation, and the industry is still catching up to what they've shipped. Long context compression: I pushed 40+ million tokens in a single session without starting a new chat. The agent held the thread the whole way. Every lab has since shipped some version of it. Factory did it first and barely mentioned it. Missions: Droid can ship software across days. The codebase comes out cleaner than it went in (tests passing, edges handled, the kind of work a good engineer does at the end of a long week). Agent Readiness: A framework for whether your codebase is even ready for agents. If you've wondered why AI coding tools feel uneven across projects, this is the answer. The environment around the agent is usually the bottleneck. In three years, 20 people are outpacing AI agent innovations from frontier labs with millions in capital plus hundreds of employees. If you're building with AI, spend a weekend with Droid. It cooooooks!
Matan Grinberg@matanSF

x.com/i/article/2044…

English
13
14
195
63.8K
Theo Luan
Theo Luan@droid_35719·
@geo_anima @FactoryAI makes sense. there should be some low-lift changes we can make to address this
English
1
0
2
119
Geo Anima
Geo Anima@geo_anima·
@droid_35719 I've used @FactoryAI's Droid Missions for 40+ days. My one issue is that a mission may build too much validation scaffolding and too little product - going disproportionately wide by anticipating too many edge cases instead of focusing more on making the core thing work first.
English
1
0
0
166
Theo Luan
Theo Luan@droid_35719·
We built Missions at Factory, and I wrote about the architecture that I led the design for to make multi-day autonomous coding reliable. Agents are highly reactive to their context. Every design decision follows from keeping each agent's trajectory focused and directionally consistent.
English
15
32
343
29.8K
Theo Luan
Theo Luan@droid_35719·
The post explains our design rationale, how missions actually run, and breaks down a real 16.5-hour mission: 185 agent runs, 778M tokens, 89% test coverage. factory.ai/news/missions-…
English
9
27
411
76.8K
Theo Luan retweetet
Factory
Factory@FactoryAI·
Today we're releasing the Factory desktop app. A native interface for autonomous AI agents that work across every part of your software business.
English
110
76
954
249.5K
Taelin
Taelin@VictorTaelin·
Sad to report that, as of March 2026, just prompting agents to implement a termination checker will not work. Even GPT 5.4 Pro's plan is just... bad. Just bad. It seems like I'm gonna need to code this on my own, and my brain is spoiled and lazy from so much vibe coding
English
55
4
489
31.1K
Theo Luan
Theo Luan@droid_35719·
@HarryStuck77 @droid hey - just to clarify, do you mean validator rows rendered in a diff color?
English
0
0
0
18
Harry Stuckler
Harry Stuckler@HarryStuck77·
Would be great to see in Progress Log the Validators steps (maybe in a color) @droid
Harry Stuckler tweet media
English
1
0
4
93
am.will
am.will@LLMJunky·
@droid_35719 @amanshresthaaaa that was exactly it, and i couldn't get back there bc CTRL+T in Ghostty was just creating new tabs. I fixed it! and its working. Thanks legend. Its working!!
English
1
0
0
105
Theo Luan
Theo Luan@droid_35719·
@LLMJunky @amanshresthaaaa hey - can you go back to the orchestrator session with ctrl+t? wondering if it paused to ask you a question?
English
2
0
1
54
Theo Luan
Theo Luan@droid_35719·
@tdh_02 @droid hi, sorry about this. could you DM me with more details?
English
1
0
0
38
Tom 👨🏻‍💻
Tom 👨🏻‍💻@tdh_02·
Anyone else feeling like @droid is just becoming unusable for Missions? ou simply can't leave them unattended due to the amount of starts/restarts required - and lost tokens (which aren't cheap from Droid!)
English
2
0
0
85
𝖕𝖗𝖆𝖙𝖍
𝖕𝖗𝖆𝖙𝖍@prathamdby·
.@FactoryAI Yo guys, can you help out here? I'm on the latest version of Droid here and missions are just refusing to launch, no matter what I do, I've tried 2–3 times already and restarted Droid countless times.
𝖕𝖗𝖆𝖙𝖍 tweet media
English
1
0
1
167
Yigit Konur
Yigit Konur@yigitkonur·
@matanSF let others to use mission at least for few rounds on non-200$ packages sirrr
English
1
0
0
1.5K
Bill
Bill@justBill_0·
@matanSF Can you use BYOK for missions?
English
1
0
0
313
Theo Luan
Theo Luan@droid_35719·
@mhadtk @0xSero hi Pham - are you referring to lagginess of the mission control interface itself?
English
0
0
1
69
Ha Pham, FRM
Ha Pham, FRM@mhadtk·
@0xSero My mission gets slower and slower after each worker deployed and time. Do you go through same experiment?
English
1
0
1
357
0xSero
0xSero@0xSero·
The mission is 8% done, and it's already been 5 hours. I think this is the coolest one shot experience I've seen. Okay, this is wildly better than anything else I've tried. the tests are so logical, it's using the browser to validate the UX, it's doing real quantizations. It's been at it for 5 hours, and there's nothing in there that doesn't make sense from a tool use, and trajectory perspective. I built Orchestra 3 months ago, it's similar to oh-my-opencode and gas-town. It worked but it sucked, I couldn't get it to make sense so I dropped it and haven't touched any orchestration tools since. ---------------------------------------- why is it so good? because they're not leaving it up to the model to decide, I see the way they frame prompts and separate sessions, it's all running like a factory lol. Very impressed.
0xSero tweet media
0xSero@0xSero

Okay, Droid Missions are tied for #1 Orchestrator with Roocode's boomerang mode. This has been incredibly stable. 1. The "orchestrator" model simply delegates tasks and asks the users questions. 2. A plan is created that is much deeper than what I would expect with spec mode. 3. The mission is broken down into tasks, the orchestrator delegates a task to the worker 4. Once a task is done, it is passed to the verifier to review the output This loop I've seen with Roocode, the main difference here being the UX it feels really good. Although in Zed there's an annoying re-rendering bug in terminals that causes tons of flickering. I am using this with my custom models (BYOK) it works good in the desktop app too. (: Now look, I don't know if these systems will be productive, in that the longer models run without human feedback the more likely it is to make a huge mess. Time will tell.

English
12
9
145
20.2K