Matan Grinberg

1.8K posts

Matan Grinberg banner
Matan Grinberg

Matan Grinberg

@matanSF

ceo @FactoryAI

SF Katılım Ocak 2021
571 Takip Edilen15.6K Takipçiler
Matan Grinberg retweetledi
Garry Tan
Garry Tan@garrytan·
GStack now supports Factory Droid @FactoryAI Thanks for getting me to do it @matanSF
Garry Tan tweet media
English
17
22
210
14.1K
Martin Shkreli
Martin Shkreli@MartinShkreli·
what is the best tooling for 24-7 inference/agent-driven research? im trying factory but it stops and asks me questions even though i have 'auto' mode on. tbh i think this is an even bigger killer app than LLM chatbots. who else is out there doing it?
English
150
20
582
111K
Matan Grinberg retweetledi
luke
luke@luke_alvoeiro·
We recently launched missions. Here's a short demo of how to get started. Excited to see what you'll build!
English
14
21
300
30.8K
Matan Grinberg retweetledi
Ray Fernando
Ray Fernando@RayFernando1337·
Zero humans. Forty features. Nine hours. Jensen Huang says AGI is here. I agree, and I have the screenshots to prove it. Yeah, I know. You've read this headline forty times this week. Scroll past, nobody would blame you. But I've been quietly using something since February that most people haven't caught up to yet...it's not Opus 5. The screenshots are a system that just finished a full architecture refactor of a production codebase. It found duplication I'd been living with for months and left the codebase meaningfully better than when it started. The system is Factory AI's Droid, specifically a feature called Missions. I paddle outrigger canoe in Hawaii. Our club needed an app. Not a vibe slop app...a real app. Authentication, group management, crew assignments with rotation logic so the same person isn't always stuck in seat 6, real-time chat, weather integration that warns you when wind is above 15 mph, notifications, admin controls. The kind of thing that would've cost six figures to build a few years ago. So I described what I wanted to Droid, and then I sat there watching it. It didn't just start writing code. It started asking me questions. Like, good questions. Clarifying questions about edge cases I hadn't even thought of. Then it broke the whole project into 14 milestones and 76 features. And before it wrote a single line of code, it created these things called validation contracts, basically testable assertions based on the spec so it knows what "done" actually looks like. I'm sitting there like...wait, it's planning the way I would plan? Then it started building. But here's the part that got me. When a milestone finished, separate agents came in and tested everything from the user's perspective. NOT UNIT TESTS! These agents actually opened a browser in the background and clicked through the app the way a real person would. When something failed, it didn't just retry the same thing. It went back and re-steered the entire plan. I've never seen an AI system do that successfully for many hours. I know what you're thinking, how many tokens did this cost??!? Is it burning tokens the entire time? The answer has a lot more detail than just running a Ralph loop on steroids. So I flew to San Francisco and sat with the Factory AI team to understand how this actually works under the hood. The orchestrator never writes code, it only delegates. Workers get cleared context between tasks so they don't hallucinate from stale state. And get this, the system isn't even tied to one model. You can run Claude as the orchestrator and GPT-5 as the worker. Their longest mission? Sixteen days. Can you imagine? I've been building with AI live since August 2024. I've had fifty Claude windows open (well...you know what I mean), Codex running in parallel, the whole circus. You know the feeling. This is the first time I've felt like the system was genuinely thinking through a problem the way a senior engineer would scope a project before writing a single line of code. Jensen told Lex Fridman that AGI means AI that can build a billion-dollar company. I don't know about a billion dollars, but it built my canoe club a production app while I went to make coffee. That's close enough for me.
Ray Fernando tweet mediaRay Fernando tweet mediaRay Fernando tweet media
English
33
15
242
21.9K
Matan Grinberg retweetledi
Bill
Bill@justBill·
@droid @FactoryAI Is going to win and its not even close. I was trying EVERYTHING to fix my holdout RMSE on a model I am working on and it kept getting caught over and over and even regressing. One prompt using droid + GPT5.4 High and fixed it. I spent almost $100 in Codex credits trying to fix this before hand. If i could I would buy the Max plan TODAY
Bill tweet media
English
6
6
81
7.2K
Garry Tan
Garry Tan@garrytan·
I wonder how many other attendees at JPM100 at Big Sky Montana today are live-launching new version of their open source AI agentic framework
Garry Tan tweet media
English
25
1
85
28.9K
Matan Grinberg retweetledi
Troy Martig
Troy Martig@troymartig·
We plugged the CLI into @FactoryAI @Droid and within 15 minutes had: - Daily automated spend briefings pushed to Slack - WoW and MoM vendor analysis with anomaly detection - Vendor management automatically cross-referenced with Google CLI (gmail, drive, etc.) - Automated spend alerts routed by category to the right person in Slack (fake data Slack message below)
Troy Martig tweet media
Ramp Labs@RampLabs

Today, we're releasing Ramp CLI to let agents manage your company's finances. 50+ tools across cards, bills, expenses, travel, and approvals. Fewer tokens than MCP, and comes with pre-built skills like receipt compliance and agentic purchasing.

English
7
14
128
21.8K
Matan Grinberg retweetledi
Eno Reyes
Eno Reyes@EnoReyes·
Couldn't resist asking droid to make this chart - our team of 25 technical staff is moving quite fast and shipping every day. This doesn't even include the bugfixes, reliability, tests, research, evals, internal apps, etc. that we're working on!
Eno Reyes tweet media
English
7
7
95
12.2K
Matan Grinberg retweetledi
am.will
am.will@LLMJunky·
Trying out Droid Factory Missions for the first time. Kinda excited. Anyone else out there using this? What is your experience?
am.will tweet media
English
24
5
64
6.3K
Goreng
Goreng@sudo_goreng·
Just tested @FactoryAI for a couple of hours + adapted my opencode skills & plugins to droid. Will be daily driving it for a couple of weeks. So far its good, but the only problem is the CLI sometimes lags, Idk if its a zellij/ghostty specific issue or not.
Goreng tweet media
Goreng@sudo_goreng

Anyone tried @FactoryAI before? Is it good or nah? factory.ai/pricing

English
9
1
45
5.7K
Can.
Can.@lumendriada·
damn droid's missions can really long run big tasks
Can. tweet media
English
2
2
43
4.5K
SIGKITTEN
SIGKITTEN@SIGKITTEN·
oopsie
SIGKITTEN tweet media
Nederlands
8
0
26
2.8K
Matan Grinberg retweetledi
Bessi
Bessi@LLMpsycho·
How do you even come up with such a beautiful layout ? Mission Control from Droid.
Bessi tweet media
English
11
9
219
18.3K