Matan Grinberg

1.8K posts

Matan Grinberg

@matanSF

ceo @FactoryAI

SF Katılım Ocak 2021

571 Takip Edilen15.6K Takipçiler

Sabitlenmiş Tweet

Matan Grinberg@matanSF·9 Oca

x.com/i/article/2009…

ZXX

264

76.7K

Matan Grinberg@matanSF·12h

@garrytan @FactoryAI Let’s go! 🤝

English

472

Matan Grinberg retweetledi

Garry Tan@garrytan·12h

GStack now supports Factory Droid @FactoryAI Thanks for getting me to do it @matanSF

English

210

14.1K

Matan Grinberg@matanSF·1d

@MartinShkreli Sorry for the confusion here. Need to make Mission mode more obvious. DMing

English

647

Martin Shkreli@MartinShkreli·1d

what is the best tooling for 24-7 inference/agent-driven research? im trying factory but it stops and asks me questions even though i have 'auto' mode on. tbh i think this is an even bigger killer app than LLM chatbots. who else is out there doing it?

English

150

582

111K

Matan Grinberg retweetledi

luke@luke_alvoeiro·1d

We recently launched missions. Here's a short demo of how to get started. Excited to see what you'll build!

English

300

30.8K

Matan Grinberg retweetledi

Geo Anima@geo_anima·2d

tbh @droid maxing is life

Faisal@infmes

Almost 3B tokens used in the last month using droid missions

English

3.6K

Matan Grinberg retweetledi

Ray Fernando@RayFernando1337·2d

Zero humans. Forty features. Nine hours. Jensen Huang says AGI is here. I agree, and I have the screenshots to prove it. Yeah, I know. You've read this headline forty times this week. Scroll past, nobody would blame you. But I've been quietly using something since February that most people haven't caught up to yet...it's not Opus 5. The screenshots are a system that just finished a full architecture refactor of a production codebase. It found duplication I'd been living with for months and left the codebase meaningfully better than when it started. The system is Factory AI's Droid, specifically a feature called Missions. I paddle outrigger canoe in Hawaii. Our club needed an app. Not a vibe slop app...a real app. Authentication, group management, crew assignments with rotation logic so the same person isn't always stuck in seat 6, real-time chat, weather integration that warns you when wind is above 15 mph, notifications, admin controls. The kind of thing that would've cost six figures to build a few years ago. So I described what I wanted to Droid, and then I sat there watching it. It didn't just start writing code. It started asking me questions. Like, good questions. Clarifying questions about edge cases I hadn't even thought of. Then it broke the whole project into 14 milestones and 76 features. And before it wrote a single line of code, it created these things called validation contracts, basically testable assertions based on the spec so it knows what "done" actually looks like. I'm sitting there like...wait, it's planning the way I would plan? Then it started building. But here's the part that got me. When a milestone finished, separate agents came in and tested everything from the user's perspective. NOT UNIT TESTS! These agents actually opened a browser in the background and clicked through the app the way a real person would. When something failed, it didn't just retry the same thing. It went back and re-steered the entire plan. I've never seen an AI system do that successfully for many hours. I know what you're thinking, how many tokens did this cost??!? Is it burning tokens the entire time? The answer has a lot more detail than just running a Ralph loop on steroids. So I flew to San Francisco and sat with the Factory AI team to understand how this actually works under the hood. The orchestrator never writes code, it only delegates. Workers get cleared context between tasks so they don't hallucinate from stale state. And get this, the system isn't even tied to one model. You can run Claude as the orchestrator and GPT-5 as the worker. Their longest mission? Sixteen days. Can you imagine? I've been building with AI live since August 2024. I've had fifty Claude windows open (well...you know what I mean), Codex running in parallel, the whole circus. You know the feeling. This is the first time I've felt like the system was genuinely thinking through a problem the way a senior engineer would scope a project before writing a single line of code. Jensen told Lex Fridman that AGI means AI that can build a billion-dollar company. I don't know about a billion dollars, but it built my canoe club a production app while I went to make coffee. That's close enough for me.

English

242

21.9K

Matan Grinberg retweetledi

Bill@justBill·2d

@droid @FactoryAI Is going to win and its not even close. I was trying EVERYTHING to fix my holdout RMSE on a model I am working on and it kept getting caught over and over and even regressing. One prompt using droid + GPT5.4 High and fixed it. I spent almost $100 in Codex credits trying to fix this before hand. If i could I would buy the Max plan TODAY

English

7.2K

Matan Grinberg@matanSF·2d

@garrytan at least one

English

186

Garry Tan@garrytan·3d

I wonder how many other attendees at JPM100 at Big Sky Montana today are live-launching new version of their open source AI agentic framework

English

28.9K

Matan Grinberg retweetledi

Troy Martig@troymartig·3d

We plugged the CLI into @FactoryAI @Droid and within 15 minutes had: - Daily automated spend briefings pushed to Slack - WoW and MoM vendor analysis with anomaly detection - Vendor management automatically cross-referenced with Google CLI (gmail, drive, etc.) - Automated spend alerts routed by category to the right person in Slack (fake data Slack message below)

Ramp Labs@RampLabs

Today, we're releasing Ramp CLI to let agents manage your company's finances. 50+ tools across cards, bills, expenses, travel, and approvals. Fewer tokens than MCP, and comes with pre-built skills like receipt compliance and agentic purchasing.

English

128

21.8K

Matan Grinberg retweetledi

Eno Reyes@EnoReyes·3d

Couldn't resist asking droid to make this chart - our team of 25 technical staff is moving quite fast and shipping every day. This doesn't even include the bugfixes, reliability, tests, research, evals, internal apps, etc. that we're working on!

English

12.2K

Matan Grinberg retweetledi

am.will@LLMJunky·3d

Trying out Droid Factory Missions for the first time. Kinda excited. Anyone else out there using this? What is your experience?

English

6.3K

Matan Grinberg@matanSF·3d

@sudo_goreng @FactoryAI Lags how?

English

496

Goreng@sudo_goreng·4d

Just tested @FactoryAI for a couple of hours + adapted my opencode skills & plugins to droid. Will be daily driving it for a couple of weeks. So far its good, but the only problem is the CLI sometimes lags, Idk if its a zellij/ghostty specific issue or not.

Goreng@sudo_goreng

Anyone tried @FactoryAI before? Is it good or nah? factory.ai/pricing

English

5.7K

Matan Grinberg@matanSF·4d

@winstonweinberg @sequoia @a16z @coatuemgmt @conviction @eladgil @EvanticCapital @kleinerperkins Legend

English

1.6K

Matan Grinberg@matanSF·4d

@EricFriedman @bentossell Let’s go! Lmk what u think

English

130

Eric Friedman ⚙️@EricFriedman·4d

@matanSF @bentossell Ok, this got me and finally going to try it

English

130

Matan Grinberg@matanSF·5d

or use droid and get features like this without waiting 4 months :)

Claude@claudeai

New in Claude Code: auto mode. Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf. Safeguards check each action before it runs.

English

123

12.8K

Matan Grinberg@matanSF·5d

@lumendriada Yessir!

English

480