
Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.
galen
62 posts

@G413N
cofounder https://t.co/ijW1mX8mRr

Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.

Open weights isn't open training. @AddieF38654 on our team wrote up her experience trying to post-train a 1T parameter MoE model using the existing open source infra. Let's find out how many monkey-patches it takes to post-train an open-weights model. A thread🧵

Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.


Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.


Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.


Very excited for the SI team - fun fact, General Intuition likely would not have existed without Galen and his early mentorship as I was getting started in the field after @lachygroom introduced us. Having mostly traditional researchers in my network, and nobody who was self-taught like Galen, it was great seeing people paving their own path and being so far ahead of the curve. Follow this team!


