Jake Loo
257 posts

@jakeloo
exploring and experimenting with AI. co-founded @thirdweb at @fdotinc. @UCBerkeley.


New Engineering blog: We tasked Opus 4.6 using agent teams to build a C compiler. Then we (mostly) walked away. Two weeks later, it worked on the Linux kernel. Here's what it taught us about the future of autonomous software development. Read more: anthropic.com/engineering/bu…

New on the Anthropic Engineering blog: tips on how to build more efficient agents that handle more tools while using fewer tokens. Code execution with the Model Context Protocol (MCP): anthropic.com/engineering/co…

moved in to our new hq today w @FarzaTV

We made Claude, but multiplayer. For the first time (ever) -- you can collaborate with a model + others in the same exact chat. Demo + available now:

Our vision is for AI that uses world models to adapt in new and dynamic environments and efficiently learn new skills. We’re sharing V-JEPA 2, a new world model with state-of-the-art performance in visual understanding and prediction. V-JEPA 2 is a 1.2 billion-parameter model, trained on video, that can enable zero-shot planning in robots, allowing them to plan and execute tasks in unfamiliar environments. Learn more about V-JEPA 2 ➡️ ai.meta.com/blog/v-jepa-2-… As we continue working toward our goal of achieving advanced machine intelligence (AMI), we’re also releasing three new benchmarks for evaluating how well existing models can reason about the physical world from video. Learn more and download the new benchmarks ➡️ ai.meta.com/blog/v-jepa-2-…

The wildest AI takes are "it's not good enough yet" or "it can't do X well." This completely ignores the exponential improvement curve we're on. If you don't believe every aspect of digital work will reach human-level quality soon, you're setting yourself up for failure. Agents are already handling tasks that required entire teams a few years ago. The building blocks for autonomous digital work are already here. Most people aren't ready for how fast this will impact every company on the planet. Jobs will transform, companies will become leaner, but we'll also see an explosion of new startups. We're entering the greatest entrepreneurial opportunity in decades.

Summary in case you missed any LLM research from the past month:
* RL on math datasets improves math ability v1
* RL on math datasets improves math ability v2
* RL on math datasets improves math ability v3
* RL on math datasets improves math ability v4
* RL on math datasets...
