James Cogan
96 posts

James Cogan
@cogan
From Commodore VIC-20 to operating ventures with multi-agent fleets. Machine economy obsessed. Documenting what comes next. Founder, dailypixel.





Gossip Goblin is arguably the best AI filmmaker in the world. His new film THE PATCHWRIGHT is a masterpiece (10M+ views). But nobody knows how he actually makes these. Until now. He let me share every step of the workflow with you 🧵👇





@sudoingX What is the reason for using Openclaw at this point? I have had zero issues with Hermes.


We made a video for Chris Brown. And we're still not over it. 🔥 Incredibly honored and excited to join the BROWN journey with Chris and keep exploring creativity and storytelling with the power of AI and tech. More to come. @chrisbrown

Three hours. That's how long an AI can now work on expert software tasks without a single human check-in. In early 2024 it was five minutes. METR's Time Horizon measures the maximum task length, in how long a human professional would need, that a frontier AI can finish autonomously with 80% reliability. Claude Mythos Preview hit three hours in early 2026. GPT-5.2 sits around an hour. Gemini 3.1 Pro at ninety minutes. Same exponential trend across all three labs. The doubling time has compressed from seven months between 2019 and 2025 to 89 days since 2024. Four times a year, the autonomous work capacity of frontier AI doubles. A 12-month product lag at the current pace is four doublings: the frontier has roughly 16x the autonomous task length you're building against. Even at the historical seven-month pace, 12 months is nearly two doublings: 3-4x. This is why incumbents keep losing to AI native startups despite more capital, more headcount, more distribution. The capital is real. The headcount is real. The intelligence layer underneath the product was committed to 18 months ago when the project kicked off, against models that no longer exist at the frontier. The career math runs the same way. A PM at a non-AI native company is learning the AI of a year ago. Their eval frameworks, agent architectures, and prompting patterns are tuned to a system that already got lapped twice. By the time internal proof-of-value lands, the benchmark is two generations gone. There's an access gradient riding on top of the doubling. Lab engineers running pre-release internal models sit ahead of public frontier users. API users running the latest releases sit ahead of teams still on last year's stack. API access compounds. Eval data on unreleased models compounds. Insider knowledge of what ships in six months compounds. The entire moat is rate of change. Three hours today. Six hours by summer. Twelve by year end.














