Davide Zetti
576 posts

Davide Zetti
@DukeZetti
| motion design•animation•drawings |






Today OpenAI announced o3, its next-gen reasoning model. We've worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task in compute ) and 87.5% in high-compute mode (thousands of $ per task). It's very expensive, but it's not just brute -- these capabilities are new territory and they demand serious scientific attention.

Breakthroughs aren't just shouted out from the mountain tops. Scientific breakthroughs must be independently verified by a wider scientific community. Not just a few people declare hard to verify milestones.































