
introducing Flywheel: the infrastructure for autonomous research.
tensorqt
7.1K posts

@tensorqt
chaos dancing star @paradigmainc

introducing Flywheel: the infrastructure for autonomous research.


This allows agents to identify dead-ends, transport patterns between different campaigns, in a way that doesn’t dissipate the knowledge gained by exploration.






You will find the Autoresearch Campaign already populated: we set out 3 distinct agents, harnessed by Claude Code, respectively being GLM-5 by @Zai_org , Sonnet 4.6 by @AnthropicAI and GPT 5.4 by @OpenAI to have a simultaneous contest. 80$ in @OpenRouter credits later, here’s what the exploration looked like: Green is GPT, Blue is GLM, Orange is Sonnet. in this short challenge, GLM-5 by @Zai_org (somewhat surprisingly) won, and all models were pretty narrow in their exploration.


You will find the Autoresearch Campaign already populated: we set out 3 distinct agents, harnessed by Claude Code, respectively being GLM-5 by @Zai_org , Sonnet 4.6 by @AnthropicAI and GPT 5.4 by @OpenAI to have a simultaneous contest. 80$ in @OpenRouter credits later, here’s what the exploration looked like: Green is GPT, Blue is GLM, Orange is Sonnet. in this short challenge, GLM-5 by @Zai_org (somewhat surprisingly) won, and all models were pretty narrow in their exploration.


introducing Campaigns (Beta).

We believe this is the new way of discovering things: set a scope and spend compute to explore the search space. We also believe this could be an interesting, inspectable open-ended benchmark of model quality in autonomous exploration.









@karpathy's autoresearch wave just makes it clearer the paradigm is shifting. if the objection is that the found ideas are incremental, so is a large fraction of existing papers. how does peer review adapt? What's the scientist's role in this new paradigm? join us @iclr_conf