Ben Day
467 posts

Ben Day
@itsmebenday
thinking about forecasting @_Mantic_AI





Mantic used Tinker to RL gpt-oss-120b on judgmental forecasting; the result outperformed frontier models on event predictions. Combined with @_Mantic_AI's forecasting architecture, task-specific training takes us to the cusp of automated superforecasting.

Mantic used Tinker to RL gpt-oss-120b on judgmental forecasting; the result outperformed frontier models on event predictions. Combined with @_Mantic_AI's forecasting architecture, task-specific training takes us to the cusp of automated superforecasting.

Mantic used Tinker to RL gpt-oss-120b on judgmental forecasting; the result outperformed frontier models on event predictions. Combined with @_Mantic_AI's forecasting architecture, task-specific training takes us to the cusp of automated superforecasting.







They are ants solving a geometric problem and it is mind-blowingly colorful.










We’ve been making progress on our forecaster at @_Mantic_AI. We started competing in the @metaculus Cup last summer and landed the first top-10 finish for an AI. In the fall, we stepped it up and beat the community prediction, a combined forecast that leverages the ‘wisdom of the crowd’ of ~500 forecasters.

One of @metaculus's largest tournaments ever. I narrowly beat @_Mantic_AI, which placed 4th, ahead of the community. Lots of suprising events this quarter, causing the unweighted aggregate to perform about as well as the Metaculus Community Prediction.









