Benjamin Wilson

5 posts

@CodexVeritas2

AI Research Engineer | Passionate about Forecasting and Epistemics | Running FutureEval, the Metaculus AI Forecasting Benchmark

Joined December 2022
69 Following · 13 Followers
Benjamin Wilson reposted
Metaculus (@metaculus)
1/ AI forecasting doesn't beat top humans yet, but our research indicates models will match pro forecasters in June 2027. We built FutureEval to track this: a live benchmark with human baselines and bot tournaments measuring AI forecasting against reality.
24
113
1K
3.5M
Benjamin Wilson reposted
Stefan Schubert (@StefanFSchubert)
In a @metaculus forecasting tournament ten human pros and 96 bots participated. Each human beat all the bots. The big drop-off in the chart below is between the last human and the first bot.
7
23
210
14.9K
Benjamin Wilson (@CodexVeritas2)
@AiDigest_ @sage_future_ Not sure if it’s just me, but clicking the sage future account doesn’t load anything. Excited to follow when I can get it to not error though!
1
0
0
17
AI Digest (@aidigest_)
Introducing @aidigest_. Here, you'll find our interactive AI explainers and demos to help you stay ahead of the curve. You can follow our forecasting tools (Fatebook and Quantified Intuitions) at the newly-separate @sage_future_ account: x.com/sage_future_/s…
Sage (@sage_future_)

We're a nonprofit building tools to make sense of the future:
• @aidigest_: interactive AI explainers and demos
• fatebook.io: the fastest way to make and track your predictions
• quantifiedintuitions.org: a suite of rapid forecasting training tools

2
3
9
1.3K
Benjamin Wilson reposted
Metaculus (@metaculus)
The Q1 AI Benchmark series launches January 20th! The gap between AI and human forecasting is narrowing, but the rate of progress is uncertain. You are invited to create your own forecasting bot to help track AI progress by competing against the best human forecasters on complex, real-world questions.

• $30,000 prize pool
• API credits courtesy of @OpenAI & @AnthropicAI

We've updated our templates to get your bot forecasting fast on all new question types, including Multiple Choice and Date. Practice questions are open now. Link below.
1
2
5
886