Bruno Andreis

1 post


@andreisbru

University of Oxford(@OxfordTVG) | KAIST(@MLAI_KAIST)

Joined December 2022
20 Following · 9 Followers
Bruno Andreis retweeted
Sumeet Motwani (@sumeetrm)
Very excited to announce HorizonMath with @erikyw26 and collaborators! How can we measure AI progress on mathematical discovery? Turns out there’s several classes of problems where discovery is hard but verification is easy. We develop a benchmark with 101 such problems and test GPT 5.4 Pro, Claude 4.6 Opus, and Gemini 3.1 Pro. Pending expert review, GPT 5.4 Pro finds two potentially novel solutions that beat existing baselines🧵