Arnim Bleier
470 posts

@arnimb
Computational Social Science and #reproducibility @gesis_org 🐘 @[email protected]. Opinions my own!

DeepSeek r1 is exciting but misses OpenAI’s test-time scaling plot and needs lots of data. We introduce s1, reproducing o1-preview scaling & performance with just 1K samples & a simple test-time intervention. 📜 arxiv.org/abs/2501.19393

👀 A 10-page paper caused a panic because of a math error. I was curious whether AI would spot the error from just the prompt “carefully check the math in this paper”, especially as the info is not in the training data. o1 gets it in a single shot. Should AI checks be standard in science?

We're thrilled to share an update about our continued collaboration with @developmentseed on the @NASA Visualization, Exploration and Data Analysis (VEDA) platform! See how we've made it easier for researchers to explore large geospatial datasets 🌍 2i2c.org/blog/2024/veda…