

Steven Dillmann
113 posts

@DillmannSteven
Stanford PhD working on #AI4Science and maintaining Terminal-Bench-Science @StanfordAILab 🧬🤖🪐



RuneBench is out: measuring long horizon goal optimization across 14 AI coding models inside Runescape


Dan Levy, cofounder of SSI, will be our next guest for Saplings! On March 12, we will have a fireside chat with time for audience Q&A. Luma link in thread! We are super excited about this and hope to see y'all there!




Introducing Slingshots // TWO: Research that ships. 14 projects, six institutions – let’s meet the batch 🧵



Knowing which questions to ask is often the hardest part of science. Today we're releasing AutoDiscovery in AstaLabs, an AI system that starts with your data and generates its own hypotheses. 🧪






