

@james_y_zou's talk is on, an exciting look at how AI agents are starting to function as collaborative scientists, not just assistants, especially for biomedical discovery and healthcare research. @CAISconf @Stanford @StanfordHAI
Batu El
26 posts



@james_y_zou's talk is on, an exciting look at how AI agents are starting to function as collaborative scientists, not just assistants, especially for biomedical discovery and healthcare research. @CAISconf @Stanford @StanfordHAI

🚀 Today, we’re excited to introduce SimpleTES for scaling the scientific discovery loop. 🧵 I always ask myself: what are we actually scaling in scientific discovery? Most LLM discovery methods focus on test-time scaling generation — more tokens, more agents, more turns. But science advances through the evaluation-driven loops: propose → evaluate → refine → repeat. SimleTES captures this idea, discovering SOTA solutions across 21 scientific problems! Key discoveries: 🏎️ 2.17x faster lasso solver than glmnet — the gold-standard LASSO solver, engineered for decades. ⚛️ 24.5% fewer quantum routing overhead on IBM Q20 — superior than previous standard library LightSABRE. 📐 0.380868 on Erdős Minimum Overlap — outperforming previous solutions from mixed-frontier ensembles or humans. 🧬 0.74 on Tabula Muris (scRNA-seq denoising) — new SOTA, generalizing to unseen tissue types without retraining. #LLM #AI4Science #ScalingLaws #SimpleTES #MachineLearning





Most modern multi-agent systems use pre-specified workflows, fixed roles, and aggregation rules. As agents handle increasingly complex tasks, what happens when we can't specify optimal workflows ahead of time? We study this in our work "Multi-Agent Teams Hold Experts Back" 🧵



Most modern multi-agent systems use pre-specified workflows, fixed roles, and aggregation rules. As agents handle increasingly complex tasks, what happens when we can't specify optimal workflows ahead of time? We study this in our work "Multi-Agent Teams Hold Experts Back" 🧵

