Jacob Rosenthal
108 posts

Jacob Rosenthal
@_JacobRosenthal
Medicine & ML @TriIMDPhD @Cornell_Tech // Working on the future of AI-augmented medical diagnostics, treatments, and systems // 🦉🪴♟️☕️🏃♂️


Preprint out today that tests o1-preview's medical reasoning experiments against a baseline of 100s of clinicians. In this case the title says it all: Superhuman performance of a large language model on the reasoning tasks of a physician Link: arxiv.org/abs/2412.10849 A 🧵⬇️



We’re excited to announce two @Nature publications from Project AMIE (Articulate Medical Intelligence Explorer), a research AI system optimized for diagnostic reasoning and conversations 💬 Paper 1: goo.gle/4lpQ8xg Paper 2: goo.gle/3G4DNPe







Interesting explanation of the state of AI research. US labs focused on the scaling race, which used up all resources, preventing the usual multiple exploratory side projects. "Exploit" mode rather than "explore" mode, as the rewards from scaling seemed so high. Hence DeepSeek


We’re releasing Humanity’s Last Exam, a dataset with 3,000 questions developed with hundreds of subject matter experts to capture the human frontier of knowledge and reasoning. State-of-the-art AIs get <10% accuracy and are highly overconfident. @ai_risk @scaleai

“The current pass/fail era and resultant shadow economy of effort risk creating a triple harm by devaluing clinical excellence, burning out medical students, and potentially producing superficial, or worse inauthentic, academic and community work.” doi.org/10.1097/ACM.00…

True
