OpenEvidence

404 posts

OpenEvidence banner
OpenEvidence

OpenEvidence

@EvidenceOpen

OpenEvidence is the most widely used AI-powered medical search, helping doctors access the world's knowledge at the moment it matters.

Miami, FL & San Francisco, CA Katılım Kasım 2022
363 Takip Edilen34.8K Takipçiler
Sabitlenmiş Tweet
OpenEvidence
OpenEvidence@EvidenceOpen·
“We did the hardest thing in the history of American health care. We got the majority of American doctors to all voluntarily adopt a single technology platform.” NBC News on how that happened, what U.S. physicians actually do with OpenEvidence, and how partnerships with NEJM, JAMA, NCCN, and Wiley make it possible.
OpenEvidence tweet media
English
22
52
226
149.2K
OpenEvidence
OpenEvidence@EvidenceOpen·
Rigorous evaluation of medical AI is good for everyone, and we welcome it. Counter to a half-dozen independent studies from institutions such as the Mayo Clinic that were highly positive on OpenEvidence—a lone paper now purports to show that generalized AI beats specialized clinical AI (@UpToDate, @EvidenceOpen). The paper has a massive undisclosed conflict of interest and irredeemable methodological flaws. Behind the scenes: The study authors run a competing in-house medical AI at their hospital, and asked OpenEvidence for an API to power it — including rights to build a "competing product" with OpenEvidence's own API. OpenEvidence declined. Then, this paper coincidentally appeared. Point-by-point, looking closely at the datasets used in the study, the disingenuous and fatal flaws become immediately apparent 🧵.
OpenEvidence tweet media
English
29
72
369
161.4K
OpenEvidence
OpenEvidence@EvidenceOpen·
OpenEvidence@EvidenceOpen

Rigorous evaluation of medical AI is good for everyone, and we welcome it. Counter to a half-dozen independent studies from institutions such as the Mayo Clinic that were highly positive on OpenEvidence—a lone paper now purports to show that generalized AI beats specialized clinical AI (@UpToDate, @EvidenceOpen). The paper has a massive undisclosed conflict of interest and irredeemable methodological flaws. Behind the scenes: The study authors run a competing in-house medical AI at their hospital, and asked OpenEvidence for an API to power it — including rights to build a "competing product" with OpenEvidence's own API. OpenEvidence declined. Then, this paper coincidentally appeared. Point-by-point, looking closely at the datasets used in the study, the disingenuous and fatal flaws become immediately apparent 🧵.

QME
0
0
8
2.4K
OpenEvidence
OpenEvidence@EvidenceOpen·
This paper and its benchmarks are clearly ill-equipped to judge clinical decision support tools or draw conclusions about specialized clinical AI. So how should we actually evaluate these systems? Certainly not by boiling the answer down to a contaminated multiple choice quiz, or by measuring adherence to an arbitrary style guide written by one of the model creators. Here are a few starting ideas: the right evaluation should match the realities and distribution of real-world use of clinical AI. It should engage holistically with real clinical use, end to end. And ultimately, it should measure real clinical impact. Let's figure out the right way to do this in good faith.
English
1
2
42
5.7K
OpenEvidence
OpenEvidence@EvidenceOpen·
Third: Public peer review records indicate that the two flawed datasets above were the only evaluations included in the initial submission. After peer reviewers pointed out that results “lack[ed] epistemic grounding,” the RCQ dataset was subsequently added. The authors do not make the RCQ dataset public, nor do they provide even basic question characteristics or the process they used to create the dataset. They likewise exclude the selection process for the reviewers themselves. Without more information it is challenging to speak in detail, but their obvious conflict of interest and inattention to basic data contamination concerns do not inspire confidence.
OpenEvidence tweet media
English
1
1
27
5.9K
OpenEvidence
OpenEvidence@EvidenceOpen·
Second, misrepresented metrics: HealthBench, created by OpenAI, scores responses largely based on arbitrary/subjective stylistic choices. Here is an example of OE scoring 20% "worse" because it didn’t use a specific email header.
OpenEvidence tweet media
English
1
3
36
6.7K
OpenEvidence
OpenEvidence@EvidenceOpen·
First, contamination: Here’s what happens when you plug common benchmark questions into other LLMs–they have seen the questions–and answers!
OpenEvidence tweet mediaOpenEvidence tweet media
English
1
4
33
7.1K
OpenEvidence retweetledi
American Academy of Ophthalmology
40% of U.S. doctors are already using AI to make real-time clinical decisions. The catch? Most of these tools learned medicine from the open internet. Stephen D. McLeod, MD, CEO of the American Academy of Ophthalmology, makes the case for something better: an AI built on eye care's own gold-standard guidance. A clear-eyed take on where these tools help, where they fail, and what comes next. Read: aao.org/eyenet/article… #AAO #AIMedicine #EyeCare #EyeNetMag #Ophthalmology
American Academy of Ophthalmology tweet media
English
1
6
21
2.8K
OpenEvidence
OpenEvidence@EvidenceOpen·
@JReinerMD @nytimes The “me with unlimited time” line is a great one. Going to be hard not to steal it.
English
0
0
4
357
OpenEvidence retweetledi
Jonathan Reiner
Jonathan Reiner@JReinerMD·
The Open Evidence clinic encounter tool creates a note that’s written as if my scribe was me, but me with unlimited time. It also shows what clinical trial data and guidelines think about my plan. It’s amazing. nytimes.com/2026/06/08/bus… via @NYTimes
English
1
8
26
3.6K
OpenEvidence retweetledi
American Academy of Ophthalmology
We're proud to announce our partnership with @EvidenceOpen. OpenEvidence is the most widely used AI-based clinical decision support tool across the United States. Under this partnership, the Academy will share its Preferred Practice Patterns® and other clinical guidelines with OpenEvidence for incorporation into its training libraries and report on output for quality and consistency. We encourage our members to share their experiences with the OpenEvidence tool, including observations about performance and recommendations for improvements, using a feedback form that is available from the Academy’s OpenEvidence page. Read: aao.org/openevidence #Ophthalmology #EyeCare #MedicalAI #EvidenceBasedMedicine #AAO #OpenEvidence
American Academy of Ophthalmology tweet media
English
0
4
18
2.1K
OpenEvidence
OpenEvidence@EvidenceOpen·
Ophthalmology is a field where the right answer lives three layers down. Not “treat glaucoma,” but which target pressure, for which optic nerve, given which visual field trend, in a patient who can’t tolerate first-line drops. Starting today, the American Academy of Ophthalmology’s Preferred Practice Patterns are integrated into OpenEvidence. Decades of subspecialty judgment from the world’s largest eye physician and surgeon association, now cited and searchable alongside peer-reviewed literature. When an ophthalmologist asks a question at the point of care, the @aao_ophth authoritative guidance is right there in the answer, every claim cited back to the source.
OpenEvidence tweet media
English
2
5
126
1.5M
OpenEvidence retweetledi
Brandon Luu, MD
Brandon Luu, MD@BrandonLuuMD·
AI is not replacing doctors. It is helping us catch what might otherwise be missed and make better decisions, faster. Tools like this are already saving and improving lives.
OpenEvidence@EvidenceOpen

Last summer an ER doctor at Mount Sinai watched two med students and a resident pull up OpenEvidence mid-shift on a hard case. He assumed they were unusual. Then the health system found that a third of its 9,000 physicians were already using it. That's how OpenEvidence spread. Doctors found it, tried it on real cases, and told the doctors next to them. The hospitals are now formalizing what their clinicians already do. @SteveLohr piece in today’s New York Times follows that thread out to a small hospital in Alaska, where Dr. Barbara Creighton uses it on complex cases she’d otherwise send to a paid specialist consult. She calls it “like having a bunch of specialists in your pocket.” The hype around medical AI usually skips the part doctors care about most, which is what the tool is for. Daniel Nadler summed it up in one line: “It’s not an oracle, it’s a tool. Knowledge and knowledge workers still matter.”

English
8
5
106
17.3K
Samyukta Mullangi
Samyukta Mullangi@SamyuktaMD·
@EvidenceOpen The most important bit is that last line -the respect that OE has for the medical profession.
English
1
1
5
848
OpenEvidence
OpenEvidence@EvidenceOpen·
Last summer an ER doctor at Mount Sinai watched two med students and a resident pull up OpenEvidence mid-shift on a hard case. He assumed they were unusual. Then the health system found that a third of its 9,000 physicians were already using it. That's how OpenEvidence spread. Doctors found it, tried it on real cases, and told the doctors next to them. The hospitals are now formalizing what their clinicians already do. @SteveLohr piece in today’s New York Times follows that thread out to a small hospital in Alaska, where Dr. Barbara Creighton uses it on complex cases she’d otherwise send to a paid specialist consult. She calls it “like having a bunch of specialists in your pocket.” The hype around medical AI usually skips the part doctors care about most, which is what the tool is for. Daniel Nadler summed it up in one line: “It’s not an oracle, it’s a tool. Knowledge and knowledge workers still matter.”
English
7
24
284
1.7M
Steve Lohr
Steve Lohr@SteveLohr·
"It’s not an oracle, it’s a tool. . . . Knowledge and knowledge workers still matter."- Daniel Nadler, chief executive of OpenEvidence nytimes.com/2026/06/08/bus…
English
1
2
7
2.1K
OpenEvidence
OpenEvidence@EvidenceOpen·
Thanks for the careful reporting, Steve. A doctor in Fairbanks getting specialist-level answers she’d otherwise pay to consult out shows why physicians are adopting OpenEvidence on their own. Most of medicine happens outside the big academic centers, and that’s where the gap is widest.
English
0
0
3
557