Ethan Mollick: "This paper shows people are asking a lot of medical questions of AI already, but"

Post

This paper shows people are asking a lot of medical questions of AI already, but we have little evidence of how good or bad this is. Most of the published research uses old models & compares to doctors. How do new models compare to the info people would have gotten without AI?

English

255

26.9K

Ethan Mollick@emollick·19 Nis

We only have spotty information about this very important topic. It suggests AI can be good at diagnosis, but the real world doesn't always match the experiments. x.com/emollick/statu…

Ethan Mollick@emollick

Across most medical benchmarks, including when real cases & human doctors are involved, there is a clear trend of AI models improving over time (and many where today's AI beats human doctors) But we do not have many studies measuring real-world performance of AI in medicine, yet

English

14.8K

Austin Meyer@austingmeyer·19 Nis

Most importantly and probably the biggest bottleneck, is they need to stop testing against benchmarks that consist of cases that have been well curated and summarized by medical professionals. If people want to see how well these models perform, they need to allow random patients to input their own queries and allow the models to ask their own questions in response. The cognitive disorganization of real patients without a doctor filtering it for the models is probably a significant limitation in training.

English

Paylaş