Post

Ethan Mollick
Ethan Mollick@emollick·
This paper shows people are asking a lot of medical questions of AI already, but we have little evidence of how good or bad this is. Most of the published research uses old models & compares to doctors. How do new models compare to the info people would have gotten without AI?
Ethan Mollick tweet media
English
29
34
255
26.9K
Austin Meyer
Austin Meyer@austingmeyer·
Most importantly and probably the biggest bottleneck, is they need to stop testing against benchmarks that consist of cases that have been well curated and summarized by medical professionals. If people want to see how well these models perform, they need to allow random patients to input their own queries and allow the models to ask their own questions in response. The cognitive disorganization of real patients without a doctor filtering it for the models is probably a significant limitation in training.
English
0
0
0
14
Paylaş