Study: ChatGPT Health underestimated the severity of medical emergencies 51.6% of the time and overestimated the severity of nonurgent cases 64.8% of the time
Researchers tested different medical scenarios with the chatbot. In more than half of cases where doctors would send a patient to the ER …
NBC News · Kaan Ozcan
Related Coverage
- ChatGPT Health performance in a structured test of triage recommendations Nature Medicine
- OpenAI's ChatGPT Health chatbot struggles to identify urgent medical cases, study reveals Moneycontrol · Ankita Chakravarti
- ChatGPT Health Underestimates Medical Emergencies, Study Finds Gizmodo · Ece Yildirim
- Is ChatGPT Health safe? Study finds AI missed half of medical emergencies Digit · Vyom Ramani
- AI: Errors? Hey, No Problemo. Beyond Search · Stephen E. Arnold
- Is ChatGPT Health Reliable? Study Finds It ‘Underestimating’ Health Concerns, Emergencies International Business Times · Isaiah Richard
- ChatGPT misses ‘high-risk emergencies’ when it is used as a doctor, study finds The Independent · Andrew Griffin
- ‘Transformative’ tech: How AI is plugging in to doctor's offices and emergency rooms Chicago Tribune · Emily Brindley
Discussion
-
@urocklive1
on bluesky
Seems maybe Dr. Oz's plan to replace the rural hospitals that are closing with AI is not yet ready for Prime Time. — OpenAI's health chatbot can pass medical exams, but is doing a bad job of correctly diagnosing health problems and assessing how critical they are. — www.nbcne…
-
@drianweissman
on bluesky
ChatGPT Health ‘under-triaged’ half of medical emergencies in a new study. Researchers tested different medical scenarios with the chatbot. In more than half of cases in which doctors would send patients to the ER, the chatbot said it was OK to delay care. — www.nbcnews.com/h…
-
@cingraham
Chris Ingraham
on bluesky
First independent evaluation of ChatGPT Health: “Among gold-standard emergencies, the system under-triaged 52% of cases... Crisis intervention messages activated unpredictably across suicidal ideation presentations.” www.nature.com/articles/s41...
-
@erictopol
Eric Topol
on bluesky
🆕 at @naturemedicine.bsky.social — How does ChatGPT Health do for appropriately triaging a person as to whether to go to the emergency room or stay home? www.nature.com/articles/s41... Not very well. Under-triaged 52% of case vignettes that are considered gold-standard emerge…
-
@dannycrichton
Danny Crichton
on x
I find that ChatGPT for health is basically a neutral observer (which matches this data - everything it returns is the median crisis level). How critical you take the information it offers is really left up to the user, which isn't great