{"id":556332,"date":"2026-05-04T14:00:00","date_gmt":"2026-05-04T14:00:00","guid":{"rendered":"https:\/\/winklersart.com\/?p=556332"},"modified":"2026-05-04T14:00:00","modified_gmt":"2026-05-04T14:00:00","slug":"a-i-outperformed-doctors-at-diagnosing-real-world-e-r-patients-in-a-new-study-that-doesnt-mean-computers-will-replace-clinicians","status":"publish","type":"post","link":"https:\/\/winklersart.com\/?p=556332","title":{"rendered":"A.I. Outperformed Doctors at Diagnosing Real-World E.R. Patients in a New Study. That Doesn&#8217;t Mean Computers Will Replace Clinicians"},"content":{"rendered":"<header class=\"article-header\">\n<h2 class=\"tagline article-tagline\" itemprop=\"description\">One of OpenAI\u2019s large language models did better than physicians in several experiments, hinting that A.I.-assisted emergency medical care could be around the corner<\/h2>\n<div class=\"article-line\">\n<section class=\"author-box by-line\">\n<div class=\"author-text\">\n<p class=\"author\" itemprop=\"author\">\n<p>          Rudy Molinek<\/p>\n<p>            | <span class=\"author-short-bio\">Reporter<\/span><\/p>\n<p>      <time class=\"pub-date\" itemprop=\"datePublished\" data-pubdate=\"May 4, 2026, 10 a.m.\">May 4, 2026 10:00 a.m.<\/time><\/p><\/div>\n<\/section><\/div>\n<\/header>\n<figure class=\"article-image lead-article-image\">\n<picture class=\"responsive-image\"><source media=\"(max-width: 600px)\" srcset=\"https:\/\/th-thumbnailer.cdn-si-edu.com\/-iTaUJ25rtxyhFNp8Ff0_o5s4YA=\/600x400\/filters:no_upscale():focal(1990x1402:1991x1403)\/https:\/\/tf-cmsv2-smithsonianmag-media.s3.amazonaws.com\/filer_public\/11\/a7\/11a750e6-c82c-4b27-8fdb-8a14f17be4fa\/the_emergency_entrance_at_erlanger_western_carolina_hospital_in_peachtree_north_carolina.jpg\" width=\"600\" height=\"400\"><source media=\"(max-width: 768px)\" srcset=\"https:\/\/th-thumbnailer.cdn-si-edu.com\/r67IoPUEHB0yVmbM0ULx2YsaBiI=\/768x512\/filters:no_upscale():focal(1990x1402:1991x1403)\/https:\/\/tf-cmsv2-smithsonianmag-media.s3.amazonaws.com\/filer_public\/11\/a7\/11a750e6-c82c-4b27-8fdb-8a14f17be4fa\/the_emergency_entrance_at_erlanger_western_carolina_hospital_in_peachtree_north_carolina.jpg\" width=\"768\" height=\"512\"><source media=\"(max-width: 1000px)\" srcset=\"https:\/\/th-thumbnailer.cdn-si-edu.com\/r67IoPUEHB0yVmbM0ULx2YsaBiI=\/768x512\/filters:no_upscale():focal(1990x1402:1991x1403)\/https:\/\/tf-cmsv2-smithsonianmag-media.s3.amazonaws.com\/filer_public\/11\/a7\/11a750e6-c82c-4b27-8fdb-8a14f17be4fa\/the_emergency_entrance_at_erlanger_western_carolina_hospital_in_peachtree_north_carolina.jpg, https:\/\/winklersart.com\/wp-content\/uploads\/2026\/05\/a-i-outperformed-doctors-at-diagnosing-real-world-e-r-patients-in-a-new-study-that-doesnt-mean-computers-will-replace-clinicians.webp 2x\" width=\"768\" height=\"512\"><img decoding=\"async\" src=\"https:\/\/winklersart.com\/wp-content\/uploads\/2026\/05\/a-i-outperformed-doctors-at-diagnosing-real-world-e-r-patients-in-a-new-study-that-doesnt-mean-computers-will-replace-clinicians.webp\" width=\"1026\" height=\"684\" alt=\"ER Entrance\" itemprop=\"image\" loading=\"lazy\">\n            <\/picture><figcaption class=\"caption\">\n<p>                The A.I. model outperformed two doctors when presented with data from dozens of real E.R. patients.<br \/>\n              <span class=\"credit\">Harrison Keely via Wikimedia Commons under CC BY 4.0<\/span><br \/>\n            <\/figcaption><\/figure>\n<p>Since the&nbsp;1950s, scientists have been comparing human doctors to computers to see if the machines\u2019 algorithms can accurately diagnose complex health conditions. In a standard test, computers attempt to puzzle out challenging case studies from the&nbsp;<em>New England Journal of Medicine<\/em>.<\/p>\n<p>Machines have recently improved at the task, primarily because of artificial intelligence built on large language models, like the one powering OpenAI\u2019s ChatGPT. But so far, A.I. has done well only when it comes to curated case studies.<\/p>\n<p>Now, researchers have put an A.I. model\u2014a preview version of&nbsp;OpenAI\u2019s o1\u2014to the diagnostic test with real hospital records. Based on written documentation alone, the technology outperformed practicing physicians, according to a study published April 30 in the journal&nbsp;Science. The findings suggest that A.I. could be a powerful aid to health care workers during stressful, time-sensitive situations.<\/p>\n<p>\u201cThis is the big conclusion for me: It works with the messy real-world data of the emergency department,\u201d says study co-author&nbsp;Adam Rodman, a clinical researcher at Beth Israel Deaconess Medical Center in Boston, to&nbsp;NPR\u2019s&nbsp;Will Stone. \u201cIt works for making diagnoses in the real world.\u201d<\/p>\n<p>In one test, Rodman and his colleagues presented the A.I. and two doctors with E.R. health records of 76 Beth Israel patients at three stages of care: initial triage with a health worker, first interaction with a doctor and admission to the hospital. The experiment had no impact on actual patient care.<\/p>\n<p>These records typically include just a few sparse details, like vital signs, demographic information and a brief description written by staff, per the <em>Guardian<\/em>\u2019s Robert Booth. Initial interactions are critical points in a patient\u2019s care, as sometimes life-or-death decisions must be made quickly in chaotic situations.<\/p>\n<p>Analyses revealed that the two physicians came up with exact or near-exact diagnoses in 50 percent and 55 percent of the cases, while the A.I. was close or exactly right 67 percent of the time.<\/p>\n<p>This test was the \u201cmost important\u201d of the&nbsp;study\u2019s six experiments, says co-author&nbsp;Thomas Buckley, a computer scientist at Harvard Medical School, to&nbsp;<em>Science<\/em>\u2019s&nbsp;Perri Thaler. A.I. did well in the others too\u2014so well that the researchers feared people wouldn\u2019t believe the results, Rodman tells the outlet.<\/p>\n<p>Clinicians are increasingly incorporating A.I. into their daily work,&nbsp;using the programs&nbsp;for tasks like transcribing notes from patient interactions, reviewing health scans and detecting early signs of disease. The new findings suggest that A.I. reasoning models like o1, which can execute and explain step-by-step logical thinking, could soon regularly help doctors formulate diagnoses.<\/p>\n<div class=\"insight\" readability=\"7.7524752475248\">\n<div readability=\"11.19801980198\">\n<p class=\"h4-style\">Did you know? A.I. in breast cancer detection<\/p>\n<p>In a study published in January, researchers found that A.I. could help doctors find hard-to-detect signs of breast cancer. These can be missed during routine mammography screenings, which are generally recommended to take place once every one or two years.<\/p>\n<\/p><\/div>\n<\/div>\n<p>The study is especially notable given that o1 was first released at the end of 2024. \u201cThat\u2019s kind of like ancient history now in machine learning time,\u201d Buckley tells <em>Science<\/em>.<\/p>\n<p>Still, the researchers, as well as doctors and scientists not involved in the new study, caution that these promising results don\u2019t mean physicians are about to be replaced by A.I. For one thing, purely logical reasoning excludes human aspects of a doctor\u2019s work.<\/p>\n<p>\u201cWhen we say clinical reasoning, it doesn\u2019t mean the same thing as moral reasoning,\u201d&nbsp;Arya Rao, a biomedical informaticist at Harvard Medical School who wasn\u2019t involved in the study, tells&nbsp;<em>Science News<\/em>\u2019&nbsp;Kathryn Hulick. \u201cThese models have been optimized to do this kind of sequential thought that we call reasoning, but it\u2019s not at all the same thing as how we teach medical students to reason.\u201d<\/p>\n<p>Additionally, the researchers say the A.I. might not perform as well with larger amounts of patient data, such as from someone who\u2019s admitted to the hospital for a days-long stay.<\/p>\n<p>Going forward, the team plans to conduct clinical trials to figure out how best to integrate A.I. into patient care, reports&nbsp;<em>Science News<\/em>.<\/p>\n<p>This matches the view of physician Nour Khatib, of Oak Valley Health in Canada, who was not involved in the study. \u201cIt\u2019s just another tool to help us give the patient the highest quality care possible,\u201d she tells the&nbsp;CBC\u2019s&nbsp;Nick Logan.<\/p>\n<div id=\"id_related_pages\" class=\"widget-related-articles\">\n<h3>You Might Also Like<\/h3>\n<ul>\n<li>\n<div class=\"containment\">\n<p>May 4, 2026<\/p>\n<\/p><\/div>\n<\/li>\n<li>\n<div class=\"containment\">\n<p>May 1, 2026<\/p>\n<\/p><\/div>\n<\/li>\n<li>\n<div class=\"containment\">\n<p>May 1, 2026<\/p>\n<\/p><\/div>\n<\/li>\n<li>\n<div class=\"containment\">\n<p>May 1, 2026<\/p>\n<\/p><\/div>\n<\/li>\n<li>\n<div class=\"containment\">\n<p>May 1, 2026<\/p>\n<\/p><\/div>\n<\/li>\n<\/ul>\n<\/div>\n<div class=\"in-article-newsletter\">\n<div class=\"leade\" readability=\"4.5563909774436\">\n<h3>Get the latest stories in your inbox every weekday.<\/h3>\n<\/p><\/div>\n<\/p><\/div>\n<section class=\"tag-list\">\n<nav class=\"nav-tags\">\n<\/nav>\n<\/section>\n","protected":false},"excerpt":{"rendered":"<p>One of OpenAI\u2019s large language models did better than physicians in several experiments, hinting that A.I.-assisted emergency medical care could be around the corner Rudy Molinek | Reporter May 4, 2026 10:00 a.m. The A.I. model outperformed two doctors when presented with data from dozens of real E.R. patients. Harrison Keely via Wikimedia Commons under CC BY 4.0 Since the&nbsp;1950s, [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":556333,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"Default","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-556332","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/winklersart.com\/index.php?rest_route=\/wp\/v2\/posts\/556332","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/winklersart.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/winklersart.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/winklersart.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/winklersart.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=556332"}],"version-history":[{"count":0,"href":"https:\/\/winklersart.com\/index.php?rest_route=\/wp\/v2\/posts\/556332\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/winklersart.com\/index.php?rest_route=\/wp\/v2\/media\/556333"}],"wp:attachment":[{"href":"https:\/\/winklersart.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=556332"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/winklersart.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=556332"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/winklersart.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=556332"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}