{"id":26205,"date":"2025-06-20T18:31:24","date_gmt":"2025-06-20T18:31:24","guid":{"rendered":"https:\/\/www.tun.com\/home\/?p=26205"},"modified":"2025-06-20T18:31:25","modified_gmt":"2025-06-20T18:31:25","slug":"human-ai-collectives-lead-to-most-accurate-medical-diagnoses-new-study","status":"publish","type":"post","link":"https:\/\/www.tun.com\/home\/human-ai-collectives-lead-to-most-accurate-medical-diagnoses-new-study\/","title":{"rendered":"Human-AI Collectives Lead to Most Accurate Medical Diagnoses: New Study"},"content":{"rendered":"\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-constrained wp-block-group-is-layout-constrained\">\n<div class=\"wp-block-uagb-blockquote uagb-block-e7eb3fc3 uagb-blockquote__skin-border uagb-blockquote__stack-img-none\"><blockquote class=\"uagb-blockquote\"><div class=\"uagb-blockquote__content\">In a landmark study, combining human expertise with AI has proven to significantly improve diagnostic accuracy in medicine, offering a transformative path forward for patient care and safety.<\/div><footer><div class=\"uagb-blockquote__author-wrap uagb-blockquote__author-at-left\"><\/div><\/footer><\/blockquote><\/div>\n\n\n\n<div class=\"wp-block-group is-content-justification-space-between is-nowrap is-layout-flex wp-container-core-group-is-layout-b0ffac9c wp-block-group-is-layout-flex\"><div style=\"font-size:16px\" class=\"has-text-align-left wp-block-post-author\"><div class=\"wp-block-post-author__content\"><p class=\"wp-block-post-author__name\">The University Network<\/p><\/div><\/div>\n\n\n<div class=\"wp-block-uagb-social-share uagb-social-share__outer-wrap uagb-social-share__layout-horizontal uagb-block-ee584a31\">\n<div class=\"wp-block-uagb-social-share-child uagb-ss-repeater uagb-ss__wrapper uagb-block-ec619ce7\"><span class=\"uagb-ss__link\" data-href=\"https:\/\/www.facebook.com\/sharer.php?u=\" tabindex=\"0\" role=\"button\" aria-label=\"facebook\"><span class=\"uagb-ss__source-wrap\"><span class=\"uagb-ss__source-icon\"><svg xmlns=\"https:\/\/www.w3.org\/2000\/svg\" viewBox=\"0 0 512 512\"><path d=\"M504 256C504 119 393 8 256 8S8 119 8 256c0 123.8 90.69 226.4 209.3 245V327.7h-63V256h63v-54.64c0-62.15 37-96.48 93.67-96.48 27.14 0 55.52 4.84 55.52 4.84v61h-31.28c-30.8 0-40.41 19.12-40.41 38.73V256h68.78l-11 71.69h-57.78V501C413.3 482.4 504 379.8 504 256z\"><\/path><\/svg><\/span><\/span><\/span><\/div>\n\n\n\n<div class=\"wp-block-uagb-social-share-child uagb-ss-repeater uagb-ss__wrapper uagb-block-32d99934\"><span class=\"uagb-ss__link\" data-href=\"https:\/\/twitter.com\/share?url=\" tabindex=\"0\" role=\"button\" aria-label=\"twitter\"><span class=\"uagb-ss__source-wrap\"><span class=\"uagb-ss__source-icon\"><svg xmlns=\"https:\/\/www.w3.org\/2000\/svg\" viewBox=\"0 0 512 512\"><path d=\"M389.2 48h70.6L305.6 224.2 487 464H345L233.7 318.6 106.5 464H35.8L200.7 275.5 26.8 48H172.4L272.9 180.9 389.2 48zM364.4 421.8h39.1L151.1 88h-42L364.4 421.8z\"><\/path><\/svg><\/span><\/span><\/span><\/div>\n\n\n\n<div class=\"wp-block-uagb-social-share-child uagb-ss-repeater uagb-ss__wrapper uagb-block-1d136f14\"><span class=\"uagb-ss__link\" data-href=\"https:\/\/www.linkedin.com\/shareArticle?url=\" tabindex=\"0\" role=\"button\" aria-label=\"linkedin\"><span class=\"uagb-ss__source-wrap\"><span class=\"uagb-ss__source-icon\"><svg xmlns=\"https:\/\/www.w3.org\/2000\/svg\" viewBox=\"0 0 448 512\"><path d=\"M416 32H31.9C14.3 32 0 46.5 0 64.3v383.4C0 465.5 14.3 480 31.9 480H416c17.6 0 32-14.5 32-32.3V64.3c0-17.8-14.4-32.3-32-32.3zM135.4 416H69V202.2h66.5V416zm-33.2-243c-21.3 0-38.5-17.3-38.5-38.5S80.9 96 102.2 96c21.2 0 38.5 17.3 38.5 38.5 0 21.3-17.2 38.5-38.5 38.5zm282.1 243h-66.4V312c0-24.8-.5-56.7-34.5-56.7-34.6 0-39.9 27-39.9 54.9V416h-66.4V202.2h63.7v29.2h.9c8.9-16.8 30.6-34.5 62.9-34.5 67.2 0 79.7 44.3 79.7 101.9V416z\"><\/path><\/svg><\/span><\/span><\/span><\/div>\n<\/div>\n<\/div>\n<\/div><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Hybrid diagnostic collectives consisting of human experts and artificial intelligence systems significantly outperform traditional diagnosis methods, according to an international study led by the Max Planck Institute for Human Development.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Diagnostic errors remain a critical challenge in medical practice. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">While AI systems, especially large language models (LLMs) like ChatGPT-4, Gemini and Claude 3, offer innovative diagnostic support, they occasionally generate false information and reflect existing biases.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">A research team from the Max Planck Institute for Human Development, in collaboration with the Human Diagnosis Project in San Francisco and the Institute of Cognitive Sciences and Technologies of the Italian National Research Council, has investigated the optimal collaboration between humans and AI. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The study&#8217;s results are promising: hybrid diagnostic collectives, combining human and AI inputs, yield significantly higher diagnostic accuracy than either humans or AI alone, particularly in complex, open-ended cases.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">&#8220;Our results show that cooperation between humans and AI models has great potential to improve patient safety,&#8221; lead author Nikolas Z\u00f6ller, a postdoctoral researcher in the Center for Adaptive Rationality at the Max Planck Institute for Human Development, said in a news release.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Realistic Simulations and Comprehensive Analysis<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The research team utilized data from over 2,100 clinical vignettes provided by the Human Diagnosis Project. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">These case studies, paired with correct diagnoses, enabled a comparison between diagnoses made by medical professionals and those generated by five leading AI models. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The researchers simulated various diagnostic scenarios \u2014 individuals, human collectives, AI models and mixed human\u2013AI collectives \u2014 resulting in an analysis of more than 40,000 diagnoses.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The study revealed that while multiple AI models collectively outperformed 85% of human diagnosticians, human experts still excelled in many cases. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Notably, the combination of human and AI inputs led to the highest diagnostic accuracy. This approach leverages the complementary nature of human and AI errors: when one fails, the other often succeeds, creating a powerful safety net.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cIt\u2019s not about replacing humans with machines. Rather, we should view artificial intelligence as a complementary tool that unfolds its full potential in collective decision-making,\u201d added co-author Stefan Herzog, a senior research scientist at the Max Planck Institute for Human Development.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Challenges and Future Directions<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Despite the promising results, the researchers emphasize that the study was limited to text-based clinical vignettes and did not involve live clinical settings. Further studies are needed to determine whether these findings translate to real-world practice.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Additionally, the research focused solely on diagnosis rather than treatment, and the accuracy of a diagnosis does not always ensure optimal treatment outcomes. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Practical implementation and acceptance of AI-based support systems by medical staff and patients, as well as potential biases, remain areas for future research.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Broader Applications and Ethical Considerations<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The study, part of the Horizon Europe-funded Hybrid Human Artificial Collective Intelligence in Open-Ended Decision Making (HACID) project, aims to enhance clinical decision-support systems by integrating human and machine intelligence. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The potential applications extend beyond health care, including the legal system, disaster response and climate policy.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cThe approach can also be transferred to other critical areas \u2014 such as the legal system, disaster response or climate policy \u2014 anywhere that complex, high-risk decisions are needed. For example, the HACID project is also developing tools to enhance decision-making in climate adaptation,\u201d added co-author Vito Trianni, a coordinator of the HACID project.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Conclusion<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Hybrid human\u2013AI collectives exhibit unparalleled potential in improving diagnostic accuracy and patient safety. As research progresses, this innovative approach could revolutionize health care delivery, ultimately leading to more equitable and effective patient care worldwide.<\/p>\n\n\n\n<div style=\"height:13px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Source:<\/strong> <a href=\"https:\/\/www.mpib-berlin.mpg.de\/press-releases\/human-ai-collectives-medicine\" target=\"_blank\" rel=\"noopener\" title=\"\">Max Planck Institute for Human Development<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Hybrid diagnostic collectives consisting of human experts and artificial intelligence systems significantly outperform traditional diagnosis methods, according to an international study led by the Max Planck Institute for Human Development. Diagnostic errors remain a critical challenge in medical practice. While AI systems, especially large language models (LLMs) like ChatGPT-4, Gemini and Claude 3, offer innovative [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"single-no-separators","format":"standard","meta":{"_acf_changed":false,"_uag_custom_page_level_css":"","_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[8,27],"tags":[272],"class_list":["post-26205","post","type-post","status-publish","format-standard","hentry","category-ai","category-health-care","tag-max-planck-institute"],"acf":[],"aioseo_notices":[],"uagb_featured_image_src":{"full":false,"thumbnail":false,"medium":false,"medium_large":false,"large":false,"1536x1536":false,"2048x2048":false},"uagb_author_info":{"display_name":"The University Network","author_link":"https:\/\/www.tun.com\/home\/author\/funky_junkie\/"},"uagb_comment_info":0,"uagb_excerpt":"Hybrid diagnostic collectives consisting of human experts and artificial intelligence systems significantly outperform traditional diagnosis methods, according to an international study led by the Max Planck Institute for Human Development. Diagnostic errors remain a critical challenge in medical practice. While AI systems, especially large language models (LLMs) like ChatGPT-4, Gemini and Claude 3, offer innovative&hellip;","_links":{"self":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts\/26205","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/comments?post=26205"}],"version-history":[{"count":8,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts\/26205\/revisions"}],"predecessor-version":[{"id":26278,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/posts\/26205\/revisions\/26278"}],"wp:attachment":[{"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/media?parent=26205"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/categories?post=26205"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.tun.com\/home\/wp-json\/wp\/v2\/tags?post=26205"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}