Diagens Launches DoctorBench, Setting a New Global Benchmark for Real-World Clinical Performance in Medical Foundation Models
ID: 2248477
For the first time, the evaluation framework places “real-world clinical performance” at the center, constructing a multi-dimensional benchmarking system that closely mirrors authentic diagnostic and treatment scenarios.
As medical foundation models accelerate their transition from laboratory research to clinical application worldwide, the industry has long lacked a metric that genuinely measures a model’s “clinical competence.” Existing evaluations predominantly focus on medical knowledge recall, failing to capture a model’s comprehensive performance in complex clinical contexts. This gap between benchmarking and clinical reality has become a global obstacle hindering the deployment of medical AI.
OpenAI previously launched HealthBench, signaling that leading players are beginning to take this challenge seriously. However, medicine is inherently localized — diagnostic and treatment guidelines, language conventions, and patient populations vary significantly across countries and regions, rendering any single evaluation system insufficient for universal applicability.
Driven by a profound understanding of this global challenge, Diagens developed the DoctorBench platform. The platform’s creation is rooted in nearly a decade of deep collaboration by a cross-disciplinary team. Diagens brought together experts in basic medicine, clinical medicine, artificial intelligence, and the healthcare industry, tightly integrating rigorous clinical logic with cutting-edge deep learning algorithms. This enables DoctorBench to both comprehend the boundaries of AI technology and grasp the intricate demands of clinical practice, using that standard to construct its evaluation framework.
The core philosophy of DoctorBench is no longer to test a model’s “knowledge base,” but to assess its clinical communication and decision-making ability — its capacity to “think like a doctor.” The platform features three leaderboard tracks: the Medical Leaderboard (LLM), the Multimodal Leaderboard (VLM), and the Agent Leaderboard — evaluating textual diagnostic ability, multimodal understanding, and multi-turn decision-making with tool-use inside a simulated clinical environment respectively.
On the evaluation mechanism, DoctorBench pioneers a multi-dimensional architecture combining “2 Core Dimensions (Safety and Accuracy) + 3 General Dimensions (Interaction Quality, Information Prioritization, Proactive Inquiry) + 5 Specialized Modules (Evidence & Citation, Explainable Reasoning, Actionability, Personalized Adaptation, Emotional Support).” It is equipped with “Scenario-Adaptive Weighting,” dynamically adjusting the weight of each dimension according to the risk level of different clinical scenarios, making the scoring logic closely aligned with real-world diagnostic decision-making.
Crucially, the platform designates “Medical Factual Accuracy” and “Safety and Risk Control” as inviolable red lines with a “one-vote veto” power. Any model that exhibits critical deviations on issues affecting patient safety will be unable to achieve a high score, regardless of outstanding performance in other dimensions. This design stems from the team’s deep understanding of the essence of medicine: in a field where lives are at stake, safety is always the paramount principle and leaves no room for compromise.
“The advancement of medical AI is a long-distance race concerning the health and well-being of all humanity. It demands not only disruptive technological innovation and deep cross-disciplinary, cross-regional collaboration, but also an absolute reverence for and unwavering commitment to life and health,” said Dr. Song Ning, Founder of Diagens. He expressed the hope of joining hands with more global research institutions, clinical centers, and industry partners, so that truly capable technologies can be recognized, trusted, and ultimately used to benefit every patient.Unternehmensinformation / Kurzprofil:
Bereitgestellt von Benutzer: acnnewswire
Datum: 01.05.2026 - 07:50 Uhr
Sprache: Deutsch
News-ID 2248477
Anzahl Zeichen: 4462
Kontakt-Informationen:
Stadt:
Hong Kong
Kategorie:
Pharma
Meldungsart: bitte
Versandart: Veröffentlichung
Freigabedatum: 04.30.2026
Diese Pressemitteilung wurde bisher 552 mal aufgerufen.
Die Pressemitteilung mit dem Titel:
"Diagens Launches DoctorBench, Setting a New Global Benchmark for Real-World Clinical Performance in Medical Foundation Models"
steht unter der journalistisch-redaktionellen Verantwortung von
Hangzhou Diagens Biotechnology Co., Ltd. (Nachricht senden)
Beachten Sie bitte die weiteren Informationen zum Haftungsauschluß (gemäß TMG - TeleMedianGesetz) und dem Datenschutz (gemäß der DSGVO).
Weitere Mitteilungen von Hangzhou Diagens Biotechnology Co., Ltd.
Deutschland blickt auf Anusflee: Innovationen in der Analpflege aus dem viertgrößten Pharmamarkt der Welt ...
Der deutsche Markt, der weltweit für seine führende Rolle im Bereich Pharma und Healthcare bekannt ist, richtet seine Aufmerksamkeit auf die südkoreanische Premium-Marke Anusflee. Während Analleiden für viele moderne Menschen ein verschwiegenes Problem darstellen und der Gang zum Arzt oft eine
Everest Medicines Announces Positive First-in-Human Data for Personalized mRNA Cancer Vaccine EVM16 at AACR 2026 ...
HONG KONG, Apr 22, 2026 - (ACN Newswire) - Apr 20, Everest Medicines announced that the first-in-human (FIH) clinical trial data of EVM16, a proprietary personalized mRNA cancer vaccine, were presented at the 2026 American Association for Cancer Research Annual Meeting (AACR 2026). The data include
Rettung statt Vernichtung: Wie GreenRX® Medikamente wieder in die Versorgung bringt ...
Beim Transport von Arzneimitteln entstehen häufig Schäden. In vielen Fällen handelt es sich dabei ausschließlich um kleine Dellen oder Risse in der Verpackung, sogenannte kosmetische Fehler. Das Medikament selbst ist jedoch einwandfrei, allerdings muss die Verpackung individuell begutachtet und
Energieeffizienz im Labor: Moderne Inkubatoren senken Stromverbrauch und CO?! ...
Mit den neuen POL-EKO Inkubatoren, die CiK Solutions nun in Deutschland vertreibt, lassen sich beide Aspekte adressieren. Ein Beispiel ist die ILP-Serie mit Peltier-Technologie: Sie arbeitet vollständig ohne Kompressor und Kältemittel. Das Ergebnis: ein deutlich reduzierter Energieverbrauch im Ver




