Journal article icon

Journal article

Metrics for artificial intelligence in medicine: a reference resource

Abstract:
The effective integration of artificial intelligence (AI) systems into clinical medicine depends on comprehensive and transparent performance evaluation; however, the lack of standardized and widely accepted metrics poses challenges for reproducibility and model adoption. A comprehensive, machine-interpretable framework is presented to formalize the nomenclature and descriptions of 207 graphical, matrix, and scalar metrics used to measure AI model performance. The metrics taxonomy, developed as part of the Radiology Ontology of AI Datasets, Models and Projects (ROADMAP), provides a logically structured representation that captures the semantics of AI evaluation metrics, supports reasoning over metric classes, and enables automated completeness checks for AI model reporting. For each metric, the taxonomy incorporates a definition and citations to authoritative reference sources; where applicable, the taxonomy also includes synonyms, abbreviations, alternate language forms, mathematical formulae, and numerical bounds. The taxonomy supports evaluation of models operating on structured data, medical images, audio signals, and/or unstructured text. Logical axioms link each metric to one or more of 18 AI model performance criteria, including classification, calibration, image segmentation, and text analysis. By harmonizing terminology and enabling structured queries, ROADMAP's taxonomy of AI performance metrics facilitates model comparison, bias detection, and selection of appropriate evaluation methods across diverse datasets and clinical tasks. © RSNA, 2026 See also accompanying Special Report on ROADMAP ontology.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Publisher copy:
10.1148/ryai.260070

Authors

More by this author
Institution:
University of Oxford
Division:
MSD
Department:
Radcliffe Department of Medicine
Oxford college:
Balliol College
Role:
Author
ORCID:
0000-0002-9384-4602


Publisher:
Radiological Society of North America
Journal:
Radiology: Artificial Intelligence More from this journal
Volume:
8
Issue:
3
Article number:
e260070
Place of publication:
United States
Publication date:
2026-03-11
Acceptance date:
2026-02-23
DOI:
EISSN:
2638-6100
ISSN:
2638-6100
Pmid:
41811079


Language:
English
Pubs id:
2388506
Local pid:
pubs:2388506
Deposit date:
2026-04-30
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP