Journal article icon

Journal article : Review

Systematic review identifies the design and methodological conduct of studies on machine learning-based prediction models

Abstract:

Background and Objectives

We sought to summarize the study design, modelling strategies, and performance measures reported in studies on clinical prediction models developed using machine learning techniques.

Methods

We search PubMed for articles published between 01/01/2018 and 31/12/2019, describing the development or the development with external validation of a multivariable prediction model using any supervised machine learning technique. No restrictions were made based on study design, data source, or predicted patient-related health outcomes.

Results

We included 152 studies, 58 (38.2% [95% CI 30.8–46.1]) were diagnostic and 94 (61.8% [95% CI 53.9–69.2]) prognostic studies. Most studies reported only the development of prediction models (n = 133, 87.5% [95% CI 81.3–91.8]), focused on binary outcomes (n = 131, 86.2% [95% CI 79.8–90.8), and did not report a sample size calculation (n = 125, 82.2% [95% CI 75.4–87.5]). The most common algorithms used were support vector machine (n = 86/522, 16.5% [95% CI 13.5–19.9]) and random forest (n = 73/522, 14% [95% CI 11.3–17.2]). Values for area under the Receiver Operating Characteristic curve ranged from 0.45 to 1.00. Calibration metrics were often missed (n = 494/522, 94.6% [95% CI 92.4–96.3]).

Conclusion

Our review revealed that focus is required on handling of missing values, methods for internal validation, and reporting of calibration to improve the methodological conduct of studies on machine learning–based prediction models.

Systematic review registration

PROSPERO, CRD42019161764.
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Publisher copy:
10.1016/j.jclinepi.2022.11.015

Authors


More by this author
Role:
Author
ORCID:
0000-0002-7745-2887
More by this author
Role:
Author
ORCID:
0000-0001-7401-4593
More by this author
Role:
Author
ORCID:
0000-0002-5529-1541
More by this author
Role:
Author
ORCID:
0000-0002-8032-6224
More by this author
Role:
Author
ORCID:
0000-0001-6798-2078



Publisher:
Elsevier
Journal:
Journal of Clinical Epidemiology More from this journal
Volume:
154
Pages:
8-22
Place of publication:
United States
Publication date:
2022-11-24
Acceptance date:
2022-11-22
DOI:
EISSN:
1878-5921
ISSN:
0895-4356
Pmid:
36436815


Language:
English
Keywords:
Subtype:
Review
Pubs id:
1311806
Local pid:
pubs:1311806
Deposit date:
2024-01-26

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP