Journal article
A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models
- Abstract:
- OBJECTIVE: To compare performance of logistic regression (LR) with machine learning (ML) for clinical prediction modeling. STUDY DESIGN AND SETTING: We conducted a Medline literature search (1/2016 to 8/2017), and extracted comparisons between LR and ML models for binary outcomes. RESULTS: We included 71 out of 927 studies. The median sample size was 1250 (range 72-3,994,872), with 19 predictors considered (range 5-563) and 8 events per predictor (range 0.3-6,697). The most common ML methods were classification trees (30 studies), random forests (28), artificial neural networks (26), and support vector machines (24). Sixty-four (90%) studies used the area under the receiver operating characteristic curve (AUC) to assess discrimination. Calibration was not addressed in 56 (79%) studies. We identified 282 comparisons between a LR and ML model (AUC range, 0.52-0.99). For 145 comparisons at low risk of bias, the difference in logit(AUC) between LR and ML was 0.00 (95% confidence interval, -0.18 to 0.18). For 137 comparisons at high risk of bias, logit(AUC) was 0.34 (0.20 to 0.47) higher for ML. CONCLUSIONS: We found no evidence of superior performance of ML over LR for clinical prediction modeling, but improvements in methodology and reporting are needed for studies that compare modeling algorithms.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 1.4MB, Terms of use)
-
- Publisher copy:
- 10.1016/j.jclinepi.2019.02.004
Authors
- Publisher:
- Elsevier
- Journal:
- Journal of Clinical Epidemiology More from this journal
- Volume:
- 110
- Pages:
- 12-22
- Publication date:
- 2019-02-11
- Acceptance date:
- 2019-02-05
- DOI:
- ISSN:
-
1878-5921
- Language:
-
English
- Keywords:
- Pubs id:
-
pubs:973554
- UUID:
-
uuid:4167c905-5171-4fc7-b157-ff36d56a73db
- Local pid:
-
pubs:973554
- Source identifiers:
-
973554
- Deposit date:
-
2019-02-15
Terms of use
- Copyright holder:
- Elsevier Inc
- Copyright date:
- 2019
- Rights statement:
- © 2019 Elsevier Inc. All rights reserved.
- Notes:
- This is the accepted manuscript version of the article. The final version is available online from Elsevier at: https://doi.org/10.1016/j.jclinepi.2019.02.004
If you are the owner of this record, you can report an update to it here: Report update to this record