Journal article : Review
Machine learning-based asthma attack prediction models from routinely collected electronic health records: systematic scoping review
- Abstract:
-
Background: An early warning tool to predict attacks could enhance asthma management and reduce the likelihood of serious consequences. Electronic health records (EHRs) providing access to historical data about patients with asthma coupled with machine learning (ML) provide an opportunity to develop such a tool. Several studies have developed ML-based tools to predict asthma attacks.
Objective: This study aims to critically evaluate ML-based models derived using EHRs for the prediction of asthma attacks.
Methods: We systematically searched PubMed and Scopus (the search period was between January 1, 2012, and January 31, 2023) for papers meeting the following inclusion criteria: (1) used EHR data as the main data source, (2) used asthma attack as the outcome, and (3) compared ML-based prediction models’ performance. We excluded non-English papers and nonresearch papers, such as commentary and systematic review papers. In addition, we also excluded papers that did not provide any details about the respective ML approach and its result, including protocol papers. The selected studies were then summarized across multiple dimensions including data preprocessing methods, ML algorithms, model validation, model explainability, and model implementation.
Results: Overall, 17 papers were included at the end of the selection process. There was considerable heterogeneity in how asthma attacks were defined. Of the 17 studies, 8 (47%) studies used routinely collected data both from primary care and secondary care practices together. Extreme imbalanced data was a notable issue in most studies (13/17, 76%), but only 38% (5/13) of them explicitly dealt with it in their data preprocessing pipeline. The gradient boosting–based method was the best ML method in 59% (10/17) of the studies. Of the 17 studies, 14 (82%) studies used a model explanation method to identify the most important predictors. None of the studies followed the standard reporting guidelines, and none were prospectively validated.
Conclusions: Our review indicates that this research field is still underdeveloped, given the limited body of evidence, heterogeneity of methods, lack of external validation, and suboptimally reported models. We highlighted several technical challenges (class imbalance, external validation, model explanation, and adherence to reporting guidelines to aid reproducibility) that need to be addressed to make progress toward clinical adoption.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 278.2KB, Terms of use)
-
- Publisher copy:
- 10.2196/46717
Authors
- Publisher:
- JMIR Publications
- Journal:
- JMIR AI More from this journal
- Volume:
- 2
- Issue:
- 1
- Article number:
- e46717
- Publication date:
- 2023-12-07
- Acceptance date:
- 2023-10-09
- DOI:
- EISSN:
-
2817-1705
- Pmid:
-
38875586
- Language:
-
English
- Keywords:
- Subtype:
-
Review
- Pubs id:
-
2039733
- Local pid:
-
pubs:2039733
- Deposit date:
-
2024-10-25
Terms of use
- Copyright holder:
- Budiarto et al.
- Copyright date:
- 2023
- Rights statement:
- © Arif Budiarto, Kevin C H Tsang, Andrew M Wilson, Aziz Sheikh, Syed Ahmar Shah. Originally published in JMIR AI (https://ai.jmir.org), 07.12.2023. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR AI, is properly cited. The complete bibliographic information, a link to the original publication on https://www.ai.jmir.org/, as well as this copyright and license information must be included.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record