Journal article
The diagnosis of dengue in patients presenting with acute febrile illness using supervised machine learning and impact of seasonality
- Abstract:
- Background: Symptomatic dengue infection can result in a life-threatening shock syndrome and timely diagnosis is essential. Point-of-care tests for non-structural protein 1 and IgM are used widely but performance can be limited. We developed a supervised machine learning model to predict whether patients with acute febrile illnesses had a diagnosis of dengue or other febrile illnesses (OFI). The impact of seasonality on model performance over time was examined. Methods: We analysed data from a prospective observational clinical study in Vietnam. Enrolled patients presented with an acute febrile illness of <72 h duration. A gradient boosting model (XGBoost) was used to predict final diagnosis using age, sex, haematocrit, platelet, white cell, and lymphocyte count collected on enrolment. Data was randomly split 80/20% into a training and hold-out set, respectively, with the latter not used in model development. Cross-validation and hold out set testing was used, with performance over time evaluated through a rolling window approach. Results: We included 8,100 patients recruited between 16th October 2010 and 10th December 2014. In total 2,240 (27.7%) patients were diagnosed with dengue infection. The optimised model from training data had an overall median area under the receiver operator curve (AUROC) of 0.86 (interquartile range 0.84–0.86), specificity of 0.92, sensitivity of 0.56, positive predictive value of 0.73, negative predictive value (NPV) of 0.84, and Brier score of 0.13 in predicting the final diagnosis, with similar performances in hold-out set testing (AUROC of 0.86). Model performances varied significantly over time as a function of seasonality and other factors. Incorporation of a dynamic threshold which continuously learns from recent cases resulted in a more consistent performance throughout the year (NPV >90%). Conclusion: Supervised machine learning models are able to discriminate between dengue and OFI diagnoses in patients presenting with an early undifferentiated febrile illness. These models could be of clinical utility in supporting healthcare decision-making and provide passive surveillance across dengue endemic regions. Effects of seasonality and changing disease prevalence must however be taken into account—this is of significant importance given unpredictable effects of human-induced climate change and the impact on health.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, 386.0KB, Terms of use)
-
- Publisher copy:
- 10.3389/fdgth.2022.849641
Authors
Contributors
- Publisher:
- Frontiers Media
- Journal:
- Frontiers in Digital Health More from this journal
- Volume:
- 4
- Article number:
- 849641
- Publication date:
- 2022-03-14
- Acceptance date:
- 2022-02-15
- DOI:
- EISSN:
-
2673-253X
- Pmid:
-
35360365
- Language:
-
English
- Keywords:
- Pubs id:
-
1253483
- Local pid:
-
pubs:1253483
- Deposit date:
-
2022-06-10
Terms of use
- Copyright holder:
- Ming et al.
- Copyright date:
- 2022
- Rights statement:
- Copyright © 2022 Ming, Tuan, Hernandez, Sangkaew, Vuong, Chanh, Chau, Simmons, Wills, Georgiou, Holmes and Yacoub. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record