Journal article
Investigating automated regression models for estimating left ventricular ejection fraction levels in heart failure patients using circadian ECG features
- Abstract:
- Heart failure (HF) stands as a major global health problem where precise and early prediction of patient prognosis is essential for improving clinical management and patient care. A common obstacle for standard machine learning models in this domain is the prevalent issue of class imbalance within clinical datasets. To overcome this challenge, this study introduces a systematically optimized ensemble learning model for the accurate classification of patient survival. The methodology was applied to a publicly accessible clinical dataset of 299 heart failure patients. Its comprehensive framework included logarithmic transformation, stratified data splitting (80:20), SHAP-based selection of eight key features, and hyperparameter tuning with Optuna over 75 trials, with the specific objective of maximizing the F1-score using 10-fold cross-validation. The performance of three ensemble models (Random Forest, XGBoost, and LightGBM) was refined using decision threshold tuning. The results revealed that the fully optimized Random Forest model yielded superior outcomes, attaining an accuracy of 96.67%, an F1-score of 0.9474, and precision and recall values of 0.95, demonstrating high reliability with only a single instance of a False Negative and False Positive. The study concludes that the systematic application of SHAP, SMOTE, and Optuna within an ensemble framework substantially improves classification performance for imbalanced HF data, surpassing existing benchmarks. This work thus provides a replicable and systematic framework for developing reliable machine learning models from complex, imbalanced medical datasets, contributing a valuable methodology to the field of computational science
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 2.8MB, Terms of use)
-
- Publisher copy:
- 10.1371/journal.pone.0295653
Authors
- Publisher:
- Public Library of Science
- Journal:
- PLoS ONE More from this journal
- Volume:
- 18
- Issue:
- 12
- Pages:
- e0295653-e0295653
- Publication date:
- 2023-12-11
- DOI:
- EISSN:
-
1932-6203
- ISSN:
-
1932-6203
- Language:
-
English
- Keywords:
- Pubs id:
-
1585614
- Local pid:
-
pubs:1585614
- Source identifiers:
-
W4389547209
- Deposit date:
-
2026-06-04
- ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
Terms of use
- Copyright date:
- 2023
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record