Journal article icon

Journal article

Investigating automated regression models for estimating left ventricular ejection fraction levels in heart failure patients using circadian ECG features

Abstract:
Heart failure (HF) stands as a major global health problem where precise and early prediction of patient prognosis is essential for improving clinical management and patient care. A common obstacle for standard machine learning models in this domain is the prevalent issue of class imbalance within clinical datasets. To overcome this challenge, this study introduces a systematically optimized ensemble learning model for the accurate classification of patient survival. The methodology was applied to a publicly accessible clinical dataset of 299 heart failure patients. Its comprehensive framework included logarithmic transformation, stratified data splitting (80:20), SHAP-based selection of eight key features, and hyperparameter tuning with Optuna over 75 trials, with the specific objective of maximizing the F1-score using 10-fold cross-validation. The performance of three ensemble models (Random Forest, XGBoost, and LightGBM) was refined using decision threshold tuning. The results revealed that the fully optimized Random Forest model yielded superior outcomes, attaining an accuracy of 96.67%, an F1-score of 0.9474, and precision and recall values of 0.95, demonstrating high reliability with only a single instance of a False Negative and False Positive. The study concludes that the systematic application of SHAP, SMOTE, and Optuna within an ensemble framework substantially improves classification performance for imbalanced HF data, surpassing existing benchmarks. This work thus provides a replicable and systematic framework for developing reliable machine learning models from complex, imbalanced medical datasets, contributing a valuable methodology to the field of computational science
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Publisher copy:
10.1371/journal.pone.0295653

Authors

More by this author
Role:
Author
ORCID:
0000-0002-9932-9302
More by this author
Role:
Author
ORCID:
0000-0003-0989-041X
More by this author
Institution:
University of Oxford
Division:
MSD
Department:
Radcliffe Department of Medicine
Sub department:
RDM-Strategic
Role:
Author
ORCID:
0000-0002-5248-6327


Publisher:
Public Library of Science
Journal:
PLoS ONE More from this journal
Volume:
18
Issue:
12
Pages:
e0295653-e0295653
Publication date:
2023-12-11
DOI:
EISSN:
1932-6203
ISSN:
1932-6203


Language:
English
Keywords:
Pubs id:
1585614
Local pid:
pubs:1585614
Source identifiers:
W4389547209
Deposit date:
2026-06-04
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP