Journal article icon

Journal article

Development and validation of a parsimonious AI-based risk score for mortality in heart failure: a UK cohort study

Abstract:
Background: Accurate risk stratification in heart failure (HF) populations is crucial. Existing simple scores, such as MAGGIC, show limited discrimination (C-index < 0.80) and require specialised tests like echocardiography. Conversely, while advanced Artificial Intelligence (AI) models offer greater accuracy, their operational complexity and data access requirements often hinder clinical adoption. Purpose: To develop and validate a parsimonious, high-performing AI-based risk model for all-cause mortality in HF patients by leveraging insights from a complex AI model and feature engineering. Methods: Using a cohort of 373,389 patients with HF (≥18 years; 1,153 English practices for derivation, 289 for validation) from the Clinical Practice Research Datalink (CPRD) Aurum dataset, we developed and validated an AI-based risk prediction model for all-cause mortality. A MAGGIC-based Cox proportional hazards model adapted for electronic health records (EHR) was established as a performance benchmark. An initial Multi-layer Perceptron (MLP) model with a survival framework was trained with MAGGIC variables. This MLP model was then enhanced by incorporating highly predictive comorbidity features—identified by an explainable Transformer model trained on longitudinal electronic health record (EHR) data—and feature distillation to derive a final, optimised 11-variable model (named MLP-Distilled). The model employed readily ascertainable variables including age, BMI, year of birth (as a proxy for birth cohort effects), and key comorbidities such as cancers. Results: MLP-Distilled demonstrated significantly improved discriminatory performance (C-index: 0.801, 95% CI [0.796, 0.806]) compared to the benchmark MAGGIC-EHR model (0.735, [0.729, 0.742]), while maintaining excellent calibration. This parsimonious model uses fewer and more accessible variables than MAGGIC (11 vs. 13) and requires no specialised tests. The model was validated for secondary outcomes of cardiovascular events and rehospitalisation. Conclusion: Utilising knowledge from a complex EHR-trained AI model combined with feature distillation yielded a parsimonious yet powerful risk score for HF mortality. This novel approach offers a pathway to clinically implementable tools that balance predictive accuracy with pragmatic utility, potentially improving routine HF risk stratification.Discrimination_Calibration.jpeg Impact_Analysis_50_Threshold.jpeg
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1093/ehjdh/ztaf143.036

Authors

More by this author
Institution:
University of Oxford
Role:
Author
More by this author
Institution:
University of Oxford
Role:
Author
More by this author
Institution:
University of Oxford
Role:
Author


Publisher:
Oxford University Press
Journal:
European Heart Journal – Digital Health More from this journal
Volume:
7
Issue:
Supplement_1
Article number:
ztaf143.036
Publication date:
2026-01-12
DOI:
EISSN:
2634-3916


Language:
English
Source identifiers:
3656228
Deposit date:
2026-01-12
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP