Journal article icon

Journal article

Imputation Free Deep Survival Prediction with Conditional Variational Autoencoders

Abstract:
Electronic Health Records (EHRs) provide rich opportunities for developing risk prediction tools to support clinical decision-making, yet they are inherently incomplete because data are recorded selectively during routine care. Such missingness may be informative, reflecting clinical judgment and patient status, and missing data patterns can shift between model development and real-world deployment. These challenges limit the reliability and transportability of predictive models in healthcare settings. We propose an imputation-free framework that jointly trains Conditional Variational Autoencoders with deep survival models to enable risk prediction directly from incomplete EHR data. We demonstrate the approach using the deep survival model DeSurv and evaluate its performance through simulation studies and two retrospective cohorts from the Clinical Practice Research Datalink primary care database. The proposed framework consistently outperforms conventional missing data methods, achieving superior performance on ground-truth metrics in simulations and improved calibration-based survival metrics in real-world cohorts. It also demonstrates increased robustness to unseen missingness patterns and distributional shifts. By providing a unified strategy for handling missing data across development, validation, and deployment, this work advances methodological robustness in healthcare informatics and supports more reliable clinical risk prediction in practice.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Authors

More by this author
Institution:
University of Oxford
Role:
Author
More by this author
Institution:
University of Oxford
Role:
Author


More from this funder
Funder identifier:
https://ror.org/029chgv08
Grant:
218529/Z/19/Z
More from this funder
Funder identifier:
https://ror.org/0439y7842
Grant:
EP/S02428X/1
More from this funder
Funder identifier:
https://ror.org/0187kwz08
Grant:
NIHR202632


Publisher:
Springer
Journal:
Journal of Healthcare Informatics Research More from this journal
Volume:
10
Issue:
2
Pages:
275-298
Publication date:
2026-04-10
Acceptance date:
2026-03-31
DOI:
EISSN:
2509-498X
ISSN:
2509-4971


Language:
English
Keywords:
Source identifiers:
4026901
Deposit date:
2026-05-08
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP