Journal article icon

Journal article

Clinically reported covert cerebrovascular disease and risk of neurological disease: a whole-population cohort of 367 988 people using natural language processing

Abstract:
Background The relevance of covert cerebrovascular disease (CCD) in practice is uncertain, partly because estimation of risk in whole clinical populations is difficult. Studies have had success extracting CCD from clinical text using natural language processing (NLP), though they have been limited to specific CCD phenotypes. Here, we used NLP to measure multiple clinically-reported CCD phenotypes in a large clinical cohort and estimated subsequent disease risk in health record data. Methods From all people with brain imaging in Scotland (2010–2018), we selected people with no prior hospitalisation for neurological disease (n=367 988). NLP of imaging reports identified: white matter hypoattenuation or hyperintensities (WMH), lacunes, cortical infarcts and cerebral atrophy. Adjusted HRs (aHRs) were estimated between each phenotype and stroke, dementia and Parkinson’s disease (conditions previously associated with CCD), epilepsy and colorectal cancer (control conditions). Results For each phenotype, the aHR of stroke was WMH 1.4 (95% CI 1.3–1.4), lacunes 1.6 (1.5–1.6), cortical infarct 1.8 (1.7–1.9) and cerebral atrophy 1.1 (1.0–1.1). The aHR of dementia was WMH 1.3 (1.3–1.3), lacunes 1.0 (0.9–1.0), cortical infarct 1.1 (1.1–1.2) and cerebral atrophy 1.7 (1.7–1.8). The aHR of Parkinson’s disease was WMH 1.1 (1.0–1.2), lacunes 1.1 (0.9–1.2), cortical infarct 0.7 (0.6–0.9) and cerebral atrophy 1.4 (1.3–1.5). The aHRs between CCD phenotypes and epilepsy and colorectal cancer were around the null. Conclusion CCD and atrophy have implications for future disease risk and can be identified at scale using NLP of clinical reports. Prevention of neurological disease in people with CCD should be a priority for healthcare policy makers.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1136/jnnp-2025-337689

Authors

More by this author
Role:
Author
ORCID:
0000-0002-7242-0456
More by this author
Institution:
University of Oxford
Division:
Societies
Department:
Kellogg College
Oxford college:
Kellogg College
Role:
Author
ORCID:
0000-0002-3083-436X
More by this author
Role:
Author
ORCID:
0000-0002-5182-8495
More by this author
Role:
Author
ORCID:
0000-0002-0184-9319
More by this author
Role:
Author
ORCID:
0000-0002-1732-9713


Publisher:
BMJ Publishing Group
Journal:
Journal of Neurology, Neurosurgery and Psychiatry More from this journal
Pages:
jnnp-2025
Publication date:
2026-03-13
Acceptance date:
2026-02-24
DOI:
EISSN:
1468-330X
ISSN:
0022-3050


Language:
English
Keywords:
Pubs id:
2392755
Local pid:
pubs:2392755
Source identifiers:
W7135178180
Deposit date:
2026-03-22
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP