Journal article icon

Journal article

Variation in multimorbidity estimates due to clinical coding: a cross-sectional study of 7.2 million adults in England

Abstract:
Objective: To examine how variation in clinical coding systems and the number of conditions included under different study criteria influence estimates of multimorbidity prevalence in a nationally representative adult population in England. Methods and analysis: We conducted a cross-sectional analysis of anonymised records from 7.2 million adults in the Clinical Practice Research Datalink, linked to Hospital Episode Statistics, covering the period from 1987 to 2020. Adults were included if they had at least two recorded health conditions. Multimorbidity was defined as ≥2 health conditions selected from a list of 54 conditions. Prevalence was estimated separately using general practice (GP) data, hospital data and combined sources, and stratified by age, sex, ethnicity and deprivation. A stepwise inclusion approach assessed the impact of expanding the number of conditions included after different study criteria. Gradient boosting (XGBoost) with Shapley Additive Explanations values identified predictors of multimorbidity recorded only in GP data. Directionality was examined using Pearson correlations. Results: Multimorbidity prevalence was 92.3% using GP data, 63.2% using hospital data and 100% when both sources were combined. Prevalence increased consistently as more conditions were included under different study criteria and was always higher in GP data. Discrepancies were most pronounced among younger adults and ethnic minority groups. GP-only coding was associated with younger age, female sex, shorter hospital stays, absence of Accident & Emergency use, no palliative care coding and lower deprivation. Conclusion: Estimates of multimorbidity prevalence are highly sensitive to both the clinical coding system used and the number of conditions included under different study criteria. Standardised approaches to condition selection and the integration of data sources are essential to ensure accurate measurement and equitable representation.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1136/bmjph-2025-004120

Authors

More by this author
Role:
Author
ORCID:
0000-0003-3261-3680
More by this author
Role:
Author
ORCID:
0009-0006-7720-7634
More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0000-0002-6170-4402
More by this author
Role:
Author
ORCID:
0000-0003-0175-443X



Publisher:
BMJ Publishing Group
Journal:
BMJ Public Health More from this journal
Volume:
4
Issue:
1
Article number:
bmjph-2025-004120
Publication date:
2026-01-05
Acceptance date:
2025-12-17
DOI:
EISSN:
2753-4294
ISSN:
2753-4294


Language:
English
Keywords:
Pubs id:
2357737
UUID:
uuid_f1d47d73-cada-4e64-977d-2b9503218bfd
Local pid:
pubs:2357737
Source identifiers:
3643267
Deposit date:
2026-01-08
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP