Journal article icon

Journal article

Identifying disease phenotypes from linked health data: Comparison of self-report, hospital inpatient and primary care in UK Biobank

Abstract:
Objectives Administrative health data is commonly used for epidemiological research, however it is not well understood how disease phenotypes replicate across different data sources. Approach UK Biobank is a prospective cohort study of 500,000 adults, with ascertainment of health outcomes using administrative health data.  Prevalence at recruitment for 33 diseases were calculated in each health data source: self-report, primary-care, and hospital episode statistics (HES). Consistency of disease identification between sources, and median days between first diagnosis across data sources was determined. Linear regression was used to investigate determinant of differences in the average time between first diagnosis in primary-care and HES data. Results Hypertension was the most commonly identified disease in both self-report and HES (26.5% and 12.1% respectively), and anxiety in primary-care (12.7%). Diseases could be grouped into: 1) those identified largely by self-report alone (e.g. migraine, constipation), with inconsistency in the date first diagnosed; 2) those identified largely by primary care alone (e.g. anxiety, depression), also with inconsistency in the date first diagnosed; and 3) those that appeared mostly in all three sources, with highly consistent date of first report (many were emergency hospital admissions [e.g. stroke]). A number of variables were associated with time between primary-care and HES diagnosis. For example, heavier smokers had a significantly shorter period between first primary-care and first HES record for asthma, diabetes and hypertension. Conclusions and Implications These results indicate that there are inherent biases in diseases ascertained from linked health data that must be taken into account for epidemiological studies
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Publisher copy:
10.23889/ijpds.v9i5.2647

Authors

More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0000-0002-3847-6202
More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0000-0003-0139-2934
More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0000-0003-3035-4697
More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0000-0003-1938-5038


Publisher:
Swansea University
Journal:
International Journal of Population Data Science More from this journal
Volume:
9
Issue:
5
Publication date:
2024-09-10
DOI:
EISSN:
2399-4908
ISSN:
2399-4908


Language:
English
Keywords:
Pubs id:
2031848
Local pid:
pubs:2031848
Source identifiers:
W4402405306
Deposit date:
2026-02-05
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP