Journal article icon

Journal article

CohortCharacteristics: an R package for population characterisation in observational studies using the OMOP common data model

Abstract:
Describing cohort characterisation ensures comparability and reproducibility in multi-database observational studies. To address this need, we developed CohortCharacteristics, an open-source R package that facilitates standardised cohort characterisation in datasets mapped to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). This study aims to explain the development of the package and demonstrate its core functionality. We developed CohortCharacteristics, an open-source R package that can perform cohort characterisation for various types of databases. To demonstrate its functionality, we then used CohortCharacteristics to generate descriptive statistics on demographics, comorbidities, medication exposures, cohort overlap, and timing of cohort entries. The study included data from CPRD GOLD (UK), DK-DHR (Denmark), IPCI (Netherlands), IQVIA Longitudinal Patient Database Belgium (IQVIA LPD Belgium), IQVIA DA Germany, NAJS (Croatia), and SIDIAP (Spain), all mapped to the OMOP CDM. The CohortCharacteristics R package is freely available on CRAN with detailed vignettes and documentation on its functionality. Cohort characteristics were generally consistent across databases, with similar age distributions and female representation. CPRD GOLD, NAJS, and SIDIAP exhibited higher prescribing rates for respiratory, cardiovascular, and nervous system medications, while IQVIA databases and DK-DHR reported lower rates. Timing analysis showed that dementia diagnoses typically followed insomnia diagnoses in several databases, supporting existing literature. Antipsychotic prescriptions often occurred after dementia diagnosis, reflecting prescribing practices aligned with clinical guidelines. CohortCharacteristics enables consistent cohort characterisation across a network of data mapped to the OMOP CDM, thereby improving transparency in multi-database research. The package’s functionality, demonstrated in this study, illustrates its applicability in observational studies with OMOP CDM data.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1007/s10654-025-01352-4

Authors

More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Sub department:
Botnar Institute for Musculoskeletal Sciences
Role:
Author
ORCID:
0000-0002-9517-8834
More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Sub department:
Botnar Institute for Musculoskeletal Sciences
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Sub department:
Botnar Institute for Musculoskeletal Sciences
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Sub department:
Botnar Institute for Musculoskeletal Sciences
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Sub department:
Botnar Institute for Musculoskeletal Sciences
Role:
Author


Publisher:
Springer
Journal:
European Journal of Epidemiology More from this journal
Pages:
1-11
Publication date:
2026-04-03
DOI:
EISSN:
1573-7284
ISSN:
0393-2990


Language:
English
Keywords:
Pubs id:
2404740
Local pid:
pubs:2404740
Source identifiers:
W7148648599
Deposit date:
2026-04-23
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP