Journal article : Review
Identifying pregnancies in routinely collected health data: a scoping review of methods
- Abstract:
- Background: To map and describe the methods used to identify pregnancy episodes in routinely collected health data, to report validation practices, and to assess the transparency and reusability of methods. Methods: This study followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) guidelines. MEDLINE (Ovid), EMBASE (Ovid), OpenGrey, and Google Scholar were searched without time restrictions. Reference lists of relevant studies and reviews were screened for additional citations. All studies that used routinely collected health data to identify pregnancy episodes were included. Search results were imported into a reference management tool and duplicates were removed. Title screening was conducted by one author. Two authors reviewed a subset (10%) of abstracts; because inter-rater agreement exceeded 90%, the remaining abstracts were reviewed by one author. This process was repeated at full-text review and during data extraction, which used a pre-piloted form. Results: From 5,859 records screened, 31 studies were included. Twenty-nine used rule-based, backward-looking algorithms anchored to outcome codes, and two used forward-looking logic from early pregnancy markers. Nine studies incorporated hierarchical logic to estimate pregnancy start date, and 19 introduced biologically plausible gaps between outcomes to mitigate misclassification. Fifteen studies conducted direct validation using chart review, with inconsistent reporting of algorithm sensitivity, specificity, and positive predictive value (PPV). While 22 studies shared code lists, only three provided reusable code. Conclusions: Future efforts should prioritise open-source algorithms, standardised validation protocols, and collaboration with clinical experts to ensure generalisability, reproducibility, and clinical relevance.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
- Publisher copy:
- 10.1186/s12911-026-03423-2
Authors
+ National Institute for Health and Care Research
- Funder identifier:
- https://ror.org/0187kwz08
- Grant:
- NIHR206877
- Publisher:
- BioMed Central
- Journal:
- BMC Medical Informatics and Decision Making
- Volume:
- 26
- Issue:
- 1
- Article number:
- 196
- Publication date:
- 2026-04-18
- Acceptance date:
- 2026-02-28
- DOI:
- EISSN:
- 1472-6947
- ISSN:
- 1472-6947
- Language:
- English
- Keywords:
- Subtype:
- Review
- Source identifiers:
- 4106306
- Deposit date:
- 2026-06-02
- ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
Terms of use
- Copyright date:
- 2026
- Licence:
- CC Attribution (CC BY)