Journal article
Adequate sample size for developing prediction models was not simply related to events per variable.
- Abstract:
- The choice of an adequate sample size for a Cox regression analysis is generally based on the rule of thumb derived from simulation studies (Peduzzi et al. (1995)) of a minimum of 10 events per variable (EPV). One simulation study suggested scenarios in which the 10 EPV rule can be relaxed (Vittinghoff and McCulloch (2007)). The effect of a range of binary predictors with varying prevalence, reflecting clinical practice, has not yet been fully investigated.We conducted an extended resampling study using a large general practice data set, comprising over 2 million anonymized patient records, to examine the EPV requirements for prediction models with low-prevalence binary predictors developed using Cox regression. The performance of the models was then evaluated using an independent external validation data set. We investigated both fully specified models and models derived using variable selection.Our results indicated that an EPV rule of thumb should be data-driven and that EPV > 10 generally eliminates bias in regression coefficients when many low-prevalence predictors are included in a Cox model.Higher EPV is needed when low-prevalence predictors are present in a model to eliminate bias in regression coefficients and improve predictive accuracy.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 1.5MB, Terms of use)
-
- Publisher copy:
- 10.1016/j.jclinepi.2016.02.031
Authors
+ Medical Research Council
More from this funder
- Funding agency for:
- Ogundimu, E
- Altman, D
- Collins, G
- Grant:
- PROGnosis RESearch Strategy (PROGRESS) group (G0902393
- G1100513
- G1100513
- Publisher:
- Elsevier
- Journal:
- Journal of clinical epidemiology More from this journal
- Publication date:
- 2016-01-01
- Acceptance date:
- 2016-02-29
- DOI:
- EISSN:
-
1878-5921
- ISSN:
-
0895-4356
- Language:
-
English
- Keywords:
- Pubs id:
-
pubs:610880
- UUID:
-
uuid:8da45333-9351-451c-91f5-7259c2a83ac7
- Local pid:
-
pubs:610880
- Source identifiers:
-
610880
- Deposit date:
-
2016-03-19
- ARK identifier:
Terms of use
- Copyright holder:
- Ogundimu et al
- Copyright date:
- 2016
- Notes:
- Copyright 2016 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record