Identifying direct risk factors in UK Biobank via simultaneous Bayesian-frequentist model-averaged hypothesis testing using Doublethink

Arning, N; Fryer, HR; Wilson, DJ

COVID-19 Collection

Journal article

Abstract:: Big data approaches to discovering nongenetic risk factors have lagged behind genome-wide association studies that routinely uncover novel genetic risk factors for diverse diseases. Instead, epidemiology typically focuses on candidate risk factors. Since modern biobanks contain thousands of potential risk factors, candidate approaches may introduce bias, inadequately control for multiple testing, and overlook important signals. Doublethink, a model-averaged hypothesis testing approach, offers a solution that simultaneously controls the Bayesian false discovery rate (FDR) and frequentist familywise error rate (FWER) while accounting for uncertainty in variable selection. Here, we investigate direct risk factors for COVID-19 hospitalization from among 1,912 variables in 201,917 UK Biobank participants by implementing a Doublethink-based exposome-wide association study using Markov Chain Monte Carlo. Focusing on the 2020 outbreak, we find nine individual variables and seven groups of variables exposome-wide significant at 9% FDR and 0.05% FWER. We identify significant direct effects among relatively overlooked risk factors including aging, dementia, and prior infection, which we evaluate in relation to studies of other populations. We detect significant direct effects among some commonly reported risk factors like age, sex, and obesity, but not others like cardiovascular disease. The effects of hypertension, depression, and diabetes appeared to be mediated via general comorbidity. Doublethink produces interchangeable posterior odds and P-values for individual variables and arbitrary groups, facilitating flexible and powerful post hoc hypothesis testing. We discuss the potential for impact and limitations of joint Bayesian-frequentist hypothesis testing, including the benefits of an agnostic exposome-wide approach to discovery.

Publication status:: Published

Peer review status:: Peer reviewed

Cite this record

APA Style

Arning, N., Fryer, H. R., & Wilson, D. J. (2026). Identifying direct risk factors in UK Biobank via simultaneous Bayesian-frequentist model-averaged hypothesis testing using Doublethink. Proceedings of the National Academy of Sciences, 123(1).

MLA Style

Arning, N, et al. “Identifying Direct Risk Factors in UK Biobank via Simultaneous Bayesian-Frequentist Model-Averaged Hypothesis Testing Using Doublethink.” Proceedings of the National Academy of Sciences, vol. 123, no. 1, 2026.

Chicago Style

Arning, N, HR Fryer, and DJ Wilson. 2026. “Identifying Direct Risk Factors in UK Biobank via Simultaneous Bayesian-Frequentist Model-Averaged Hypothesis Testing Using Doublethink.” Proceedings of the National Academy of Sciences 123 (1).

Access Document

Files:: Arning_et_al_2026_Identifying_direct_risk.pdf

(Version of record, PDF, 1.7 MB)

Publisher copy:: 10.1073/pnas.2514138122

Authors

Arning, N

Institution:: University of Oxford
Division:: MSD
Department:: Nuffield Department of Population Health
Sub department:: Big Data Institute
Role:: Author

Fryer, HR

Institution:: University of Oxford
Division:: MSD
Department:: Nuffield Department of Population Health
Sub department:: Big Data Institute
Role:: Author
ORCID:: 0000-0001-9987-8160

Wilson, DJ

Institution:: University of Oxford
Role:: Author
ORCID:: 0000-0002-0940-3311

Funding

National Institute for Health Research Health Protection Research Unit (NIHR HPRU)

Funder identifier:: https://doi.org/10.13039/100018336

Wellcome Trust (WT)

Funder identifier:: https://doi.org/10.13039/100010269

Royal Society (The Royal Society)

Funder identifier:: https://doi.org/10.13039/501100000288

Robertson Foundation (The Robertson Foundation)

Funder identifier:: https://doi.org/10.13039/100013961

Bibliographic Details

Publisher:: National Academy of Sciences
Journal:: Proceedings of the National Academy of Sciences
Volume:: 123
Issue:: 1
Article number:: e2514138122
Publication date:: 2026-01-02
Acceptance date:: 2025-11-09
DOI:: 10.1073/pnas.2514138122
EISSN:: 1091-6490
ISSN:: 0027-8424

Language:: English
Keywords:: UK Biobank; FDR; COVID-19 hospitalization; FWER; exposome-wide association studies
Source identifiers:: 3623160
Deposit date:: 2026-01-02
ARK identifier:: ark:/29072/ora_ae918b04d01f44d387e2d0f5cbe92627

Terms of use

Licence:: CC Attribution (CC BY)
