Drug discovery under covariate shift with domain-informed prior distributions over functions

Klarner, L; Rudner, T; Reutlinger, M; Schindler, T; Morris, G; Deane, CM; Yeh, YW

AI Collection

Conference item

Drug discovery under covariate shift with domain-informed prior distributions over functions

Abstract:: Accelerating the discovery of novel and more effective therapeutics is an important pharmaceutical problem in which deep learning is playing an increasingly significant role. However, real-world drug discovery tasks are often characterized by a scarcity of labeled data and significant covariate shift—a setting that poses a challenge to standard deep learning methods. In this paper, we present Q-SAVI, a probabilistic model able to address these challenges by encoding explicit prior knowledge of the data-generating process into a prior distribution over functions, presenting researchers with a transparent and probabilistically principled way to encode data-driven modeling preferences. Building on a novel, gold-standard bioactivity dataset that facilitates a meaningful comparison of models in an extrapolative regime, we explore different approaches to induce data shift and construct a challenging evaluation setup. We then demonstrate that using Q-SAVI to integrate contextualized prior knowledge of drug-like chemical space into the modeling process affords substantial gains in predictive accuracy and calibration, outperforming a broad range of state-of-the-art self-supervised pre-training and domain adaptation techniques.

Publication status:: Published

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Klarner, L., Rudner, T., Reutlinger, M., Schindler, T., Morris, G., Deane, C. M., & Yeh, Y. W. (2023). Drug discovery under covariate shift with domain-informed prior distributions over functions. In K. Cho, B. Engelhardt, S. Sabato, J. Scarlett, A. Krause, & E. Brunskill (Eds.), 40th International Conference on Machine Learning (ICML 2023) (Vol. 202, pp. 17176–17197). Journal of Machine Learning Research.

MLA Style

Klarner, L, et al. “Drug Discovery under Covariate Shift with Domain-Informed Prior Distributions over Functions.” 40th International Conference on Machine Learning (ICML 2023), edited by K Cho et al., vol. 202, Journal of Machine Learning Research, 2023, pp. 17176–97. Proceedings of Machine Learning Research.

Chicago Style

Klarner, L, T Rudner, M Reutlinger, T Schindler, G Morris, CM Deane, and YW Yeh. 2023. “Drug Discovery under Covariate Shift with Domain-Informed Prior Distributions over Functions.” In 40th International Conference on Machine Learning (ICML 2023), edited by K Cho, B Engelhardt, S Sabato, J Scarlett, A Krause, and E Brunskill, 202:17176–97. Proceedings of Machine Learning Research. Journal of Machine Learning Research.
Print

Access Document

Files:: Klarner_et_al_2023_drug_discovery_under.pdf

(Preview, Version of record, pdf, 2.1MB, Terms of use)

Publication website:: https://proceedings.mlr.press/v202/klarner23a.html

Authors

+ Klarner, L More by this author

Role:: Author

+ Rudner, T More by this author

Role:: Author

+ Reutlinger, M More by this author

Role:: Author

+ Schindler, T More by this author

Role:: Author

+ Morris, G More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Oxford college:: Green Templeton College;Green Templeton College;Green Templeton College;Green Templeton College;Green Templeton College;Green Templeton College
Role:: Author
ORCID:: 0000-0003-1731-8405

More authors...

Contributors

+ Krause, A

Role:: Editor

+ Brunskill, E

Role:: Editor

+ Cho, K

Role:: Editor

+ Engelhardt, B

Role:: Editor

+ Sabato, S

Role:: Editor

More contributors...

Publisher:: Journal of Machine Learning Research
Volume:: 202
Pages:: 17176-17197
Series:: Proceedings of Machine Learning Research
Publication date:: 2023-08-31
Acceptance date:: 2023-06-14
Event title:: 40th International Conference on Machine Learning (ICML 2023)
Event location:: Honolulu, Hawaii, USA
Event website:: https://icml.cc/Conferences/2023/Dates
Event start date:: 2023-07-23
Event end date:: 2023-07-29
ISSN:: 2640-3498

Language:: English
Keywords:: FFR
Pubs id:: 1493157
Local pid:: pubs:1493157
Deposit date:: 2023-07-18
ARK identifier:: ark:/29072/ora_733eaa50f8354eadadc49e76a4c79c46

Terms of use

Copyright holder:: Klarner et al
Notes:: This paper was presented at the 40th International Conference on Machine Learning (ICML 2023), 23rd-29th July 2023, Honolulu, Hawaii, USA. This is the accepted manuscript version of the article. The final version is available online from Proceedings of Machine Learning Research at: https://proceedings.mlr.press/v202/klarner23a.html

Licence:: Terms and Conditions of Use for Oxford University Research Archive

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Conference item

Drug discovery under covariate shift with domain-informed prior distributions over functions

Actions

Access Document

Authors

Contributors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Conference item

Drug discovery under covariate shift with domain-informed prior distributions over functions

Actions

Access Document

Authors

Contributors

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions