Journal article
Data set augmentation allows deep learning-based virtual screening to better generalize to unseen target classes and highlight important binding interactions
- Abstract:
- Current deep learning methods for structure-based virtual screening take the structures of both the protein and the ligand as input but make little or no use of the protein structure when predicting ligand binding. Here, we show how a relatively simple method of data set augmentation forces such deep learning methods to take into account information from the protein. Models trained in this way are more generalizable (make better predictions on protein/ligand complexes from a different distribution to the training data). They also assign more meaningful importance to the protein and ligand atoms involved in binding. Overall, our results show that data set augmentation can help deep learning-based virtual screening to learn physical interactions rather than data set biases.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, 6.8MB, Terms of use)
-
- Publisher copy:
- 10.1021/acs.jcim.0c00263
Authors
- Publisher:
- American Chemical Society
- Journal:
- Journal of Chemical Information and Modeling More from this journal
- Volume:
- 60
- Issue:
- 8
- Pages:
- 3722-3730
- Place of publication:
- United States
- Publication date:
- 2020-07-23
- Acceptance date:
- 2020-07-23
- DOI:
- EISSN:
-
1549-960X
- ISSN:
-
1549-9596
- Pmid:
-
32701288
- Language:
-
English
- Keywords:
- Pubs id:
-
1123443
- Local pid:
-
pubs:1123443
- Deposit date:
-
2020-08-25
Terms of use
- Copyright holder:
- American Chemical Society
- Copyright date:
- 2020
- Rights statement:
- © 2020 American Chemical Society.
- Notes:
- This is the accepted manuscript version of the article. The final version is available online from the American Chemical Society at: https://doi.org/10.1021/acs.jcim.0c00263
If you are the owner of this record, you can report an update to it here: Report update to this record