Journal article icon

Journal article

Flow-based fragment identification via binding site-specific latent representations

Abstract:
Fragment-based drug design is a promising strategy leveraging the binding of small chemical moieties that can efficiently guide drug discovery. The initial step of fragment identification remains challenging, as fragments often bind weakly and non-specifically. We developed a protein-fragment encoder that relies on a contrastive learning approach to map both molecular fragments and protein surfaces in a shared latent space. The encoder captures interaction-relevant features and achieves strong discrimination between binding and non-binding regions, reaching ROC–area under the curve values of 0.92 on pocket surfaces and enrichment factors of 22.85 across full protein surfaces. Building on this representation, our generative method LatentFrag produces chemically realistic fragment identities and positions conditioned on the protein surface. LatentFrag improves fragment recovery over docking-based virtual screening, achieving a sampling hit rate more than four times higher at a fraction of its computational cost providing a valuable starting point for fragment hit discovery. We further show the practical utility of LatentFrag and extend the workflow to full ligand design tasks. Together, these approaches contribute to advancing fragment identification and provide valuable tools for fragment-based drug discovery.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Publisher copy:
10.1088/2632-2153/ae5d85

Authors

More by this author
Role:
Author
ORCID:
0000-0002-3289-2766
More by this author
Role:
Author
ORCID:
0000-0002-6214-2827
More by this author
Role:
Author
ORCID:
0009-0000-9924-6921
More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0000-0002-1262-7252
More by this author
Role:
Author
ORCID:
0000-0003-3046-6576


More from this funder
Funder identifier:
10.13039/100018694
Grant:
945363
More from this funder
Funder identifier:
https://ror.org/03qf6ek79
Grant:
225147
More from this funder
Funder identifier:
https://ror.org/00yjd3n13
Grant:
310030 197724


Publisher:
IOP Publishing
Journal:
Machine Learning: Science and Technology More from this journal
Volume:
7
Issue:
3
Pages:
035015
Article number:
035015
Publication date:
2026-05-14
Acceptance date:
2026-04-09
DOI:
EISSN:
2632-2153
ISSN:
2632-2153


Language:
English
Keywords:
Source identifiers:
4049005
Deposit date:
2026-05-14
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP