Journal article
Flow-based fragment identification via binding site-specific latent representations
- Abstract:
- Fragment-based drug design is a promising strategy leveraging the binding of small chemical moieties that can efficiently guide drug discovery. The initial step of fragment identification remains challenging, as fragments often bind weakly and non-specifically. We developed a protein-fragment encoder that relies on a contrastive learning approach to map both molecular fragments and protein surfaces in a shared latent space. The encoder captures interaction-relevant features and achieves strong discrimination between binding and non-binding regions, reaching ROC–area under the curve values of 0.92 on pocket surfaces and enrichment factors of 22.85 across full protein surfaces. Building on this representation, our generative method LatentFrag produces chemically realistic fragment identities and positions conditioned on the protein surface. LatentFrag improves fragment recovery over docking-based virtual screening, achieving a sampling hit rate more than four times higher at a fraction of its computational cost providing a valuable starting point for fragment hit discovery. We further show the practical utility of LatentFrag and extend the workflow to full ligand design tasks. Together, these approaches contribute to advancing fragment identification and provide valuable tools for fragment-based drug discovery.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 3.7MB, Terms of use)
-
- Publisher copy:
- 10.1088/2632-2153/ae5d85
Authors
+ HORIZON EUROPE Marie Sklodowska-Curie Actions
More from this funder
- Funder identifier:
- 10.13039/100018694
- Grant:
- 945363
+ Swiss National Science Foundation
More from this funder
- Funder identifier:
- https://ror.org/00yjd3n13
- Grant:
- 310030 197724
- Publisher:
- IOP Publishing
- Journal:
- Machine Learning: Science and Technology More from this journal
- Volume:
- 7
- Issue:
- 3
- Pages:
- 035015
- Article number:
- 035015
- Publication date:
- 2026-05-14
- Acceptance date:
- 2026-04-09
- DOI:
- EISSN:
-
2632-2153
- ISSN:
-
2632-2153
- Language:
-
English
- Keywords:
- Source identifiers:
-
4049005
- Deposit date:
-
2026-05-14
- ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
Terms of use
- Copyright date:
- 2026
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record