Journal article
Investigating the volume and diversity of data needed for generalizable antibody–antigen ΔΔ G prediction
- Abstract:
- Antibody–antigen binding affinity lies at the heart of therapeutic antibody development: efficacy is guided by specific binding and control of affinity. Here we present Graphinity, an equivariant graph neural network architecture built directly from antibody–antigen structures that achieves test Pearson’s correlations of up to 0.87 on experimental change in binding affinity (ΔΔG) prediction. However, our model, like previous methods, appears to be overtraining on the few hundred experimental data points available and performance is not robust to train–test cut-offs. To investigate the amount and type of data required to generalizably predict ΔΔG, we built synthetic datasets of nearly 1 million FoldX-generated and >20,000 Rosetta Flex ddG-generated ΔΔG values. Our results indicate that there are currently insufficient experimental data to accurately and robustly predict ΔΔG, with orders of magnitude more likely needed. Dataset size is not the only consideration; diversity is also an important factor for model predictiveness. These findings provide a lower bound on data requirements to inform future method development and data collection efforts.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 4.7MB, Terms of use)
-
(Supplementary materials, Terms of use)
-
- Publisher copy:
- 10.1038/s43588-025-00823-8
Authors
- Publisher:
- Nature Research
- Journal:
- Nature Computational Science More from this journal
- Volume:
- 5
- Issue:
- 8
- Pages:
- 635-647
- Publication date:
- 2025-07-08
- Acceptance date:
- 2025-05-21
- DOI:
- EISSN:
-
2662-8457
- Language:
-
English
- Source identifiers:
-
3226115
- Deposit date:
-
2025-08-23
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
If you are the owner of this record, you can report an update to it here: Report update to this record