Journal article icon

Journal article

Benchmarking inverse folding models for antibody CDR sequence design

Abstract:
Antibody-based therapies are at the forefront of modern medicine, addressing diverse challenges across oncology, autoimmune diseases, infectious diseases, and beyond. The ability to design antibodies with enhanced functionality and specificity is critical for advancing next-generation therapeutics. Recent advances in artificial intelligence (AI) have propelled the field of antibody engineering, particularly through inverse folding models for Complementarity-Determining Region (CDR) sequence design. These models aim to generate novel antibody sequences that fold into desired structures with high antigen-binding affinity. However, current evaluation metrics, such as amino acid recovery rates, are limited in their ability to assess the structural and functional accuracy of designed sequences. This study benchmarks state-of-the-art inverse folding models-ProteinMPNN, ESM-IF, LM-Design, and AntiFold-using comprehensive datasets and alternative evaluation metrics like sequence similarity. By systematically analyzing recovery rates, mutation prediction capabilities, and amino acid composition biases, we identify strengths and limitations across models. AntiFold exhibits superior performance in Fab antibody design, whereas LM-Design demonstrates adaptability across diverse antibody types, including VHH antibodies. In contrast, models trained on general protein datasets (e.g., ProteinMPNN and ESM-IF) struggle with antibody-specific nuances. Key insights include the models' varying reliance on antigen structure and their distinct capabilities in capturing critical residues for antigen binding. Our findings highlight the need for enhanced training datasets, integration of functional data, and refined evaluation metrics to advance antibody design tools. By addressing these challenges, future models can unlock the full potential of AI-driven antibody engineering, paving the way for innovative therapeutic applications.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1371/journal.pone.0324566

Authors

More by this author
Role:
Author
ORCID:
0000-0003-1470-8862
More by this author
Role:
Author
ORCID:
0000-0002-6106-1152
More by this author
Role:
Author
ORCID:
0000-0002-5605-728X
More by this author
Role:
Author
ORCID:
0000-0003-0969-3993
More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0009-0008-1818-7789


Publisher:
Public Library of Science
Journal:
PLoS ONE More from this journal
Volume:
20
Issue:
6
Pages:
e0324566-e0324566
Publication date:
2025-06-04
DOI:
EISSN:
1932-6203
ISSN:
1932-6203


Language:
English
Keywords:
Pubs id:
2375214
Local pid:
pubs:2375214
Source identifiers:
W4411020279
Deposit date:
2026-02-17
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP