Thesis icon

Thesis

Machine learning for in silico optimisation and design of therapeutic antibodies

Abstract:
Antibodies are essential proteins in our immune systems, defending against foreign pathogens. Their unique ability to bind strongly and specifically to theoretically any target has also made them one of the most important classes of therapeutics. While target binding affinity lies at the heart of therapeutic antibody efficacy, a wide range of properties affecting safety and developability must be considered. Machine learning (ML) offers great promise to overcome the bottlenecks of laborious and trial-and-error experimental optimisation of these properties. In this thesis, I describe the development of ML models for in silico antibody optimisation.

I begin by detailing efforts to predict the effects of mutations on antibody-antigen binding affinity. Using experimental and synthetic data, and an equivariant graph neural network architecture, I demonstrate that there are currently orders of magnitude too little experimental data available for accurate, generalisable prediction. I also investigate the role of dataset diversity and suggest guidelines for robust ML model development and evaluation in this area.

In the next chapter, I explore the interpretability of graph neural network affinity predictions by examining the weighting of interface components. Overall, current methods were unable to provide meaningful insights into the factors most important for model predictions.

As antibody development requires solving a complex, multi-objective optimisation problem beyond affinity, I have also used ML to investigate additional properties. I outline our Random Forest-based approach, trained on millions of sequences, which can distinguish human from non-human antibodies with near-perfect accuracy. These models form the basis of Hu-mAb, our antibody humanization tool.

I then describe our fine-tuning strategy to produce an antibody inverse folding model. AntiFold can guide antibody optimisation by identifying mutations that are predicted to maintain the structure and, therefore, structure-related properties of an antibody.

In my DPhil, I have evaluated the applications and limitations of ML to accelerate multiple steps in the antibody design pipeline. These contributions set the foundation for simultaneous multi-objective optimisation, as well as biasing antibody design towards favourable properties.

Actions

Access Document

Files:

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Statistics
Role:
Author
ORCID:
0000-0002-3023-2588

Contributors

Role:
Supervisor
ORCID:
0000-0003-1388-2252
Role:
Examiner
ORCID:
0000-0001-5100-8836
Role:
Examiner


More from this funder
Funder identifier:
https://ror.org/03x94j517
Funding agency for:
Hummer, AM
Grant:
MR/N013468/1
Programme:
Oxford-MRC Doctoral Training Partnership
More from this funder
Funding agency for:
Hummer, AM
Programme:
Wilfrid Knapp Scholarship


DOI:
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford


Language:
English
Keywords:
Subjects:
Pubs id:
2350276
Local pid:
pubs:2350276
Deposit date:
2025-11-11
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP