Thesis icon

Thesis

POEM: prediction of immunogenic epitopes using mechanistic modelling of the MHC class I antigen processing pathway

Abstract:

Existing machine learning algorithms predicting class I antigen presentation are fundamentally awed due to the nature of the immunopeptidomics data used for their training. These models, rather than predicting antigen abundance, primarily indicate the presence of antigens on the cell surface. In this thesis, we integrate machine learning with mechanistic modelling to develop an enhanced model of the class I antigen processing pathway.

We begin by constructing a probabilistic model of epitope and precursor production by the proteasome, making use of existing algorithms to predict cleavage sites. We then develop a novel predictor of TAP binding affinity, PanTAP, which outperforms existingmethodsandformsaccuratepredictionsacrossdi erentmammalianspecies. Following this, we use a similar approach to train a predictive model of ERAP1 enzyme kinetics, enabling us to simulate the trimming of any potential substrate by ERAP1. These models are subsequently used to extend a previously validated systems biology model of peptide loading to MHC-I. This mechanistic model is validated using a study of SIINFEKL precursor processing and presentation in wild-type and ERAP1 knockdown cell lines, enabling us to infer the role of cytosolic aminopeptidases in epitope generation.

Finally, we use the validated mechanistic model to develop a new Predictor Of immunogenic Epitopes using Mechanistic modelling (POEM), employing a logistic regression trained on a dataset of neoantigens of known immunogenicity. POEM demonstrates superior efficacy on the training set and an independent dataset of GBM neoantigens. Furthermore, POEM accurately predicts the immunogenicity of pathogenic epitopes using a combined dataset from the IEDB, with its performance further validated through analysis of SARS-CoV-2 peptides across various HLA allo- types. Insights suggest that integrating source protein expression data could enhance POEM's predictions.

We conclude the thesis with a discussion of the results within the context of immunotherapy development and ideas for how our analysis may be further improved to provide clinical utility.

Actions

Access Document

Files:

Authors

More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Sub department:
Kennedy Institute for Rheumatology
Oxford college:
St Anne's College
Role:
Author
ORCID:
0000-0002-6647-7114

Contributors

Institution:
University of Oxford
Division:
MPLS
Department:
Mathematical Institute
Sub department:
Mathematical Institute
Oxford college:
Brasenose College
Role:
Supervisor
ORCID:
0000-0002-6888-4362
Institution:
University of Oxford
Division:
MSD
Department:
NDM
Oxford college:
Oriel College
Role:
Supervisor
ORCID:
0000-0003-1097-0222
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Sub department:
Kennedy Institute for Rheumatology
Role:
Supervisor
ORCID:
0000-0001-8079-9358


More from this funder
Funder identifier:
https://ror.org/00cwqg982
Grant:
2270506
Programme:
iCASE BBSRC


DOI:
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford


Language:
English
Keywords:
Subjects:
Pubs id:
2042827
Local pid:
pubs:2042827
Deposit date:
2024-10-22
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP