Journal article
Predicting pyrazinamide resistance in Mycobacterium tuberculosis using a graph convolutional network
- Abstract:
-
Background
Pyrazinamide is an important first-line antibiotic for treating tuberculosis, with resistance primarily driven by mutations in the pncA gene. Traditional machine learning models are able to predict pyrazinamide resistance with some success but are limited in their ability to incorporate 3-dimensional protein structural information. Graph neural networks offer the potential to integrate protein structure and residue-level features to better predict the impact of mutations on drug resistance.
Results
We trained a graph convolutional network on PncA variants containing missense mutations and evaluated its ability to classify resistance to pyrazinamide. Each PncA variant was represented as an amino acid-level graph, with edges calculated from 3-dimensional spatial proximity, and node features derived from chemical properties and mutation meta-predictors. We used AlphaFold2 to generate predicted structures of the PncA variants, which we used to create the protein graphs. The predicted structures of resistant PncA variants showed greater deviation from the wild-type structure compared to susceptible variants. Our model achieved an F1 score of 81.6%, sensitivity of 81.6% and specificity of 80.4% on the test set and either matched or exceeded the performance of a published set of traditional machine learning models. We show that both structural graph connectivity and node features contribute significantly to model performance. Furthermore, we employ additional train/test dataset splits to demonstrate the GCN’s ability to generalise and predict resistance in samples with mutations in unseen positions and structural regions.
Conclusions
Our study demonstrates that graph-based deep learning can leverage protein structure and biochemical features to accurately predict antimicrobial resistance, despite being trained on a small dataset with little variation. We present this as a proof-of-concept for these methods to be applied to resistance phenotype prediction in more genetically diverse pathogens to predict the more complex observed patterns of antimicrobial resistance.
- Publication status:
- In press
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Supplementary materials, pdf, 8.0MB, Terms of use)
-
(Preview, Accepted manuscript, pdf, 7.1MB, Terms of use)
-
- Publisher copy:
- 10.1186/s12866-026-04876-1
Authors
+ Engineering and Physical Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/0439y7842
- Funding agency for:
- Dissanayake, D
- Adlard, D
- Grant:
- EP/S024093/1
+ Biotechnology and Biological Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/00cwqg982
- Funding agency for:
- Brunner, V
- Grant:
- BB/T008784/1
+ NIHR Oxford Musculoskeletal Biomedical Research Centre
More from this funder
- Funder identifier:
- https://ror.org/00aps1a34
- Grant:
- NIHR207397
- Publisher:
- BioMed Central
- Journal:
- BMC Microbiology More from this journal
- Publication date:
- 2026-03-03
- Acceptance date:
- 2026-02-13
- DOI:
- EISSN:
-
1471-2180
- Language:
-
English
- Keywords:
- Pubs id:
-
2372331
- Local pid:
-
pubs:2372331
- Deposit date:
-
2026-02-13
- ARK identifier:
Terms of use
- Copyright holder:
- Dissanayake et al.
- Copyright date:
- 2026
- Rights statement:
- © The Author(s) 2026. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record