Journal article
Effects of data amount and variation in deep learning-based tuberculosis diagnosis in chest X-ray scans
- Abstract:
-
Pulmonary tuberculosis ranks among the world’s deadliest diseases, causing devastating global health harm. Despite diagnosing millions annually, COVID-19’s aftermath and inadequate screening within developing countries interfered with diagnosis of tuberculosis, hindering proper treatment and increasing tuberculosis mortality. Our study aims to enhance diagnostic opportunities by developing a deep-learning model to categorize pulmonary X-ray scans into “normal” and “tuberculosis” classes. Exercising feature extraction, we incorporated VGG-16 (Visual Geometry Group network with 16 layers) architecture to enhance model classification accuracy via transfer learning. We hypothesized that models trained on a greater amount and variety of data would perform better than those trained on less invariable data. Testing our hypothesis, we developed four models with replicate architectures trained on pulmonary X-ray scans from the Montgomery County Chest X-ray Dataset (Dataset A), the Shenzhen Chest X-ray Dataset (Dataset B), the Tuberculosis Chest X-ray Database (Dataset C), and the combined data (Dataset A, B, and C). Data amount and variability increased from Dataset A to C and was the largest in the combined datasets. Testing these models on each dataset used in training, we found the mean accuracy values were 45.6% for Dataset A, 62.0% for Dataset B, 82.0% for Dataset C, and 95.9% for the combined dataset. This indicates that models trained on more data perform more accurately across datasets due to greater data variation and amount. Additionally, our study highlights the efficacy of deep learning models in tuberculosis diagnosis, emphasizing the importance of data variability and a wider pulmonary X-ray database for potential implementation in global health.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 668.2KB, Terms of use)
-
- Publisher copy:
- 10.59720/24-278
Authors
- Publisher:
- Journal of Emerging Investigators, Inc
- Journal:
- Journal of Emerging Investigators More from this journal
- Volume:
- 8
- Pages:
- 1-8
- Publication date:
- 2025-04-28
- Acceptance date:
- 2024-12-13
- DOI:
- ISSN:
-
2638-0870
- Language:
-
English
- Keywords:
- Pubs id:
-
2068032
- Local pid:
-
pubs:2068032
- Deposit date:
-
2024-12-01
Terms of use
- Copyright holder:
- Bhorkar and Gonzales
- Copyright date:
- 2025
- Rights statement:
- Copyright © 2025 Bhorkar and Gonzales. All JEI articles are distributed under the attribution non-commercial, no derivative license. This means that anyone is free to share, copy and distribute an unaltered article for non-commercial purposes provided the original author and source are credited.
If you are the owner of this record, you can report an update to it here: Report update to this record