Preprint
Multidimensional proteomics and explainable AI feature selection identify cross-platform lung cancer molecular signature in blood plasma
- Abstract:
- Lung cancer is the leading cause of cancer mortality worldwide with 70% diagnosed late stage despite low-dose computed tomography (LDCT) screening availability. We combined data-independent acquisition mass-spectrometry (DIA-MS) and proximity extension assay (PEA) with explainable artificial intelligence (XAI)-led machine learning (ML) for plasma-based biomarker discovery. From a 490 lung cancer and 124 matched-control cohort, ML models were trained to predict lung cancer achieving an AUROC of 0.91 [95% CI: 0.88–0.93] and 0.97 [95% CI: 0.92–0.98] in DIA-MS and PEA, respectively. XAI characterised networks of model-consistent features primarily related to infection and inflammatory responses. We then introduced a DNA-aptamer proteomics method and identified a cross-platform concordance panel, with performances of 0.88 [95% CI: 0.80–0.90] and 0.88 [95% CI: 0.81–0.95] in DIA-MS and PEA, respectively. This study demonstrates that combining multi-dimensional proteomics with XAI-ML can characterise robust biomarker signatures.
- Publication status:
- Published
Actions
Access Document
- Files:
-
-
(Preview, Pre-print, pdf, 3.5MB, Terms of use)
-
- Preprint server copy:
- 10.21203/rs.3.rs-7660411/v1
Authors
- Preprint server:
- Research Square
- Publication date:
- 2025-10-31
- DOI:
- Language:
-
English
- Keywords:
- Pubs id:
-
2310313
- UUID:
-
uuid_b629aa79-63cc-49af-bb7f-707ba4d5a0b9
- Local pid:
-
pubs:2310313
- Source identifiers:
-
W4415732690
- Deposit date:
-
2025-12-13
- ARK identifier:
Terms of use
- Copyright holder:
- Liu et al.
- Copyright date:
- 2025
- Rights statement:
- Copyright © 2025 The Author(s). This work is licensed under a Creative Commons Attribution 4.0 International License.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record