A novel deep semantic- and vision-based self-attention architecture for skin cancer classification

Aftab, J; Khan, MA; Arshad, S; Hussain, A; Alsenan, S; Cho, Y; Nam, Y

AI Collection

Journal article

A novel deep semantic- and vision-based self-attention architecture for skin cancer classification

Abstract:: Objectives: In the world, skin cancer is a significant health concern, and early diagnosis of this cancer plays a key role in improving patient outcomes. The early detection of this cancer reduces the death rate, but due to the complexity of the diagnosis, incorrect detection and prediction are provided by the experts. Therefore, it is essential to propose a computer-aided diagnostic system based on deep learning and explainable Artificial Intelligence (XAI) techniques that can be used as a second opinion in clinics and help physicians more accurately detect and predict this type of cancer. Methods: This work presents the proposed deep learning architecture consisting of two modules—skin lesion segmentation and lesion type classification. The proposed architecture is interpreted using XAI techniques to better evaluate the black-box model. In the skin lesion segmentation phase, we implemented DeepLab V3 architecture for semantic segmentation. The ResNet-18 model was used as the backbone, and later hyperparameters were optimized using Bayesian Optimization (BO). In the classification phase, we design a FusedNet architecture called Inverted self-attention with Vision Transformer (ISAwViT). The proposed fused network combines an inverted self-attention residual architecture with a vision transformer. The proposed fused network extracted feature information more deeply than performing an accurate prediction in a later stage. The design model is trained, and later in the testing phase, extracted features are classified using Softmax and several other classifiers. Results: The lesion segmentation and classification experiment was conducted on the HAM10000 dataset. The accuracy achieved by the HAM10000 dataset was 95.16% for lesion segmentation and 97.5% for lesion classification. Conclusion: Compared with recent techniques, the proposed model is more effective and efficient. In addition, the interpretation of the proposed model was performed using LIME and Grad-CAM, which show how the fused model makes correct classifications.

Publication status:: Published

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Aftab, J., Khan, M. A., Arshad, S., Hussain, A., Alsenan, S., Cho, Y., & Nam, Y. (2026). A novel deep semantic- and vision-based self-attention architecture for skin cancer classification. Digital Health, 12.

MLA Style

Aftab, J, et al. “A Novel Deep Semantic- and Vision-Based Self-Attention Architecture for Skin Cancer Classification.” Digital Health, vol. 12, 2026.

Chicago Style

Aftab, J, MA Khan, S Arshad, et al. 2026. “A Novel Deep Semantic- and Vision-Based Self-Attention Architecture for Skin Cancer Classification.” Digital Health 12.
Print

Access Document

Files:: Aftab_et_al_2026_A_novel_deep.pdf

(Preview, Version of record, pdf, 7.5MB, Terms of use)

Publisher copy:: 10.1177/20552076261430276

Authors

+ Aftab, J More by this author

Role:: Author

+ Khan, MA More by this author

Role:: Author
ORCID:: 0000-0001-5723-3858

+ Arshad, S More by this author

Role:: Author

+ Hussain, A More by this author

Institution:: University of Oxford
Role:: Author

+ Alsenan, S More by this author

Role:: Author

More authors...

Publisher:: SAGE Publications
Journal:: Digital Health More from this journal
Volume:: 12
Article number:: 20552076261430276
Publication date:: 2026-03-03
Acceptance date:: 2026-02-18
DOI:: 10.1177/20552076261430276
EISSN:: 2055-2076
ISSN:: 2055-2076

Language:: English
Keywords:: models fusion

skin cancer

lesion classification

lesion segmentation

digital health

interpretation
Pubs id:: 2390787
Local pid:: pubs:2390787
Source identifiers:: 3819664
Deposit date:: 2026-03-04
ARK identifier:: ark:/29072/ora_9469d9c342294fe0a4a3c1924ba35e26

Terms of use

Licence:: CC Attribution-NonCommercial (CC BY-NC)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Journal article

A novel deep semantic- and vision-based self-attention architecture for skin cancer classification

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Journal article

A novel deep semantic- and vision-based self-attention architecture for skin cancer classification

Actions

Access Document

Authors

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions