Journal article icon

Journal article

Large-Kernel Attention for 3D Medical Image Segmentation

Abstract:
Automated segmentation of multiple organs and tumors from 3D medical images such as magnetic resonance imaging (MRI) and computed tomography (CT) scans using deep learning methods can aid in diagnosing and treating cancer. However, organs often overlap and are complexly connected, characterized by extensive anatomical variation and low contrast. In addition, the diversity of tumor shape, location, and appearance, coupled with the dominance of background voxels, makes accurate 3D medical image segmentation difficult. In this paper, a novel 3D large-kernel (LK) attention module is proposed to address these problems to achieve accurate multi-organ segmentation and tumor segmentation. The advantages of biologically inspired self-attention and convolution are combined in the proposed LK attention module, including local contextual information, long-range dependencies, and channel adaptation. The module also decomposes the LK convolution to optimize the computational cost and can be easily incorporated into CNNs such as U-Net. Comprehensive ablation experiments demonstrated the feasibility of convolutional decomposition and explored the most efficient and effective network design. Among them, the best Mid-type 3D LK attention-based U-Net network was evaluated on CT-ORG and BraTS 2020 datasets, achieving state-of-the-art segmentation performance when compared to avant-garde CNN and Transformer-based methods for medical image segmentation. The performance improvement due to the proposed 3D LK attention module was statistically validated.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1007/s12559-023-10126-7

Authors

More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0000-0003-2476-0664
More by this author
Role:
Author
ORCID:
0000-0002-4542-3336
More by this author
Role:
Author
ORCID:
0000-0001-7344-7733


More from this funder
Funder identifier:
https://ror.org/019af4n30
Grant:
101005122
More from this funder
Funder identifier:
https://ror.org/03wnrjx87
Grant:
IEC/NSFC/211235
More from this funder
Funder identifier:
https://ror.org/02wdwnk04
Grant:
PG/16/78/32402
TG/18/5/34111
More from this funder
Funder identifier:
https://ror.org/001aqnf71
Grant:
MR/V023799/1


Publisher:
Springer
Journal:
Cognitive Computation More from this journal
Volume:
16
Issue:
4
Pages:
2063-2077
Publication date:
2023-02-27
DOI:
EISSN:
1866-9964
ISSN:
1866-9956


Language:
English
Keywords:
Pubs id:
2342009
Local pid:
pubs:2342009
Source identifiers:
W4322495747
Deposit date:
2025-12-03
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP