Linear complexity self-attention with 3rd order polynomials

Babiloni, F; Marras, I; Deng, J; Kokkinos, F; Maggioni, M; Chrysos, G; Torr, P; Zafeiriou, S

AI Collection

Journal article

Linear complexity self-attention with 3rd order polynomials

Abstract:: Self-attention mechanisms and non-local blocks have become crucial building blocks for state-of-the-art neural architectures thanks to their unparalleled ability in capturing long-range dependencies in the input. However their cost is quadratic with the number of spatial positions hence making their use impractical in many real case applications. In this work, we analyze these methods through a polynomial lens, and we show that self-attention can be seen as a special case of a 3 rd order polynomial. Within this polynomial framework, we are able to design polynomial operators capable of accessing the same data pattern of non-local and self-attention blocks while reducing the complexity from quadratic to linear. As a result, we propose two modules (Poly-NL and Poly-SA) that can be used as “drop-in” replacements for more-complex non-local and self-attention layers in state-of-the-art CNNs and ViT architectures. Our modules can achieve comparable, if not better, performance across a wide range of computer vision tasks while keeping a complexity equivalent to a standard linear layer.

Publication status:: Published

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Babiloni, F., Marras, I., Deng, J., Kokkinos, F., Maggioni, M., Chrysos, G., Torr, P., & Zafeiriou, S. (2023). Linear complexity self-attention with 3rd order polynomials. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(11), 12726–12737.

MLA Style

Babiloni, F, et al. “Linear Complexity Self-Attention with 3rd Order Polynomials.” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 11, 2023, pp. 12726–37.

Chicago Style

Babiloni, F, I Marras, J Deng, et al. 2023. “Linear Complexity Self-Attention with 3rd Order Polynomials.” IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (11): 12726–37.
Print

Access Document

Files:: Babiloni_et_al_2023_Linear_Complexity_SelfAM.pdf

(Preview, Accepted manuscript, pdf, 2.6MB, Terms of use)

Publisher copy:: 10.1109/TPAMI.2022.3231971

Authors

+ Babiloni, F More by this author

Role:: Author

+ Marras, I More by this author

Role:: Author

+ Deng, J More by this author

Role:: Author

+ Kokkinos, F More by this author

Role:: Author

+ Maggioni, M More by this author

Role:: Author

More authors...

Publisher:: IEEE
Journal:: IEEE Transactions on Pattern Analysis and Machine Intelligence More from this journal
Volume:: 45
Issue:: 11
Pages:: 12726 - 12737
Publication date:: 2023-03-20
Acceptance date:: 2022-12-11
DOI:: 10.1109/TPAMI.2022.3231971
EISSN:: 1939-3539
ISSN:: 0162-8828

Language:: English
Keywords:: polynomial expansion

self-attention

transformers

FFR

neural networks

non-local blocks
Pubs id:: 1337441
Local pid:: pubs:1337441
Deposit date:: 2023-05-16
ARK identifier:: ark:/29072/ora_ee0acf3770f243b8b5e22171881254fe

Terms of use

Copyright holder:: IEEE
Rights statement:: ©2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Notes:: This is the accepted manuscript version of the article. The final version is available from IEEE at: 10.1109/TPAMI.2022.3231971

Licence:: Terms and Conditions of Use for Oxford University Research Archive

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Journal article

Linear complexity self-attention with 3rd order polynomials

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Journal article

Linear complexity self-attention with 3rd order polynomials

Actions

Access Document

Authors

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions