Journal article
Multimodal learning with transformers: a survey
- Abstract:
- Transformer is a promising neural network learner, and has achieved great success in various machine learning tasks. Thanks to the recent prevalence of multimodal applications and Big Data, Transformer-based multimodal learning has become a hot topic in AI research. This paper presents a comprehensive survey of Transformer techniques oriented at multimodal data. The main contents of this survey include: (1) a background of multimodal learning, Transformer ecosystem, and the multimodal Big Data era, (2) a systematic review of <italic>Vanilla</italic> Transformer, Vision Transformer, and multimodal Transformers, from a geometrically topological perspective, (3) a review of multimodal Transformer applications, via two important paradigms, <italic>i</italic>.<italic>e</italic>., for multimodal pretraining and for specific multimodal tasks, (4) a summary of the common challenges and designs shared by the multimodal Transformer models and applications, and (5) a discussion of open problems and potential research directions for the community.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 1.5MB, Terms of use)
-
- Publisher copy:
- 10.1109/TPAMI.2023.3275156
Authors
- Publisher:
- IEEE
- Journal:
- IEEE Transactions on Pattern Analysis and Machine Intelligence More from this journal
- Volume:
- 45
- Issue:
- 10
- Pages:
- 12113-12132
- Publication date:
- 2023-05-11
- DOI:
- EISSN:
-
1939-3539
- ISSN:
-
0162-8828
- Language:
-
English
- Keywords:
- Pubs id:
-
1350356
- Local pid:
-
pubs:1350356
- Deposit date:
-
2023-06-16
Terms of use
- Copyright holder:
- Xu et al
- Copyright date:
- 2023
- Rights statement:
- © the Author(s) 2023. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record