Conference item

Language matters: a weakly supervised Vision-Language pre-training approach for scene text detection and spotting

Abstract:

Recently, Vision-Language Pre-training (VLP) techniques have greatly benefited various vision-language tasks by jointly learning visual and textual representations, which intuitively helps in Optical Character Recognition (OCR) tasks because scene text images are rich in both visual and textual information. However, these methods cannot cope well with OCR tasks because of the difficulty of both instance-level text encoding and image-text pair acquisition (i.e., images and the texts captured in them). Th...

Publication status:
Published
Peer review status:
Peer reviewed

Access Document


Files:
Publisher copy:
10.1007/978-3-031-19815-1_17

Authors


Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
Publisher:
Springer
Series:
Lecture Notes in Computer Science
Series number:
13688
Publication date:
2022-10-20
Acceptance date:
2022-10-25
Event title:
2022 Computer Vision and Pattern Recognition (CVPR 2022)
Event location:
New Orleans, Louisiana, USA
Event website:
https://cvpr2022.thecvf.com/
Event start date:
2022-06-19
Event end date:
2022-06-24
DOI:
EISBN:
9783031198151
ISBN:
9783031198144
Language:
English
Keywords:
Pubs id:
1302159
Local pid:
pubs:1302159
Deposit date:
2022-11-11
