Journal article icon

Journal article

Efficient visual search of videos cast as text retrieval

Abstract:
We describe an approach to object retrieval which searches for and localizes all the occurrences of an object in a video, given a query image of the object. The object is represented by a set of viewpoint invariant region descriptors so that recognition can proceed successfully despite changes in viewpoint, illumination and partial occlusion. The temporal continuity of the video within a shot is used to track the regions in order to reject those that are unstable. Efficient retrieval is achieved by employing methods from statistical text retrieval, including inverted file systems, and text and document frequency weightings. This requires a visual analogy of a word which is provided here by vector quantizing the region descriptors. The final ranking also depends on the spatial layout of the regions. The result is that retrieval is immediate, returning a ranked list of shots in the manner of Google. We report results for object retrieval on the full length feature films 'Groundhog Day', 'Casablanca' and 'Run Lola Run', including searches from within the movie and specified by external images downloaded from the Internet. We investigate retrieval performance with respect to different quantizations of region descriptors and compare the performance of several ranking measures. Performance is also compared to a baseline method implementing standard frame to frame matching.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Publisher copy:
10.1109/tpami.2008.111

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
ORCID:
0000-0002-8945-8573


Publisher:
IEEE
Journal:
IEEE Transactions on Pattern Analysis and Machine Intelligence More from this journal
Volume:
31
Issue:
4
Pages:
591-606
Publication date:
2008-05-02
DOI:
EISSN:
1939-3539
ISSN:
0162-8828


Language:
English
Keywords:
Pubs id:
pubs:62101
UUID:
uuid:ef6ef3f0-aed8-4bdf-a6fb-52405c23401d
Local pid:
pubs:62101
Source identifiers:
62101
Deposit date:
2013-11-16
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP