Conference item icon

Conference item

Visual dialogue without vision or dialogue

Abstract:
We characterise some of the quirks and shortcomings in the exploration of visual dialogue (VD)—a sequential question-answering task where the questions and corresponding answers are related through given visual stimuli. To do so, we develop an embarrassingly simple method based on canonical correlation analysis (CCA) that, on the standard dataset, achieves near state-of-the-art performance on mean rank (MR). In direct contrast to current complex and over-parametrised architectures that are both compute and time intensive, our method ignores the visual stimuli, ignores the sequencing of dialogue, does not need gradients, uses off-the-shelf feature extractors, has at least an order of magnitude fewer parameters, and learns in practically no time. We argue that these results are indicative of issues in current approaches to visual dialogue and conduct analyses to highlight implicit dataset biases and effects of over-constrained evaluation metrics. Our code is publicly available.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publication website:
https://ml-critique-correct.github.io/

Authors

More by this author
Institution:
University of Oxford
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author


Publisher:
Neural Information Processing Systems Foundation
Publication date:
2018-12-07
Acceptance date:
2018-11-10
Event title:
NeurIPS 2018 Workshop on Critiquing and Correcting Trends in Machine Learning
Event location:
Montreal, Canada
Event website:
https://nips.cc/Conferences/2018/Schedule?showEvent=10911
Event start date:
2018-12-07
Event end date:
2018-12-07


Language:
English
Pubs id:
pubs:1030857
UUID:
uuid:3472f81f-5709-469f-a6b5-5684d8154f8b
Local pid:
pubs:1030857
Source identifiers:
1030857
Deposit date:
2019-07-10
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP