Thesis icon

Thesis

Multi-Modal Deep Learning for Computer Vision and Its Application

Abstract:

Although the exponential growth of visual data in various forms, such as images and videos, enables unprecedented opportunities for us to interpret the surrounding environment, natural language is still the main way to convey knowledge and information among us. Therefore, there is currently an increasing demand for building a framework to achieve interaction between pieces of information from different modalities. In this thesis, I investigate three directions to achieve an effective interac...

Expand abstract

Actions


Access Document


Files:

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Sub department:
Computer Science
Research group:
Intelligent Systems Lab
Oxford college:
St Catherine's College
Role:
Author

Contributors

Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Sub department:
Computer Science
Role:
Supervisor
ORCID:
0000-0002-7644-1668
Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Sub department:
Computer Science
Role:
Supervisor
ORCID:
0000-0002-9329-8410


More from this funder
Grant:
CS2020_AXA_1147620
Programme:
DPhil in Computer Science at the University of Oxford


DOI:
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford


Language:
English
Keywords:
Subjects:
Deposit date:
2023-11-04

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP