Thesis

Multi-Modal Deep Learning for Computer Vision and Its Application

Abstract:: Although the exponential growth of visual data in various forms, such as images and videos, enables unprecedented opportunities for us to interpret the surrounding environment, natural language is still the main way to convey knowledge and information among us. Therefore, there is currently an increasing demand for building a framework to achieve interaction between pieces of information from different modalities. In this thesis, I investigate three directions to achieve an effective inter...
Expand abstract
Collapse abstract

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Li, B. (2023). Multi-Modal Deep Learning for Computer Vision and Its Application [PhD thesis]. University of Oxford.

MLA Style

Li, B. Multi-Modal Deep Learning for Computer Vision and Its Application. 2023. University of Oxford, PhD thesis.

Chicago Style

Li, B. 2023. “Multi-Modal Deep Learning for Computer Vision and Its Application.” PhD thesis, University of Oxford.
Print

Access Document

Files:: Li_2022_Multi-modal_deep_learning.pdf

(Preview, Dissemination version, pdf, 50.3MB, Terms of use)

Authors

+ Li, B More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Computer Science
Sub department:: Computer Science
Research group:: Intelligent Systems Lab
Oxford college:: St Catherine's College
Role:: Author

Contributors

+ Lukasiewicz, T

Institution:: University of Oxford
Division:: MPLS
Department:: Computer Science
Sub department:: Computer Science
Role:: Supervisor
ORCID:: 0000-0002-7644-1668

+ Wooldridge, M

Institution:: University of Oxford
Division:: MPLS
Department:: Computer Science
Sub department:: Computer Science
Role:: Supervisor
ORCID:: 0000-0002-9329-8410

+ AXA Chair Funding More from this funder

Grant:: CS2020_AXA_1147620
Programme:: DPhil in Computer Science at the University of Oxford

DOI:: 10.5287/ora-8ga2mpbob
Type of award:: DPhil
Level of award:: Doctoral
Awarding institution:: University of Oxford

Language:: English
Keywords:: Multi-Modal Learning

Generative Modelling
Subjects:: Multi-Modal Learning
Pubs id:: 2044905
Local pid:: pubs:2044905
Deposit date:: 2023-11-04
ARK identifier:: ark:/29072/ora_0906fd06fbb94c5dbcdf952b9a13e07a

Terms of use

Copyright holder:: Li, B
Copyright date:: 2023

Licence:: Terms and Conditions of Use for Oxford University Research Archive

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP