Thesis icon

Thesis

Causal and identifiable deep representation learning

Abstract:
Deep Learning has been responsible for many of the recent artificial intelligence (AI) successes, from text generation, image understanding, protein structure prediction, to superhuman performance in playing chess. Despite their successes, most current deep learning methods have difficulties with generalisation beyond the training distribution, are often not interpretable or explainable, possess few learning guarantees, and are often not robust to environmental changes. As AI is being adopted in more and more parts of society, it is important to understand and address these limitations.

One way to address these issues is by applying the framework of causality to deep learning, in what has become the emerging field of causal representation learning. In causal learning, the emphasis is on using data to learn causal variables and relationships instead of pure statistical associations, as this allows for improved interpretability, transfer, and reasoning via the framework of do-calculus.

In this thesis we take inspiration from causality and identifiability to develop deep learning methods for visual data that are interpretable, possess learning guarantees, and generalise beyond the training distribution. We do this by designing deep learning systems that employ various assumptions, such as assumptions about the environment (e.g. video with a static background), assumptions about the latent space (e.g. having one or two variables, or using a causal graph), restrictions on the causal mechanism function classes (e.g. linear functions), and properties of the network architectures (e.g. equivariance to certain transformations). Using these assumptions allows us to prove that the models are guaranteed to learn the true latent variables (up to some transformations), show that the models successfully generalise beyond the training distribution by intervening on the latent representation and generating realistic videos never observed at training time, and demonstrate that the learned representation is easily interpretable and explainable.

Actions

Access Document

Files:

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Sub department:
Engineering Science
Research group:
Visual Geometry Group
Oxford college:
Linacre College
Role:
Author

Contributors

Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Supervisor
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Supervisor
ORCID:
0000-0003-1374-2858


More from this funder
Funder identifier:
https://ror.org/0439y7842
Funding agency for:
Longa, M
Programme:
EPSRC DTA


DOI:
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP