
Thesis

Structure and uncertainty in deep learning

Abstract:

Designing uncertainty-aware deep learning models that provide reasonable uncertainty estimates alongside their predictions has long been a goal for parts of the machine learning community, and such models are frequently requested by practitioners. The most widespread and obvious approach is to take existing deep architectures and apply existing Bayesian techniques to them, for instance by treating the weights of the neural network as random variables in a Bayesian framework. This thesis attempts to answer the question: are existing neural network architectures the best way to obtain reasonable uncertainty?

In the first part of this thesis, we present research on the uncertainty behaviour of Bayesian neural networks in an adversarial setting. It demonstrates that, while a Bayesian approach improves significantly on deterministic networks near the data distribution, the extrapolation behaviour is undesirable: standard neural network architectures have a structural bias towards confident extrapolation. Motivated by this, we then explore two alternatives to standard deep learning architectures that attempt to address this issue. First, we describe a novel generative formulation of capsule networks, which impose structure on a learning task by making strong assumptions about the structure of scenes. We then use this generative model to examine whether those underlying assumptions are useful, arguing that they in fact have significant flaws. Second, we explore bilipschitz models, a family of architectures that address the more limited goal of ensuring prior reversion in deep neural networks. These are based on deep kernel learning, controlling the behaviour of neural networks out of distribution by using final classification layers that revert to a prior as the distance to a set of support vectors increases.
To maintain this property while using a neural feature extractor, we describe a novel 'bilipschitz' regularisation scheme for these models, which prevents feature collapse by imposing a constraint motivated by work on invertible networks. We describe several useful applications of these models, and analyse why this regularisation scheme remains effective even when its original motivation no longer holds, in particular when the feature dimensionality is lower than that of the input.
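The prior-reversion mechanism described above can be illustrated with a minimal sketch. This is not the thesis's implementation: `rbf_logits`, the centroids, and the length scale are hypothetical stand-ins showing how an RBF-kernel output layer's class scores decay towards zero (and hence the prediction towards the prior) as the distance from the support points grows.

```python
import numpy as np

def rbf_logits(features, centroids, length_scale=1.0):
    """Distance-aware output layer (hypothetical helper): the score for
    each class decays with squared distance from that class's centroid."""
    # Pairwise squared Euclidean distances, shape (n_points, n_classes).
    d2 = ((features[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * length_scale ** 2))

# Two class centroids in a toy 2-D feature space.
centroids = np.array([[0.0, 0.0], [4.0, 0.0]])

near = rbf_logits(np.array([[0.1, 0.0]]), centroids)      # close to class 0
far = rbf_logits(np.array([[100.0, 100.0]]), centroids)   # far from all data

# Near the data, one class dominates; far away, every kernel value
# vanishes, so the model has no evidence and reverts to its prior.
assert near[0, 0] > 0.9 and near[0, 1] < 0.1
assert np.allclose(far, 0.0, atol=1e-6)
```

The bilipschitz constraint on the feature extractor matters because this behaviour only helps if distances in feature space track distances in input space: without a lower Lipschitz bound, the network could collapse distinct out-of-distribution inputs onto the support points, defeating the reversion.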

Authors


Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Research group:
OATML
Oxford college:
Kellogg College
Role:
Author
ORCID:
0000-0001-6632-8162

Contributors

Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Research group:
OATML Group
Role:
Supervisor
ORCID:
0000-0002-2733-2078
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Examiner
Institution:
University of Toronto
Role:
Examiner


Funding
Funder identifier:
http://dx.doi.org/10.13039/501100000266
Funding agency for:
Smith, L
Grant:
EP/L015897/1


Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford
