Thesis icon

Thesis

On the inductive biases of deep generative models

Abstract:
Generative modelling is an important machine learning paradigm for approximating high-dimensional distributions of multi-modal data. Generative probabilistic models learn a compressed latent representation and allow to sample from the approximated distribution, achieving state-of-the-art performance across numerous tasks. Many generative models feature latent variables which allow them to learn a hierarchical representation capturing structure in data or exploit its multi-resolution property. Others can be flexibly conditioned on available data at inference time. However, the inductive biases why generative models work well, and their critical components such as their neural architecture are often poorly understood.

In this thesis, we develop, characterise and probe the inductive biases of four major generative model classes: Variational Autoencoders (VAEs), Hierarchical VAEs (HVAEs), diffusion models, and autoregressive models or Large Language Models (LLMs). First, we propose Multi-Facet Clustering Variational Autoencoders (MFCVAE), a novel generative model which can uncover a multi-partition structure in data. Second, we identify state-of-the-art HVAEs as discretisations of an underlying multi-resolution diffusion process. Third, we analyse why U-Nets are a go-to architecture for the generative modelling of images and for diffusion models in particular, characterising them as learning the coefficients on truncated, finite-dimensional function spaces via preconditioning. Fourth, for LLMs, we analyse the hypothesis that in-context learning with i.i.d. data follows Bayesian principles. The goal of this thesis is to understand fundamental design principles and properties of deep generative models, which may characterise existing and inspire novel inductive biases.

Actions

Access Document

Files:

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Statistics
Sub department:
Statistics
Oxford college:
Oriel College
Role:
Author

Contributors

Institution:
University of Oxford
Division:
MPLS
Department:
Statistics
Sub department:
Statistics
Role:
Supervisor
Institution:
University of Oxford, Google DeepMind
Division:
MPLS
Department:
Statistics
Sub department:
Statistics
Role:
Supervisor
Institution:
University of Oxford
Division:
MPLS
Department:
Statistics
Sub department:
Statistics
Role:
Examiner
Institution:
University College London
Role:
Examiner


More from this funder
Funder identifier:
https://ror.org/03x94j517
Funding agency for:
Holmes, C
Grant:
MC_UP_A390_1107
Programme:
Programme Leaders award
More from this funder
Funder identifier:
https://ror.org/029chgv08
Funding agency for:
Falck, F
Grant:
218529/Z/19/Z
Programme:
Health Data Research UK-The Alan Turing Institute Wellcome PhD scholarship
More from this funder
Funder identifier:
https://ror.org/0439y7842
Funding agency for:
Doucet, A
Grant:
EP/R013616/1
EP/R034710/1
EP/N510129/1
More from this funder
Funder identifier:
https://ror.org/04rtjaj74
Funding agency for:
Holmes, C
More from this funder
Funder identifier:
https://ror.org/035dkdb55
Funding agency for:
Holmes, C


DOI:
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP