Thesis
On the inductive biases of deep generative models
- Abstract:
-
Generative modelling is an important machine learning paradigm for approximating high-dimensional distributions of multi-modal data. Generative probabilistic models learn a compressed latent representation and allow to sample from the approximated distribution, achieving state-of-the-art performance across numerous tasks. Many generative models feature latent variables which allow them to learn a hierarchical representation capturing structure in data or exploit its multi-resolution property. Others can be flexibly conditioned on available data at inference time. However, the inductive biases why generative models work well, and their critical components such as their neural architecture are often poorly understood.
In this thesis, we develop, characterise and probe the inductive biases of four major generative model classes: Variational Autoencoders (VAEs), Hierarchical VAEs (HVAEs), diffusion models, and autoregressive models or Large Language Models (LLMs). First, we propose Multi-Facet Clustering Variational Autoencoders (MFCVAE), a novel generative model which can uncover a multi-partition structure in data. Second, we identify state-of-the-art HVAEs as discretisations of an underlying multi-resolution diffusion process. Third, we analyse why U-Nets are a go-to architecture for the generative modelling of images and for diffusion models in particular, characterising them as learning the coefficients on truncated, finite-dimensional function spaces via preconditioning. Fourth, for LLMs, we analyse the hypothesis that in-context learning with i.i.d. data follows Bayesian principles. The goal of this thesis is to understand fundamental design principles and properties of deep generative models, which may characterise existing and inspire novel inductive biases.
Actions
Access Document
- Files:
-
-
(Preview, Dissemination version, pdf, 26.6MB, Terms of use)
-
Authors
Contributors
+ Holmes, C
- Institution:
- University of Oxford
- Division:
- MPLS
- Department:
- Statistics
- Sub department:
- Statistics
- Role:
- Supervisor
+ Doucet, A
- Institution:
- University of Oxford, Google DeepMind
- Division:
- MPLS
- Department:
- Statistics
- Sub department:
- Statistics
- Role:
- Supervisor
+ Teh, Y
- Institution:
- University of Oxford
- Division:
- MPLS
- Department:
- Statistics
- Sub department:
- Statistics
- Role:
- Examiner
+ Barber, D
- Institution:
- University College London
- Role:
- Examiner
+ Medical Research Council
More from this funder
- Funder identifier:
- https://ror.org/03x94j517
- Funding agency for:
- Holmes, C
- Grant:
- MC_UP_A390_1107
- Programme:
- Programme Leaders award
+ Wellcome Trust
More from this funder
- Funder identifier:
- https://ror.org/029chgv08
- Funding agency for:
- Falck, F
- Grant:
- 218529/Z/19/Z
- Programme:
- Health Data Research UK-The Alan Turing Institute Wellcome PhD scholarship
+ Engineering and Physical Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/0439y7842
- Funding agency for:
- Doucet, A
- Grant:
- EP/R013616/1
- EP/R034710/1
- EP/N510129/1
+ Health Data Research UK
More from this funder
- Funder identifier:
- https://ror.org/04rtjaj74
- Funding agency for:
- Holmes, C
+ The Alan Turing Institute
More from this funder
- Funder identifier:
- https://ror.org/035dkdb55
- Funding agency for:
- Holmes, C
- DOI:
- Type of award:
- DPhil
- Level of award:
- Doctoral
- Awarding institution:
- University of Oxford
- Language:
-
English
- Keywords:
- Subjects:
- Deposit date:
-
2026-06-28
- ARK identifier:
Terms of use
- Copyright holder:
- Fabian Falck
- Copyright date:
- 2024
- Notes:
- A multi-resolution framework for U-nets with applications to hierarchical VAEs and Multi-facet clustering variational autoencoders are derived from this thesis.
If you are the owner of this record, you can report an update to it here: Report update to this record