On the inductive biases of deep generative models

Falck, F

Abstract:: Generative modelling is an important machine learning paradigm for approximating high-dimensional distributions of multi-modal data. Generative probabilistic models learn a compressed latent representation and allow to sample from the approximated distribution, achieving state-of-the-art performance across numerous tasks. Many generative models feature latent variables which allow them to learn a hierarchical representation capturing structure in data or exploit its multi-resolution property. Others can be flexibly conditioned on available data at inference time. However, the inductive biases why generative models work well, and their critical components such as their neural architecture are often poorly understood.

In this thesis, we develop, characterise and probe the inductive biases of four major generative model classes: Variational Autoencoders (VAEs), Hierarchical VAEs (HVAEs), diffusion models, and autoregressive models or Large Language Models (LLMs). First, we propose Multi-Facet Clustering Variational Autoencoders (MFCVAE), a novel generative model which can uncover a multi-partition structure in data. Second, we identify state-of-the-art HVAEs as discretisations of an underlying multi-resolution diffusion process. Third, we analyse why U-Nets are a go-to architecture for the generative modelling of images and for diffusion models in particular, characterising them as learning the coefficients on truncated, finite-dimensional function spaces via preconditioning. Fourth, for LLMs, we analyse the hypothesis that in-context learning with i.i.d. data follows Bayesian principles. The goal of this thesis is to understand fundamental design principles and properties of deep generative models, which may characterise existing and inspire novel inductive biases.

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Falck, F. (2024). On the inductive biases of deep generative models [PhD thesis]. University of Oxford.

MLA Style

Falck, F. On the Inductive Biases of Deep Generative Models. 2024. University of Oxford, PhD thesis.

Chicago Style

Falck, F. 2024. “On the Inductive Biases of Deep Generative Models.” PhD thesis, University of Oxford.
Print

Access Document

Files:: Falck_2024_On_the_inductive.pdf

(Preview, Dissemination version, pdf, 26.6MB, Terms of use)

Authors

+ Falck, F More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Sub department:: Statistics
Oxford college:: Oriel College
Role:: Author

Contributors

+ Holmes, C

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Sub department:: Statistics
Role:: Supervisor

+ Doucet, A

Institution:: University of Oxford, Google DeepMind
Division:: MPLS
Department:: Statistics
Sub department:: Statistics
Role:: Supervisor

+ Teh, Y

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Sub department:: Statistics
Role:: Examiner

+ Barber, D

Institution:: University College London
Role:: Examiner

+ Medical Research Council More from this funder

Funder identifier:: https://ror.org/03x94j517
Funding agency for:: Holmes, C
Grant:: MC_UP_A390_1107
Programme:: Programme Leaders award

+ Wellcome Trust More from this funder

Funder identifier:: https://ror.org/029chgv08
Funding agency for:: Falck, F
Grant:: 218529/Z/19/Z
Programme:: Health Data Research UK-The Alan Turing Institute Wellcome PhD scholarship