Thesis

Generalisation and expressiveness for over-parameterised neural networks

Abstract:

Over-parameterised modern neural networks owe their success to two fundamental properties: expressive power and generalisation capability. The former refers to the model's ability to fit a large variety of data sets, while the latter enables the network to extrapolate patterns from training examples and apply them to previously unseen data. This thesis addresses a few challenges related to these two key properties.

The fact that over-parameterised networks can fit any data set is not always indicative of their practical expressiveness. This is the subject of the first part of this thesis, where we study how input information can be lost as it propagates through a deep architecture, and we propose an easily implementable remedy: the introduction of suitable scaling factors and residual connections.
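
To make this concrete, below is a minimal sketch of a residual block with a per-layer scaling factor, written in NumPy. The width (64), the tanh activation, and the factor alpha = 0.1 are illustrative assumptions, not values taken from the thesis.

import numpy as np

def scaled_residual_block(x, W, b, alpha=0.1):
    # One block: x + alpha * tanh(W x + b). The skip connection carries the
    # input forward unchanged, and the small factor alpha damps each layer's
    # update, so the signal is not washed out as depth grows.
    return x + alpha * np.tanh(W @ x + b)

rng = np.random.default_rng(0)
x = rng.standard_normal(64)
h = x
for _ in range(100):
    W = rng.standard_normal((64, 64)) / np.sqrt(64)  # variance-preserving init
    h = scaled_residual_block(h, W, np.zeros(64))

# Even after 100 layers, the output stays clearly correlated with the input,
# i.e. information about x survives the forward pass; removing the skip
# connection or setting alpha = 1 degrades this correlation sharply.
print(np.corrcoef(x, h)[0, 1])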

The second part of this thesis focuses on generalisation. The reason why modern neural networks can generalise well to new data without overfitting, despite being over-parameterised, is an open question that is currently receiving considerable attention in the research community. We explore this subject from information-theoretic and PAC-Bayesian viewpoints, proposing novel learning algorithms and generalisation bounds.
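
As background (a classical result, not one of the thesis's novel bounds), a PAC-Bayesian bound in the style of McAllester can be stated as follows in LaTeX. Here P is a prior over hypotheses fixed before seeing the data, Q is any posterior, L and \hat{L} are the population and empirical risks for a loss taking values in [0, 1], n is the number of i.i.d. training examples, and the bound holds with probability at least 1 - \delta over the sample, simultaneously for all Q:

\mathbb{E}_{h \sim Q}\left[ L(h) \right]
  \le \mathbb{E}_{h \sim Q}\left[ \hat{L}(h) \right]
  + \sqrt{ \frac{ \mathrm{KL}(Q \,\|\, P) + \ln(n/\delta) }{ 2(n-1) } }

The \mathrm{KL}(Q \| P) term penalises posteriors that move far from the prior, so the bound can remain non-vacuous even for heavily over-parameterised models; keeping this term small is the general lever that PAC-Bayesian learning algorithms exploit.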

Authors

Clerico, E
Institution: University of Oxford
Division: MPLS
Department: Statistics
Role: Author

Contributors

Role: Contributor
Role: Contributor
Role: Contributor (ORCID: 0000-0002-3223-7171)
Role: Contributor
Role: Contributor


Funding

Funding agency for: Clerico, E
Grant: EP/R513295/1


DOI:
Type of award: DPhil
Level of award: Doctoral
Awarding institution: University of Oxford

