Conference item icon

Conference item

Double-descent curves in neural networks: a new perspective using Gaussian processes

Abstract:
Double-descent curves in neural networks describe the phenomenon that the generalisation error initially descends with increasing parameters, then grows after reaching an optimal number of parameters which is less than the number of data points, but then descends again in the overparameterized regime. In this paper, we use techniques from random matrix theory to characterize the spectral distribution of the empirical feature covariance matrix as a width-dependent perturbation of the spectrum of the neural network Gaussian process (NNGP) kernel, thus establishing a novel connection between the NNGP literature and the random matrix theory literature in the context of neural networks. Our analytical expressions allow us to explore the generalisation behavior of the corresponding kernel and GP regression. Furthermore, they offer a new interpretation of double-descent in terms of the discrepancy between the width-dependent empirical kernel and the width-independent NNGP kernel.
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Files:
Publisher copy:
10.1609/aaai.v38i10.29071

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Oxford college:
New College
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Physics
Sub department:
Theoretical Physics
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Physics
Sub department:
Theoretical Physics
Role:
Author
ORCID:
0000-0002-8438-910X


Publisher:
Association for the Advancement of Artificial Intelligence
Journal:
Proceedings of the AAAI Conference on Artificial Intelligence More from this journal
Volume:
38
Issue:
10
Pages:
11856-11864
Publication date:
2024-03-24
Acceptance date:
2023-12-09
Event title:
Thirty-Eighth AAAI Conference on Artificial Intelligence
Event location:
Vancouver, Canada
Event website:
https://aaai.org/aaai-conference/
Event start date:
2024-02-20
Event end date:
2024-02-27
DOI:
EISSN:
2374-3468
ISSN:
2159-5399
Commissioning body:
Association for the Advancement of Artificial Intelligence
ISBN-10:
1-57735-887-2
ISBN-13:
978-1-57735-887-9


Language:
English
Keywords:
Pubs id:
1987251
Local pid:
pubs:1987251
Deposit date:
2024-04-27

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP