Conference item
Double-descent curves in neural networks: a new perspective using Gaussian processes
- Abstract:
- Double-descent curves in neural networks describe the phenomenon that the generalisation error initially descends with increasing parameters, then grows after reaching an optimal number of parameters which is less than the number of data points, but then descends again in the overparameterized regime. In this paper, we use techniques from random matrix theory to characterize the spectral distribution of the empirical feature covariance matrix as a width-dependent perturbation of the spectrum of the neural network Gaussian process (NNGP) kernel, thus establishing a novel connection between the NNGP literature and the random matrix theory literature in the context of neural networks. Our analytical expressions allow us to explore the generalisation behavior of the corresponding kernel and GP regression. Furthermore, they offer a new interpretation of double-descent in terms of the discrepancy between the width-dependent empirical kernel and the width-independent NNGP kernel.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 801.6KB, Terms of use)
-
- Publisher copy:
- 10.1609/aaai.v38i10.29071
Authors
- Publisher:
- Association for the Advancement of Artificial Intelligence
- Journal:
- Proceedings of the AAAI Conference on Artificial Intelligence More from this journal
- Volume:
- 38
- Issue:
- 10
- Pages:
- 11856-11864
- Publication date:
- 2024-03-24
- Acceptance date:
- 2023-12-09
- Event title:
- Thirty-Eighth AAAI Conference on Artificial Intelligence
- Event location:
- Vancouver, Canada
- Event website:
- https://aaai.org/aaai-conference/
- Event start date:
- 2024-02-20
- Event end date:
- 2024-02-27
- DOI:
- EISSN:
-
2374-3468
- ISSN:
-
2159-5399
- Commissioning body:
- Association for the Advancement of Artificial Intelligence
- ISBN-10:
- 1-57735-887-2
- ISBN-13:
- 978-1-57735-887-9
- Language:
-
English
- Keywords:
- Pubs id:
-
1987251
- Local pid:
-
pubs:1987251
- Deposit date:
-
2024-04-27
Terms of use
- Copyright holder:
- Association for the Advancement of Artificial Intelligence
- Copyright date:
- 2024
- Rights statement:
- © 2024, Association for the Advancement of Artificial Intelligence. All Rights Reserved
- Notes:
-
Work supported by the SIRIUS Centre (Res. Council of Norway, project 237889), and the EPSRC projects ConCur (EP/V050869/1) and UK FIRES (EP/S019111/1). For the purpose of Open Access, the authors have applied a CC BY public copyright licence to any Author Accepted Manuscript (AAM) version arising from this submission.
This is the accepted manuscript version of the paper. The final version is available from Association for the Advancement of Artificial Intelligence at: 10.1609/aaai.v38i10.29071
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record