Journal article
Principal component analysis by optimisation of symmetric functions has no spurious local optima
- Abstract:
- Principal component analysis (PCA) finds the best linear representation of data and is an indispensable tool in many learning and inference tasks. Classically, principal components of a dataset are interpreted as the directions that preserve most of its “energy,” an interpretation that is theoretically underpinned by the celebrated Eckart--Young--Mirsky theorem. This paper introduces many other ways of performing PCA, with various geometric interpretations, and proves that the corresponding family of nonconvex programs has no spurious local optima, while possessing only strict saddle points. These programs therefore loosely behave like convex problems and can be efficiently solved to global optimality, for example, with certain variants of the stochastic gradient descent. Beyond providing new geometric interpretations and enhancing our theoretical understanding of PCA, our findings might pave the way for entirely new approaches to structured dimensionality reduction, such as sparse PCA and nonnegative matrix factorization. More specifically, we study an unconstrained formulation of PCA using determinant optimization that might provide an elegant alternative to the deflating scheme commonly used in sparse PCA. Read More: https://epubs.siam.org/doi/abs/10.1137/18M1188495
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, 714.2KB, Terms of use)
-
- Publisher copy:
- 10.1137/18M1188495
Authors
- Publisher:
- Society for Industrial and Applied Mathematics
- Journal:
- SIAM Journal on Optimization More from this journal
- Volume:
- 30
- Issue:
- 1
- Pages:
- 439–463
- Publication date:
- 2020-02-06
- Acceptance date:
- 2019-11-26
- DOI:
- EISSN:
-
1095-7189
- ISSN:
-
1052-6234
- Language:
-
English
- Keywords:
- Pubs id:
-
pubs:871250
- UUID:
-
uuid:ea47944f-07e6-4fe6-9db6-9a763767a0c1
- Local pid:
-
pubs:871250
- Source identifiers:
-
871250
- Deposit date:
-
2019-11-27
Terms of use
- Copyright holder:
- Society for Industrial and Applied Mathematics
- Copyright date:
- 2020
- Rights statement:
- © 2020, Society for Industrial and Applied Mathematics.
If you are the owner of this record, you can report an update to it here: Report update to this record