Journal article
On the stability of canonical correlation analysis and partial least squares with application to brain-behavior associations
- Abstract:
- Associations between datasets can be discovered through multivariate methods like Canonical Correlation Analysis (CCA) or Partial Least Squares (PLS). A requisite property for interpretability and generalizability of CCA/PLS associations is stability of their feature patterns. However, stability of CCA/PLS in high-dimensional datasets is questionable, as found in empirical characterizations. To study these issues systematically, we developed a generative modeling framework to simulate synthetic datasets. We found that when sample size is relatively small, but comparable to typical studies, CCA/PLS associations are highly unstable and inaccurate; both in their magnitude and importantly in the feature pattern underlying the association. We confirmed these trends across two neuroimaging modalities and in independent datasets with n ≈ 1000 and n = 20,000, and found that only the latter comprised sufficient observations for stable mappings between imaging-derived and behavioral features. We further developed a power calculator to provide sample sizes required for stability and reliability of multivariate analyses. Collectively, we characterize how to limit detrimental effects of overfitting on CCA/PLS stability, and provide recommendations for future studies
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 3.3MB, Terms of use)
-
- Publisher copy:
- 10.1038/s42003-024-05869-4
Authors
- Publisher:
- Nature Research
- Journal:
- Communications Biology More from this journal
- Volume:
- 7
- Issue:
- 1
- Pages:
- 217-217
- Article number:
- 217
- Publication date:
- 2024-02-21
- DOI:
- EISSN:
-
2399-3642
- ISSN:
-
2399-3642
- Language:
-
English
- Keywords:
-
- Pubs id:
-
1753185
- Local pid:
-
pubs:1753185
- Source identifiers:
-
W4391991358
- Deposit date:
-
2026-06-08
- ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
Terms of use
- Copyright date:
- 2024
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record