Journal article icon

Journal article

On the stability of canonical correlation analysis and partial least squares with application to brain-behavior associations

Abstract:
Associations between datasets can be discovered through multivariate methods like Canonical Correlation Analysis (CCA) or Partial Least Squares (PLS). A requisite property for interpretability and generalizability of CCA/PLS associations is stability of their feature patterns. However, stability of CCA/PLS in high-dimensional datasets is questionable, as found in empirical characterizations. To study these issues systematically, we developed a generative modeling framework to simulate synthetic datasets. We found that when sample size is relatively small, but comparable to typical studies, CCA/PLS associations are highly unstable and inaccurate; both in their magnitude and importantly in the feature pattern underlying the association. We confirmed these trends across two neuroimaging modalities and in independent datasets with n ≈ 1000 and n = 20,000, and found that only the latter comprised sufficient observations for stable mappings between imaging-derived and behavioral features. We further developed a power calculator to provide sample sizes required for stability and reliability of multivariate analyses. Collectively, we characterize how to limit detrimental effects of overfitting on CCA/PLS stability, and provide recommendations for future studies
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1038/s42003-024-05869-4

Authors

More by this author
Role:
Author
ORCID:
0000-0001-9680-0595
More by this author
Role:
Author
ORCID:
0000-0002-7198-8162
More by this author
Role:
Author
ORCID:
0000-0002-6280-9070
More by this author
Role:
Author
ORCID:
0000-0001-9709-8523


Publisher:
Nature Research
Journal:
Communications Biology More from this journal
Volume:
7
Issue:
1
Pages:
217-217
Article number:
217
Publication date:
2024-02-21
DOI:
EISSN:
2399-3642
ISSN:
2399-3642


Language:
English
Keywords:
Pubs id:
1753185
Local pid:
pubs:1753185
Source identifiers:
W4391991358
Deposit date:
2026-06-08
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP