Determining the number of canonical correlation pairs for high-dimensional vectors

Jiasen Zheng, Lixing Zhu*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

For two random vectors whose dimensions are both proportional to the sample size, we in this paper propose two ridge ratio criteria to determine the number of canonical correlation pairs. The criteria are, respectively, based on eigenvalue difference-based and centered eigenvalue-based ridge ratios. Unlike existing methods, the criteria make the ratio at the index we want to identify stick out to show a visualized “valley-cliff” pattern and thus can adequately avoid the local optimal solutions that often occur in the eigenvalues multiplicity cases. The numerical studies also suggest its advantage over existing scree plot-based method that is not a visualization method and more seriously underestimates the number of pairs than the proposed ones and the AIC and Cp criteria that often extremely over-estimate the number, and the BIC criterion that has very serious underestimation problem. A real data set is analyzed for illustration.

Original languageEnglish
Pages (from-to)737-756
Number of pages20
JournalAnnals of the Institute of Statistical Mathematics
Volume73
Issue number4
Early online date12 Feb 2021
DOIs
Publication statusPublished - Aug 2021

Scopus Subject Areas

  • Statistics and Probability

User-Defined Keywords

  • Canonical correlation matrix
  • Eigenvalue-based ridge ratios
  • High dimensionality
  • The number of canonical correlation pairs

Fingerprint

Dive into the research topics of 'Determining the number of canonical correlation pairs for high-dimensional vectors'. Together they form a unique fingerprint.

Cite this