Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
A tutorial on canonical correlation methods
Uurtio V., Monteiro J., Kandola J., Shawe-Taylor J., Fernandez-Reyes D., Rousu J. ACM Computing Surveys50 (6):1-33,2018.Type:Article
Date Reviewed: Apr 26 2018

Canonical correlation analysis (CCA) is used to discover relations between two or more multivariate sets of variables, called views. Data to be processed are collected for a population of individuals, and for one individual its state is described by groups of values of associated variables, every group being part of the corresponding view. Not only linear relations can be extracted, but also nonlinear relations may provide enhanced knowledge on the subject under investigation.

This valuable paper succeeds in covering the most important aspects of CCA. A unified framework is used to present linear models in both over- and under-determined settings. Regularization techniques are used for under-determined cases. The robustness of CCA is improved in a Bayesian framework. Therefore, the Bayesian CCA approach is obtained. The nonlinear relations can be extracted using kernel methods, or deep CCA based on neural networks. Solving sparse versions of CCA (enforcing some components of position vectors to null values) with appropriate algorithms is also covered.

The reader will appreciate a well-written introduction, a valuable section dedicated to a discussion of the presented topic, and an excellent list of references.

The second section is dedicated to the over-determined case. Solved as an optimization problem, the unknown parameters can be computed by solving the linear CCA through one of the following approaches: (1) solving a standard eigenvalue problem; (2) solving a generalized eigenvalue problem; and (3) using the single-value decomposition method. Not only are the computational procedures well described, but the included examples and interpretations for each case bring value. The models are evaluated for statistical significance and tested for generalizability.

The third section contains the rest of approaches, where three algorithms are described: (1) application of repeated k-fold cross-validation for regularized CCA; (2) computation of a 1-rank approximation of the covariance matrix; and (3) solving the convex sparse least square problem. Also, inspired examples illustrate the applicability of the described methods.

The fourth section is dedicated to discussion. Every section ends with a short summary.

The tutorial is well structured and clearly written (minor typographical errors can be easily identified). I highly recommend this paper as a good resource for researchers, master’s students, or undergraduates enrolled in advanced data analysis courses.

Reviewer:  G. Albeanu Review #: CR146003 (1807-0390)
Bookmark and Share
  Featured Reviewer  
 
Correlation And Regression Analysis (G.3 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Correlation And Regression Analysis": Date
Strong convergence of estimators in nonlinear autoregressive models
Liebscher E. Journal of Multivariate Analysis 84(2): 247-261, 2003. Type: Article
Aug 27 2003
Statistical inference
Rohatgi V., Dover Publications, Incorporated, 2003.  948, Type: Book (9780486428123)
Jan 23 2004
Using GMDH for modeling economical indices of mine opening
Sarycheva L. Systems Analysis Modelling Simulation 43(10): 1341-1349, 2003. Type: Article
Mar 2 2004
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy