1 / 11

Canonical Correlation Analysis and Related Techniques

Canonical Correlation Analysis and Related Techniques. Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia University. L i n k i n g S c i e n c e t o S o c i e t y. Principal components

toya
Download Presentation

Canonical Correlation Analysis and Related Techniques

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Canonical Correlation Analysisand Related Techniques Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia University L i n k i n g S c i e n c e t o S o c i e t y

  2. Principal components A principal component is defined as a weighted sum of a set of original variables in which the variance of the weighted sum is maximized, subject to the constraint that the matrix of weights is orthogonal. The weights could be defined to meet any set of criteria. For example, rotation is used to redefine the weights so that simple structure is achieved. There are a number of related techniques that involve defining weighted sums for two sets of variables … L i n k i n g S c i e n c e t o S p o r t !

  3. Maximum Covariance Analysis Consider two sets of variables, X and Y, both with the same number of corresponding cases n. With principal components analysis, the aim is to define a set of weights, U, that generate new variables, Z, with maximum variance: Consider alternative sets of weights for X and Y (UX and UY) that maximize the covariances, but with similar orthogonality constraints: L i n k i n g S c i e n c e t o S p o r t !

  4. Maximum Covariance Analysis The covariance to be maximized between ZX and ZY is: In terms of X and Y: But XTY is the covariance matrix of X with Y, CXY. L i n k i n g S c i e n c e t o S p o r t !

  5. Maximum Covariance Analysis Rearranging: L i n k i n g S c i e n c e t o S p o r t !

  6. Maximum Covariance Analysis The covariance matrix is then expressed in terms of a diagonal matrix C, and two orthogonal matrices: This is a singular value decomposition of the covariance matrix. For this reason maximum covariance analysis (MCA) is sometimes called SVD analysis or canonical covariance analysis. L i n k i n g S c i e n c e t o S p o r t !

  7. Canonical Correlation Analysis Canonical correlation analysis is very similar to maximum covariance analysis except that the correlations rather than the covariances are maximized. Instead of: The Zs are standardized to Ws to obtain the correlation matrix, R: L i n k i n g S c i e n c e t o S p o r t !

  8. Canonical Correlation Analysis The standardized scores, W, are obtained by dividing the scores, Z, by their standard deviations, S (which are diagonal): Substituting the Zs: indicating that the matrices of weights are no longer orthogonal (cf. maximum covariance analysis). L i n k i n g S c i e n c e t o S p o r t !

  9. Canonical Correlation Analysis Note that CCA is not the same as MCA using standardized original data since it is the variance of the weighted sums that is standardized in CCA. Buell pattern and other loading interpretation problems apply to MCA and CCA just as they do in PCA. It is unusual to apply rotation, however, because the criteria of maximizing covariance / correlation would be lost. L i n k i n g S c i e n c e t o S p o r t !

  10. Buell patterns

  11. Redundancy Analysis Redundancy analysis is another similar technique except that the explained variance in the Y using the X is maximized. The technique differs from MCA and CCA in that only the ZXs are standardized. Note that different results will be obtained depending upon which dataset is X and which is Y. L i n k i n g S c i e n c e t o S p o r t !

More Related