1 / 20

K -means Clustering via Principal Component Analysis

K -means Clustering via Principal Component Analysis. According to the paper by Chris Ding and Xiaofeng He from Int’l Conf. Machine Learning, Banff, Canada, 2004. Traditional K -means Clustering. Minimizing the sum of squared errors. Where data matrix. Centroid of cluster C k.

mahdis
Download Presentation

K -means Clustering via Principal Component Analysis

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. K-means Clustering via Principal Component Analysis According to the paper by Chris Ding and Xiaofeng He from Int’l Conf. Machine Learning, Banff, Canada, 2004

  2. Traditional K-means Clustering Minimizing the sum of squared errors Where data matrix Centroid of cluster Ck nk is the number of points in Ck

  3. Principal Component Analysis (PCA) Centered data matrix Covariance matrix is ignored Factor

  4. PCA - continuation Eigenvalues and eigenvectors Singular value decomposition (SVD)

  5. PCA - example

  6. K-means → PCA Indikator vectors Criterion Linear transform by K× Korthonormal matrix T Last column of T

  7. K-means → PCA - continuation Therefore Criterion Optimization becomes Solution is first K-1 principal components

  8. PCA→ K-means Clustering by PCA Probability of connectivity between i and j

  9. Eigenvalues • 1. case 164030, 58, 5 • 2. case 212920, 1892, 157

  10. Thank you for your attention

More Related