180 likes | 348 Views
A Link-Based Cluster Ensemble Approach for Categorical Data Clustering. Presenter : Jian-Ren Chen Authors : Natthakan Iam -On, Tossapon Boongoen , Simon Garrett, and Chris Price 2012 , IEEE. Outlines. Motivation Objectives Methodology Experiments Conclusions Comments.
E N D
A Link-Based Cluster Ensemble Approach for Categorical Data Clustering Presenter : Jian-Ren ChenAuthors : NatthakanIam-On, TossaponBoongoen,Simon Garrett, and Chris Price 2012 , IEEE
Outlines • Motivation • Objectives • Methodology • Experiments • Conclusions • Comments
Motivation • Cluster Ensembles: combine different clustering decisions in such a way as to achieve accuracy superior to that of any individual clustering.
Objectives • A new link-based approach improves the conventional matrix by discovering unknown entries through similarity between clusters in an ensemble.
Methodology Creating a Cluster Ensemble Generating a Refined Matrix Applying a Consensus Function to RM
Methodology Creating a Cluster Ensemble Type I (Direct ensemble): Generating a Refined Matrix Applying a Consensus Function to RM Type III (Subspace ensemble) Type II (Full-space ensemble)
Methodology Creating a Cluster Ensemble Generating a Refined Matrix Applying a Consensus Function to RM
Methodology Creating a Cluster Ensemble Generating a Refined Matrix Applying a Consensus Function to RM
Methodology Creating a Cluster Ensemble • given a graph G = (V,W) • SPEC finds the K largest eigenvectors of W • formed another matrix U Generating a Refined Matrix Applying a Consensus Function to RM
Experiments • Investigated Data Sets
Conclusions • Constructing the RM is efficiently resolved by the similarity among categorical labels, using the Weighted Triple-Quality similarity algorithm. • The link-based method usually achieves superior clustering results.
Comments • Advantages • The link-based method is efficient. • Applications • Categorical Data Clustering