1 / 44

Clustering

Clustering. The process of grouping samples so that the samples are similar within each group. Clustering. Algorithm of Clustering. Hierarchical clustering Organizes the data into larger groups, which contain smaller groups, like a tree or dendrogram.

jeggen
Download Presentation

Clustering

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Clustering • The process of grouping samples so that the samples are similar within each group.

  2. Clustering

  3. Algorithm of Clustering • Hierarchical clustering • Organizes the data into larger groups, which contain smaller groups, like a tree or dendrogram. • Algorithms :Agglomerative,Single-linkage, complete-linkage, average-linkage, Ward…. • Partitional clustering • To create one set of clusters that partitions the data into similar groups. • Algorithms: Forgy’s, k-means, Isodata… • SOM,CLICK, CAST, …

  4. Figures of Hierarchical Clustering 1‘ 1 2 3 4 5

  5. Figures of Hierarchical Clustering 2‘ 1 2 3 4 5

  6. Figures of Hierarchical Clustering 2‘ 3‘ 1 2 3 4 5

  7. Figures of Hierarchical Clustering 1 2 3 4 5

  8. Hierarchical Clustering

  9. K-mean approach • One more input k is required. There are many variants of k-mean. • Sum-of squares criterion • minimize

  10. An example of k-mean approach • Two passes • Begin with k clusters, each consisting of one of the first k samples. For the remaining n-k samples, find the centroid nearest it. After each sample is assigned, re-compute the centroid of the altered cluster. • For each sample, find the centroid nearest it. Put the sample in the cluster identified with this nearest centroid. ( do not need to re-compute.)

  11. Examples

  12. Examples

  13. Examples

  14. Examples

  15. Examples

  16. Examples

  17. Self Organizing Maps

  18. Examples

  19. Examples

  20. Examples

  21. Examples

  22. Examples

  23. Examples

  24. Examples

  25. Examples

  26. Examples

  27. Examples

  28. Examples

  29. CLICK • Use graph theory • Connected component • The edge weight is calculated by statistical probabilities

More Related