
Lecture 3-4: Clustering (1hr), Gaussian Mixture and EM (1hr)

Tae-Kyun Kim



Presentation Transcript


  1. Lecture 3-4: Clustering (1hr), Gaussian Mixture and EM (1hr). Tae-Kyun Kim

  2. Vector Clustering. 2D data vectors (green) are grouped into two homogeneous clusters (blue and red). Clustering is achieved by an iterative algorithm (left to right). The cluster centres are marked x.

  3. Pixel Clustering (Image Quantisation). Image pixels are represented as 3D vectors of R, G, B values. The vectors are grouped into K = 10, 3, 2 clusters, and each pixel is represented by the mean value of its cluster.

  4. Patch Clustering (BoW in Lecture 9-10). Image patches are harvested around feature points in a large number of images. They are represented by finite-dimensional vectors and clustered to form a visual dictionary. (Figure: 20×20 patches described by SIFT or raw pixels, dimension D = 400, clustered into K codewords.)

  5. Image Clustering. Whole images are represented as finite-dimensional vectors. Homogeneous vectors are grouped together in Euclidean space.

  6. K-means vs GMM. Two representative techniques are K-means and the Gaussian Mixture Model (GMM). K-means assigns each data point to its nearest cluster, while a GMM assigns data to the Gaussian densities that best represent them. Hard clustering: a data point is assigned to exactly one cluster. Soft clustering: a data point is assigned to multiple Gaussians probabilistically.
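The hard/soft distinction can be illustrated with a small sketch. The toy data, centres, and Gaussian parameters below are illustrative assumptions (not from the lecture), and numpy is used in place of the Matlab toolbox referenced later:

```python
import numpy as np

# Toy 1-D data: two loose groups around 0 and 10 (hypothetical values).
x = np.array([0.0, 0.5, 1.0, 9.0, 9.5, 10.0])

# Hard clustering (K-means style): each point belongs to exactly one
# cluster, the one with the nearest centre.
centres = np.array([0.5, 9.5])
hard = np.argmin(np.abs(x[:, None] - centres[None, :]), axis=1)

# Soft clustering (GMM style): each point gets a responsibility for
# every Gaussian, proportional to pi_k * N(x | mu_k, sigma_k^2).
pi = np.array([0.5, 0.5])      # mixing weights
mu = np.array([0.5, 9.5])      # means
sigma = np.array([1.0, 1.0])   # standard deviations

dens = pi * np.exp(-0.5 * ((x[:, None] - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))
resp = dens / dens.sum(axis=1, keepdims=True)  # responsibilities sum to 1 per point

print(hard)           # one hard label per point
print(resp.round(3))  # soft memberships per point
```

Points far from a cluster boundary get responsibilities near 0 or 1, so for well-separated data the soft assignments approach the hard ones.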

  7. Matrix and Vector Derivatives
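The slide's equations are not preserved in the transcript; the standard vector-derivative identities typically used for the K-means and maximum-likelihood derivations that follow are:

```latex
\frac{\partial}{\partial \mathbf{x}}\,\mathbf{a}^{\top}\mathbf{x} = \mathbf{a},
\qquad
\frac{\partial}{\partial \mathbf{x}}\,\mathbf{x}^{\top}\mathbf{A}\mathbf{x}
  = (\mathbf{A} + \mathbf{A}^{\top})\,\mathbf{x},
\qquad
\frac{\partial}{\partial \mathbf{x}}\,\|\mathbf{x} - \boldsymbol{\mu}\|^{2}
  = 2\,(\mathbf{x} - \boldsymbol{\mu})
```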

  8. K-means Clustering

  9. Alternate the two steps until convergence: the assignment step sets r_nk = 1 for the nearest cluster centre (0 otherwise), and the update step recomputes each centre μ_k as the mean of the points assigned to it.
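This assign-then-update loop can be sketched in numpy (a minimal sketch assuming Euclidean distance; the function name and initialisation scheme are illustrative, not from the lecture):

```python
import numpy as np

def kmeans(X, K, n_iter=100, seed=0):
    """Plain K-means: alternate assignment and mean-update until converged."""
    rng = np.random.default_rng(seed)
    mu = X[rng.choice(len(X), K, replace=False)]  # centres from random points
    for _ in range(n_iter):
        # Assignment step: r_n = index of the nearest centre.
        d = ((X[:, None, :] - mu[None, :, :]) ** 2).sum(axis=2)
        r = d.argmin(axis=1)
        # Update step: each centre becomes the mean of its assigned points
        # (an empty cluster keeps its previous centre).
        new_mu = np.array([X[r == k].mean(axis=0) if np.any(r == k) else mu[k]
                           for k in range(K)])
        if np.allclose(new_mu, mu):  # no centre moved: converged
            break
        mu = new_mu
    return mu, r
```

Calling `kmeans(X, K=2)` on an (N, D) array returns the K centres and one hard label per point.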

  10. (Figure: K-means iterations on 2D data with K = 2, showing the assignments r_nk and the two means μ_1, μ_2.)

  11. Convergence proof: yes, K-means is guaranteed to converge, since each step decreases the objective. Global minimum: no; the result depends on the initialisation.


  13. Statistical Pattern Recognition Toolbox for Matlab http://cmp.felk.cvut.cz/cmp/software/stprtool/ …\stprtool\probab\cmeans.m …\stprtool\probab\cmeans_tk.m

  14. Mixture of Gaussians
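The slide's formulas are not in the transcript; the standard form of the mixture density is:

```latex
p(\mathbf{x}) = \sum_{k=1}^{K} \pi_k \,
  \mathcal{N}(\mathbf{x} \mid \boldsymbol{\mu}_k, \boldsymbol{\Sigma}_k),
\qquad
\sum_{k=1}^{K} \pi_k = 1, \quad 0 \le \pi_k \le 1,
```

where π_k are the mixing weights and each component is a Gaussian with mean μ_k and covariance Σ_k.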

  15. Maximum Likelihood. Maximise the log-likelihood ln p(X) = Σ_n ln Σ_k π_k N(x_n | μ_k, Σ_k) with respect to {π_k, μ_k, Σ_k}, subject to Σ_k π_k = 1.

  16. Lagrange multipliers: for an objective function f(x) and a constraint g(x) = 0, the problem max f(x) s.t. g(x) = 0 is solved by maximising the Lagrangian f(x) + λ g(x). http://en.wikipedia.org/wiki/Lagrange_multiplier

  17. Repeat the E-step (compute the responsibilities γ_nk) and the M-step (re-estimate π_k, μ_k, Σ_k from the weighted data) until the log-likelihood converges.

  18. Statistical Pattern Recognition Toolbox for Matlab http://cmp.felk.cvut.cz/cmp/software/stprtool/ …\stprtool\visual\pgmm.m …\stprtool\demos\demo_emgmm.m

  19. Supplementary Material

  20. Information Theory (for Lecture 7-8)

  21. Advanced topic (optional) http://www.iis.ee.ic.ac.uk/~tkkim/mlcv/lecture_clustering_em.pdf

  22. EM Algorithm in General
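The slide's derivation is not in the transcript; the standard general form of EM, with latent variables Z and parameters θ, alternates:

```latex
\text{E-step:}\quad
Q(\boldsymbol{\theta}, \boldsymbol{\theta}^{\text{old}})
  = \sum_{\mathbf{Z}} p(\mathbf{Z} \mid \mathbf{X}, \boldsymbol{\theta}^{\text{old}})
    \ln p(\mathbf{X}, \mathbf{Z} \mid \boldsymbol{\theta}),
\qquad
\text{M-step:}\quad
\boldsymbol{\theta}^{\text{new}}
  = \arg\max_{\boldsymbol{\theta}} Q(\boldsymbol{\theta}, \boldsymbol{\theta}^{\text{old}})
```

Each iteration cannot decrease the log-likelihood ln p(X | θ); the GMM updates above are the special case where Z is the component label.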
