1 / 49

COMP 328: Final Review Spring 2010

COMP 328: Final Review Spring 2010. Nevin L. Zhang Department of Computer Science & Engineering The Hong Kong University of Science & Technology http://www.cse.ust.hk/~lzhang/ Can be used as cheat sheet. Pre-Midterm. Algorithms for supervised learning Decision trees

maddena
Download Presentation

COMP 328: Final Review Spring 2010

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. COMP 328: Final Review Spring 2010 Nevin L. Zhang Department of Computer Science & Engineering The Hong Kong University of Science & Technology http://www.cse.ust.hk/~lzhang/ Can be used as cheat sheet

  2. Pre-Midterm • Algorithms for supervised learning • Decision trees • Instance-based learning • Naïve Bayes classifiers • Neural networks • Support vector machines • General issues regarding supervised learning • Classification error and confidence interval • Bias-Variance tradeoff • PAC learning theory

  3. Post-Midterm • Clustering • Distance-Based Clustering • Model-Based Clustering • Dimension Reduction • Principal Component Analysis • Reinforcement Learning • Ensemble Learning

  4. Clustering

  5. Distance/Similarity Measures

  6. Distance-Based Clustering • Partitional and Hierarchical clustering

  7. K-Means: Partitional Clustering

  8. K-Means: Partitional Clustering • Different initial points might lead to different partitions • Solution: • Multiple runs, • Use evaluation criteria such as SSE to pick the best one

  9. Hierarchical Clustering • Agglomerative and Divisive

  10. Cluster Similarity

  11. Cluster Validation • External indices • Entropy: Average purity of clusters obtained • Mutual Information between class label and cluster label

  12. Cluster Validation • External Measure • Jaccard Index • Rand Index Measure similarity between two relationships: in-same-class & in-same-cluster

  13. Cluster Validation • Internal Measure • Dunn’s index

  14. Cluster Validation • Internal Measure

  15. Post-Midterm • Clustering • Distance-Based Clustering • Model-Based Clustering • Dimension Reduction • Principal Component Analysis • Reinforcement Learning • Ensemble Learning

  16. Model-Based Clustering • Assume data generated from a mixture model with K components • Estimate parameters of the model from data • Assign objects to clusters based posterior probability: Soft Assignment

  17. Gaussian Mixtures

  18. Learning Gaussian Mixture Models

  19. EM

  20. EM

  21. EM • l(t): Log likelihood of model after t-th iteration • l(t): increases monotonically with t • But might go to infinite in case of singularity • Solution: place bound on eigen values of covariance matrix • Local maximum • Multiple restart • Use likelihood to pick best model

  22. EM and K-Means • K-Means is hard-assignment EM

  23. Mixture Variable for Discrete Data

  24. Latent Class Model

  25. Learning Latent Class Models Always converges

  26. Post-Midterm • Clustering • Distance-Based Clustering • Model-Based Clustering • Dimension Reduction • Principal Component Analysis • Reinforcement Learning • Ensemble Learning

  27. Dimension Reduction • Necessary because there are data sets with large numbers of attributes that are difficult to learning algorithms to handle.

  28. Principal Component Analysis

  29. PCA Solution

  30. PCA Illustration

  31. Eigenvalues and Projection Error

  32. Post-Midterm • Clustering • Distance-Based Clustering • Model-Based Clustering • Dimension Reduction • Principal Component Analysis • Reinforcement Learning • Ensemble Learning

  33. Reinforcement Learning

  34. Markov Decision Process • A model of how agent interact with its environment

  35. Markov Decision Process

  36. Value Iteration

  37. Reinforcement Learning

  38. Q-Learning

  39. Q-Learning • From Q-function based value iteration • Ideas • In-place/asynchronous value iteration • Approximate expectation using samples • ε-greedy policy (for exploration/exploitation) tradeoff

  40. Time Difference Learning

  41. Sarsa is also time difference learning

  42. Post-Midterm • Clustering • Distance-Based Clustering • Model-Based Clustering • Dimension Reduction • Principal Component Analysis • Reinforcement Learning • Ensemble Learning

  43. Ensemble Learning

  44. Bagging: Reduce Variance

  45. Boosting: Reduce Classification Error

  46. AdaBoost: Exponential Error

More Related