1 / 75

Data Mining: Concepts and Techniques Slides for Textbook Chapter 7

August 18, 2012. Data Mining: Concepts and Techniques. 2. Chapter 7. Classification and Prediction. What is classification? What is prediction?Issues regarding classification and predictionClassification by decision tree inductionBayesian ClassificationClassification by backpropagationClassific

Patman
Download Presentation

Data Mining: Concepts and Techniques Slides for Textbook Chapter 7

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


    1. August 19, 2012 Data Mining: Concepts and Techniques 1 Data Mining: Concepts and Techniques — Slides for Textbook — — Chapter 7 — ©Jiawei Han and Micheline Kamber Intelligent Database Systems Research Lab School of Computing Science Simon Fraser University, Canada http://www.cs.sfu.ca

    2. August 19, 2012 Data Mining: Concepts and Techniques 2 Chapter 7. Classification and Prediction What is classification? What is prediction? Issues regarding classification and prediction Classification by decision tree induction Bayesian Classification Classification by backpropagation Classification based on concepts from association rule mining Other Classification Methods Prediction Classification accuracy Summary

    3. August 19, 2012 Data Mining: Concepts and Techniques 3 Classification: predicts categorical class labels classifies data (constructs a model) based on the training set and the values (class labels) in a classifying attribute and uses it in classifying new data Prediction: models continuous-valued functions, i.e., predicts unknown or missing values Typical Applications credit approval target marketing medical diagnosis treatment effectiveness analysis Classification vs. Prediction

    4. August 19, 2012 Data Mining: Concepts and Techniques 4 Classification—A Two-Step Process Model construction: describing a set of predetermined classes Each tuple/sample is assumed to belong to a predefined class, as determined by the class label attribute The set of tuples used for model construction: training set The model is represented as classification rules, decision trees, or mathematical formulae Model usage: for classifying future or unknown objects Estimate accuracy of the model The known label of test sample is compared with the classified result from the model Accuracy rate is the percentage of test set samples that are correctly classified by the model Test set is independent of training set, otherwise over-fitting will occur

    5. August 19, 2012 Data Mining: Concepts and Techniques 5 Classification Process (1): Model Construction

More Related