
Evaluation – next steps


Presentation Transcript


  1. Evaluation – next steps: Lift and Costs

  2. Outline • Lift and Gains charts • *ROC • Cost-sensitive learning • Evaluation for numeric predictions

  3. Application Example: Direct Marketing Paradigm • Find the most likely prospects to contact • Not everybody needs to be contacted • Number of targets is usually much smaller than number of prospects • Typical Applications • retailers, catalogues, direct mail (and e-mail) • customer acquisition, cross-sell, attrition prediction • ...

  4. Direct Marketing Evaluation • Accuracy on the entire dataset is not the right measure • Approach • develop a target model • score all prospects and rank them by decreasing score • select top P% of prospects for action • How do we decide which selection is best?

  5. Model-Sorted List • Use a model to assign a score to each customer • Sort customers by decreasing score • Expect more targets (hits) near the top of the list • Example: 3 hits in the top 5% of the list • If there are 15 targets overall, then the top 5% holds 3/15 = 20% of the targets

  6. CPH (Cumulative Pct Hits) • Definition: CPH(P,M) = % of all targets in the first P% of the list scored by model M • CPH is frequently called Gains • [Chart: Cumulative % Hits vs. Pct of list] The top 5% of a random list holds 5% of the targets • Q: What is the expected value of CPH(P, Random)? • A: Expected value of CPH(P, Random) = P
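The score-sort-count procedure on the two slides above is straightforward to make concrete. Below is a minimal Python sketch of CPH(P, M); the scores and target labels are invented toy data for illustration.

```python
# Minimal sketch of CPH(P, M): score, sort by decreasing score,
# then count what fraction of all targets sit in the top P% of the list.

def cph(scores, is_target, p):
    """% of all targets found in the top p% of the model-sorted list."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    cutoff = max(1, round(len(scores) * p / 100))
    hits_in_top = sum(is_target[i] for i in order[:cutoff])
    return 100.0 * hits_in_top / sum(is_target)

# Toy data: 10 prospects, 3 of them actual targets.
scores    = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
is_target = [1,   1,   0,   1,   0,   0,   0,   0,   0,   0]
print(cph(scores, is_target, 20))  # top 20% holds 2/3 of targets -> 66.7
```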

  7. CPH: Random List vs. Model-Ranked List • [Chart: Cumulative % Hits vs. Pct of list] The top 5% of a random list holds 5% of the targets, but the top 5% of the model-ranked list holds 21% of the targets: CPH(5%, model) = 21%

  8. Lift • Lift(P,M) = CPH(P,M) / P, where P is the percent of the list • Lift(5%) = 21% / 5% = 4.2 times better than random • Note: some authors (including Witten & Eibe) use “Lift” for what we call CPH
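Lift then falls straight out of CPH. A one-line sketch, reusing the cph() helper from the sketch above:

```python
# Lift(P, M) = CPH(P, M) / P, reusing the cph() helper sketched earlier.
def lift(scores, is_target, p):
    return cph(scores, is_target, p) / p

# e.g. if CPH(5%, model) = 21%, then lift at 5% is 21 / 5 = 4.2
```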

  9. Lift Properties • Q: Lift(P,Random) = • A: 1 (expected value, can vary) • Q: Lift(100%, M) = • A: 1 (for any model M) • Q: Can lift be less than 1? • A: yes, if the model is inverted (all the non-targets precede targets in the list) • Generally, a better model has higher lift

  10. ROC curves • “ROC” stands for “receiver operating characteristic” • Used in signal detection to show the tradeoff between hit rate and false alarm rate over a noisy channel • Plots TP rate vs. FP rate • True positive rate, TP/(TP+FN), also known as recall: the fraction of positives that you correctly classify • False positive rate, FP/(FP+TN): the fraction of negatives that you incorrectly classify witten & eibe
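A ROC curve can be traced by sweeping a threshold down the model-sorted list and recording the (FP rate, TP rate) point at each cut. A minimal sketch, again using the invented toy data from the CPH sketch:

```python
# Sketch of computing ROC curve points by sweeping the score threshold.

def roc_points(scores, is_target):
    """Return (FP rate, TP rate) pairs, one per instance in score order."""
    pairs = sorted(zip(scores, is_target), reverse=True)
    pos = sum(is_target)
    neg = len(is_target) - pos
    tp = fp = 0
    points = [(0.0, 0.0)]
    for _, y in pairs:
        if y:
            tp += 1  # lowering the threshold past a target raises TP rate
        else:
            fp += 1  # past a non-target, it raises FP rate
        points.append((fp / neg, tp / pos))  # (FP/(FP+TN), TP/(TP+FN))
    return points

print(roc_points(scores, is_target))
```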

  11. A sample ROC curve • Jagged curve: produced from a single set of test data • Smooth curve: produced by averaging over cross-validation folds witten & eibe

  12. ROC curves for two schemes • For a small, focused sample, use method A • only need to act on a few predictions • For a larger one, use method B • In between, choose between A and B with appropriate probabilities witten & eibe
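The “appropriate probabilities” remark can be read as randomizing between the two classifiers: using scheme A with probability q and scheme B otherwise yields an expected operating point a fraction q of the way along the segment from B’s (FP rate, TP rate) point to A’s. A sketch, where predict_a and predict_b are hypothetical classifier functions:

```python
import random

# Sketch: reach an operating point between schemes A and B by using
# A with probability q and B with probability 1 - q on each instance.
def combined_predict(x, predict_a, predict_b, q):
    return predict_a(x) if random.random() < q else predict_b(x)

# Expected TP rate = q * tp_a + (1 - q) * tp_b, and likewise for FP rate,
# i.e. any point on the line segment between A's and B's ROC points.
```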

  13. Cost Sensitive Learning • There are two types of errors: false positives (FP) and false negatives (FN) • Machine learning methods usually minimize FP+FN • Direct marketing maximizes TP

  14. Different Costs • In practice, false positive and false negative errors often incur different costs • Examples: • Medical diagnostic tests: does X have leukemia? • Loan decisions: approve mortgage for X? • Web mining: will X click on this link? • Promotional mailing: will X buy the product? • …

  15. Cost-sensitive learning • Most learning schemes do not perform cost-sensitive learning • They generate the same classifier no matter what costs are assigned to the different classes • Example: standard decision tree learner • Simple methods for cost-sensitive learning: • Re-sampling of instances according to costs • Weighting of instances according to costs • Changing the probability threshold (see the sketch below)
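The third method, changing the probability threshold, is the simplest to sketch. If a false positive costs c_fp and a false negative costs c_fn, predicting positive has lower expected cost whenever p·c_fn > (1 − p)·c_fp, i.e. when p > c_fp / (c_fp + c_fn). The cost figures below are invented for illustration:

```python
# Sketch of cost-sensitive classification via a shifted threshold.
# With a missed target (FN) costing 5x a wasted contact (FP), the
# break-even threshold drops from 0.5 to c_fp / (c_fp + c_fn).

def cost_sensitive_label(p, c_fp=1.0, c_fn=5.0):
    threshold = c_fp / (c_fp + c_fn)
    return 1 if p > threshold else 0

print(cost_sensitive_label(0.3))  # 1: 0.3 exceeds the threshold 1/6 = 0.17
```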

  16. Evaluating numeric prediction • Same strategies: independent test set, cross-validation, significance tests, etc. • Difference: error measures • Actual target values: a1, a2, …, an • Predicted target values: p1, p2, …, pn • Most popular measure: the mean-squared error, ((p1 − a1)² + … + (pn − an)²) / n • Easy to manipulate mathematically witten & eibe

  17. Other measures • The root mean-squared error: √(((p1 − a1)² + … + (pn − an)²) / n) • The mean absolute error is less sensitive to outliers than the mean-squared error: (|p1 − a1| + … + |pn − an|) / n witten & eibe
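The three error measures on the last two slides, written out in plain Python for actual values a_i and predictions p_i:

```python
import math

# Error measures for numeric prediction, as defined on the slides above.

def mse(a, p):
    return sum((pi - ai) ** 2 for ai, pi in zip(a, p)) / len(a)

def rmse(a, p):
    return math.sqrt(mse(a, p))

def mae(a, p):
    # errors are not squared, so outliers weigh in less heavily
    return sum(abs(pi - ai) for ai, pi in zip(a, p)) / len(a)

actual    = [3.0, -0.5, 2.0, 7.0]
predicted = [2.5,  0.0, 2.0, 8.0]
print(mse(actual, predicted), rmse(actual, predicted), mae(actual, predicted))
```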
