1 / 12

Advanced Covering Algorithms in Data Mining

Explore advanced covering algorithms in data mining and learn how to refine rules to improve accuracy and efficiency. Discover the PRISM method and its application in decision lists.

polingr
Download Presentation

Advanced Covering Algorithms in Data Mining

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data MiningCSCI 307, Spring 2019Lecture 17 Covering algorithms II

  2. Covering Example continued After first rule established, delete instances covered by the first rule and start again. "Fresh" data set

  3. Example part 2: Contact Lens Data Rule we seek: Possible Tests: Age = Young Age = Pre-presbyopic Age = Presbyopic Spectacle prescription = Myope Spectacle prescription = Hypermetrope Astigmatism = no Astigmatism = yes Tear production rate = Reduced Tear production rate = Normal

  4. part 2: Modified Rule and Resulting Data Rule with the best test is added: Instances covered by modified rule:

  5. Example part 2: Refine Rule we seek: Possible Tests: Spectacle prescription = Myope Spectacle prescription = Hypermetrope Astigmatism = no Astigmatism = yes Tear production rate = Reduced Tear production rate = Normal

  6. part 2: Modified Rule and Resulting Data Rule with the best test is added: Instances covered by modified rule:

  7. Example part 2: Refine More Rule we seek: Possible Tests: Spectacle prescription = Myope Spectacle prescription = Hypermetrope Tear production rate = Reduced Tear production rate = Normal

  8. The Result Final Rules: Second rule for recommending "hard lenses": (build from the instances not covered by the first rule.) If astigmatism = yes and tear production rate = normal and spectacle prescription = myope then recommendation = hard

  9. The Rest of the PRISM Rules If astigmatism = no and tear-prod-rate = normal and spectacle-prescription = hypermetrope then soft If astigmatism = no and tear-prod-rate = normal and age = young then soft If age = pre-presbyopic and astigmatism = no and tear-prod-rate = normal then soft If tear-prod-rate = reduced then none If age = presbyopic and tear-prod-rate = normal and spectacle-prescription = myope and astigmatism = no then none If spectacle-prescrip = hypermetrope and astigmatism = yes and age = pre-presbyopic then none If age = presbyopic and spectacle-prescription = hypermetrope and astigmatism = yes then none

  10. Pseudo-Code for PRISM

  11. Rules versus Decision Lists • PRISM with outer loop removed generates a decision list for one class • Subsequent rules are designed for rules that are not covered by previous rules • But: order doesn’t matter because all rules predict the same class • Outer loop considers all classes separately • No order dependence implied • Problems: overlapping rules, default rule required

  12. Separate and Conquer • Methods like PRISM (for dealing with one class) are separate-and-conqueralgorithms: • First, identify a useful rule • Then, separate out all the instances it covers • Finally, “conquer” the remaining instances

More Related