1 / 25

Machine Learning Chapter 10. Learning Sets of Rules

Machine Learning Chapter 10. Learning Sets of Rules. Tom M. Mitchell. Learning Disjunctive Sets of Rules. Method 1: Learn decision tree, convert to rules Method 2: Sequential covering algorithm: 1. Learn one rule with high accuracy, any coverage

vaughan
Download Presentation

Machine Learning Chapter 10. Learning Sets of Rules

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Machine LearningChapter 10. Learning Sets of Rules Tom M. Mitchell

  2. Learning Disjunctive Sets of Rules • Method 1: Learn decision tree, convert to rules • Method 2: Sequential covering algorithm: 1. Learn one rule with high accuracy, any coverage 2. Remove positive examples covered by this rule 3. Repeat

  3. Sequential Covering Algorithm SEQUENTIAL- COVERING (Target attribute; Attributes; Examples; Threshold) • Learned rules  {} • Rule  LEARN-ONE-RULE(Target_attribute, Attributes, Examples) • while PERFORMANCE (Rule, Examples) > Threshold, do • Learned_rules  Learned_rules + Rule • Examples Examples – {examples correctly classified by Rule} • Rule  LEARN-ONE-RULE (Target_attribute, Attributes, Examples) • Learned_rules  sort Learned_rules accord toPERFORMANCE over Examples • return Learned_rules

  4. Learn-One-Rule

  5. Learn-One-Rule(Cont.) • Pos positive Examples • Neg negative Examples • while Pos, doLearn a NewRule- NewRule  most general rule possible- NewRule Neg - while NewRuleNeg, do Add a new literal to specialize NewRule 1. Candidate literals generate candidates 2. Best_literal  argmaxLCandidate literals Performance(SpecializeRule(NewRule; L)) 3. add Best_literal to NewRule preconditions 4. NewRuleNeg  subset of NewRuleNeg that satisfies NewRule preconditions - Learned_rules  Learned_rules + NewRule - Pos  Pos – {members of Pos coverd by NewRule} • Return Learned_rules

  6. Subtleties: Learn One Rule 1. May use beam search 2. Easily generalizes to multi-valued target functions 3. Choose evaluation function to guide search: • Entropy (i.e., information gain) • Sample accuracy: where nc = correct rule predictions, n = all predictions • m estimate:

  7. Variants of Rule Learning Programs • Sequential or simultaneous covering of data? • General  specific, or specific  general? • Generate-and-test, or example-driven? • Whether and how to post-prune? • What statistical evaluation function?

  8. Learning First Order Rules Why do that? • Can learn sets of rules such asAncestor(x, y)Parent(x; y)Ancestor(x; y)  Parent(x; z) ^ Ancestor(z; y) • General purpose programming languagePROLOG : programs are sets of such rules

  9. First Order Rule for Classifying Web Pages [Slattery, 1997] course(A)  has-word(A, instructor), Not has-word(A, good), link-from(A, B), has-word(B, assign), Not link-from(B, C) Train: 31/31, Test: 31/34

  10. Specializing Rules in FOIL

  11. Information Gain in FOIL

  12. Induction as Inverted Deduction

  13. Induction as Inverted Deduction(Cont’)

  14. Induction as Inverted Deduction(Cont’) Induction is, in fact, the inverse operation of deduction, and cannot be conceived to exist without the corresponding operation, so that the question of relative importance cannot arise. Who thinks of asking whether addition or subtraction is the more important process in arithmetic? But at the same time much difference in difficulty may exist between a direct and inverse operation; : : : it must be allowed that inductive investigations are of a far higher degree of difficulty and complexity than any questions of deduction….(Jevons 1874)

  15. Induction as Inverted Deduction(Cont’)

  16. Induction as Inverted Deduction(Cont’)

  17. Induction as Inverted Deduction(Cont’)

  18. Deduction: Resolution Rule

  19. Inverting Resolution

  20. Inverted Resolution (Propositional)

  21. First order resolution

  22. Inverting First order resolution

  23. Cigol

  24. Progol

More Related