1 / 24

Efficient Rule Testing in Learning By Observation

Efficient Rule Testing in Learning By Observation. Tolga Konik University of Michigan. GOAL. Storing and accessing the experience of an agent efficiently Key component of our learning by observation system Examining the behavior history of a Soar agent (i.e. for debugging).

raiden
Download Presentation

Efficient Rule Testing in Learning By Observation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Efficient Rule Testing in Learning By Observation Tolga Konik University of Michigan

  2. GOAL • Storing and accessing the experience of an agent efficiently • Key component of our learning by observation system • Examining the behavior history of a Soar agent (i.e. for debugging)

  3. Learning By Observation Expert Interface Environment Wme changes Selected operators Episodic Database Domain Knowledge Soar Agent Behavior history Soar Program Horn Clauses ILP Learning Knowledge Generation

  4. connection door d1 d2 connection r2 in - room destination pathdoor door 1 distance pathdoor p1 “room1” p2 r3 destination path name path distance r1 r1 path item 2 “health” door in - room current current - - room room type d5 w3 AGENT AGENT other sensors other sensors Snapshop of Working Memory

  5. Snapshop of Working Memory • Situation: • a snapshot of the working memory at a time AGENT BEHAVIOR HISTORY : The Set of Situations

  6. Example Situation Interval: Positive Examples of Termination Regions operator A is selected A A AGENT BEHAVIOR HISTORY : The Set of Situations Positive examples of Termination Condition of A

  7. Example Situation Interval: Positive Examples of Termination • hypotheses are tested over each “positive situation” A A AGENT BEHAVIOR HISTORY : The Set of Situations

  8. Storing Explicit Situations • For each situation, store all WMEs • Space inefficient

  9. Storing Changes in Situations • Store only the changes of WMEs • i.e. KnoMic (van Lent 2000) • the database has to be traced forward starting from the initial situation. • Difficult to use with learning algorithms that deal with noise

  10. Storing Indexed Changes • Store the changes to WMEs and use index mechanism to efficiently access situations

  11. Rule Testing at a Situation • Given: • R1 is a WME id representing a room • S30 is a situation • Example Query: • (R1, ^contains-item, ?VALUE) at S30. • Output: • Return all VALUE such that (R1 ^contains-item, VALUE)holds at S30.

  12. Storing Indexed Changes • A binary search tree, for each (WME Id, Attribute) • For each query: • Locate the corresponding search tree • Find values traversing the tree

  13. Binary Search Tree Example • Search tree of (R1, contains-item) 40 65 20 15 25 50 80 i1 i1 i1 i1 i2 i3 i3 i2 i2 i3 i3 i3 start 25 40 50 65 80 15 20

  14. VALUE= i1, i2, i3 Binary Search Tree Example • Query: (R1, ^contains-item, ?VALUE) at S30 40 30 65 20 30 15 25 50 80 30 i1 i1 i1 i1 i2 i3 i3 i2 i2 i3 i3 i3 start 25 40 50 65 80 15 20

  15. Rule Testing at a Situation • Using this scheme, a rule can be tested at any situation in the history of an agent. • This provides space efficiency, while keeping the access times reasonable. • We want more than that !

  16. Rule Testing over a Range of Situations • GOAL • test a rule over a range of situations at once. • MOTIVATION • In consecutive situations, similar conditions hold • RESULT • More efficient than testing a rule at each situation individually

  17. Testing a Condition on Multiple States • Query: • (R1, contains-item, ?VALUE) at S1-S10, S30-S45 • Output: • A Set of (VALUE, Situation-Range) pairs

  18. Testing a Condition on Multiple States 1-10, 30-45 40 1-10, 30 – 39 40 - 45 65 20 1-10 40 - 45 30 – 39 40 - 45 15 25 50 80 1-10 30 – 39 i1 i1 i1 i1 i2 i3 i3 i2 i2 i3 i3 i3 start 25 40 50 65 80 15 20

  19. Testing a Condition on Multiple States 30-39, 40-45 i1 • (R1 ^contains-item ) at • (R1 ^contains-item ) at • (R1 ^contains-item ) at 1-10, 30-45 30-39 i2 40 1-10, 30 – 39 40 - 45 30-39, 40-45 i3 65 20 1-10 40 - 45 30 – 39 40 - 45 15 25 50 80 1-10 30 – 39 i1 i1 i1 i1 i2 i3 i3 i2 i2 i3 i3 i3 start 25 40 50 65 80 15 20

  20. Testing a Rule over a Range of Situations • A Rule contains multiple conditions • i.e. C1 and C2 … • Each Condition Ci propagates: • the values • the set of situations that satisfy Ci and previous conditions • This idea can be also generalized for rules that contain “not” and “or”

  21. Why is this useful? • In Learning by Observation : • The rules are checked over continues ranges A A

  22. Why is this useful? • In Soar Programing: • It may help to understand the behavior of a Soar agent. • i.e. Show me all situations when a Tanksoar agent has perceived a threat but did not fire.

  23. Nuggests • Range Testing is efficient • Space complexity • Significantly better than explicit storage • Proportional to the number of changes (not situations) • Time Complexity: • Significantly better than testing at each situation using the index. • Can potentially perform better than storing situations explicitly

  24. Coals • Performance depends on the Database & Rules • Multivalued attributes that change value often cause space inefficiency • As the rules gets longer, the query mechanism time efficiency decrease • multivalued attributes and “or” connections decrease time efficiency. • No direct connection to Soar in current implementation

More Related