1 / 65

Decision-Making Under Uncertainty

Decision-Making Under Uncertainty . Overview. Two models of decision-making under uncertainty Nondeterministic uncertainty Probabilistic uncertainty In particular, what is the rational action in the presence of repeated states?. Game Playing Review. Game playing with an adversary

kasi
Download Presentation

Decision-Making Under Uncertainty

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Decision-Making Under Uncertainty

  2. Overview • Two models of decision-making under uncertainty • Nondeterministic uncertainty • Probabilistic uncertainty • In particular, what is the rational action in the presence of repeated states?

  3. Game Playing Review • Game playing with an adversary • Nondeterministic uncertainty => set of possible future states • Need a plan for all states that guarantees a win will be reached • Minimax works with no repeated states

  4. Action Uncertainty • Each action representation is of the form: Action: a(s) -> {s1,…,sr} • where each si, i = 1, ..., r describes one possible effect of the action in a state s

  5. R1 R2 Example: Devious Vacuum Robot Right(s) (In(R1) holds in s) {s1 = In(R2), s2 = Clean(R1)} Right may cause the robot to move to room R2(s1), or to dump dust and stay in R1(s2) Not intentional, so unpredictable

  6. Left (In(R2) holds in s) {s1= In(R1)} Left always leads the robot to move to R1

  7. Suck if(In(R1) holds in s) {s1= Clean(R1)} Suck(R1) always leads the robot to do the right thing

  8. Suck if(In(R1) holds in s) {s1= Clean(R1)} else {s1 = Clean(R2), s2= Clean(R2), In(R1)} But Suck(R2) may also cause the robot to move to R1

  9. R1 R2 Problem • From the initial state: • our devious vacuum robot must achieve the goal Clean(R1)  Clean(R2) • We want a guaranteed plan, i.e., one that works regardless of which action outcomes actually occur

  10. Right Suck(R1) loop action nodes(world “decision” nodes) state nodes(agent decision nodes) AND/OR Tree

  11. Right Suck(R1) loop Left Suck(R2) Right Suck(R1) Suck(R2) loop goal goal loop loop AND/OR Tree Does the problem have a solution? [not as obvious as it looks]

  12. Labeling an AND/OR Tree • Assume no detection of revisited states

  13. Labeling an AND/OR Tree • A leaf state node is solved if it’s a goal state • A leaf state node is closed if it has no successor and is not a goal

  14. Labeling an AND/OR Tree • A leaf state node is solved if it’s a goal state • A leaf state node is closed if it has no successor and is not a goal

  15. Labeling an AND/OR Tree • A leaf state node is solved if it’s a goal state • A leaf state node is closed if it has no successor and is not a goal • A non-leaf state node is solved if one of its children is solved • A non-leaf state node is closed if all its children are closed

  16. It was useless to expand this node Labeling an AND/OR Tree • The problem is solved when the root node is solved • The problem is impossible if the root node is closed

  17. a1 s1 s2 a2 a3 Solution of an AND/OR Tree Conditional plan: • Perform a1 • If s1 is observed then perform a2 • Else if s2 is observed then perform a3 • The solution is the sub-tree that establishes that the root is solved • It defines a conditional plan (or contingency plan) that includes tests on sensory data to pick the next action

  18. OR Sub-Tree • OR sub-tree : AND-OR tree :: path : classical search tree • For each state node, only one child is included • For each action node, all children are included • It forms a part of a potential solution ifnone of its nodes is closed • A solution is an OR sub-tree in which all leaves are goal states

  19. Right Suck(R1) loop action nodes(world “decision” nodes) state nodes(agent decision nodes) AND/OR Tree

  20. Right Suck(R1) loop Left Suck(R2) Right Suck(R1) Suck(R2) loop goal goal loop loop AND/OR Tree

  21. OR Sub-Tree • OR sub-tree : AND-OR tree :: path : classical search tree • For each state node, only one child is included • For each action node, all children are included • It forms a part of a potential solution ifnone of its nodes is closed • A solution is an OR sub-tree in which all leaves are goal states

  22. Another OR Sub-Tree

  23. Another OR Sub-Tree

  24. Best-First Search • Let T be any OR sub-tree in the current AND/OR tree and f be a function that estimates the cost of the best solution sub-tree containing T, e.g., f(T) = g(T) + h(T)where g(T) is the cost of T and h(T) is an estimate of the cost from T to a solution sub-tree. • Best-first search expands a pending state node of the OR sub-tree with the smallest estimated cost • An algorithm similar to A* – AO*– is available for AND/OR trees

  25. Dealing with Revisited States • Solution #1: Do not test for revisited states Duplicated sub-trees[The tree may grow arbitrarily large even if the state space is finite] • Solution #2: Test for revisited states and avoid expanding nodes with revisited states

  26. N N’ Solution #2 – Case #1 • The state of a newly created node N is the same as the state of another node N’ that is not an ancestor of N • Merge N and N’  acyclic AND/OR graph

  27. N Solution #2 – Case #1 • The state of a newly created node N is the same as the state of another node N’ that is not an ancestor of N • Merge N and N’  acyclic AND/OR graph • Just discarding the new node would not work! Why?? • This makes it more difficult to extract OR sub-trees and manage evaluation function

  28. Solution #2 – Case #2 • The state of a newly created node N is the same as the state of a parent of N • Two possible choices: • Mark N closed • Mark N solved • In either case, the search tree will remain finite, if the state space is finite • If N is marked solved, the conditional plan may include loops • What does this mean???

  29. initialstate Right goal loop Example

  30. initialstate Right goal loop Example

  31. initialstate Right goal loop Example Marking loop nodes closed  The problem has no solution

  32. initialstate Right goal loop Example • Marking loop nodes solved • The problem has a solution But what is this solution?

  33. cyclicplan: While In(R1) do Right Example initialstate This plan requires that whenever Right is executed, there is a non-zero probability that it does the right thing The plan is guaranteed only in a probabilistic sense: the probability that it achieves the goal goes to 1 with time, but the running time is not bounded Right goal

  34. In the presence of uncertainty, it’s often the case that things don’t work the first time as one would like them to work; one must try again • Without allowing cyclic plans, many problems would have no solution • So, dealing properly with repeated states in case #2 is much more than just a matter of search efficiency!

  35. Right loop Suck(R1) Left Suck(R2) Right Suck(R1) loop goal goal loop Suck(R2) loop Left Suck(R1) Suck(R1) goal goal

  36. Right loop Suck(R1) Left Suck(R2) Right Suck(R1) loop goal goal loop Suck(R2) loop Left Suck(R1) Suck(R1) goal goal

  37. Right Suck(R1) Left Suck(R2) loop goal goal Suck(R2) loop Right Suck(R1) loop Left Suck(R1) Suck(R1) goal goal

  38. Does this always work? • No! We must be more careful • For a cyclic plan to be correct, it should be possible to reach a goal node from every non-goal node in the plan

  39. Right loop Suck(R1) Left Suck(R2) Right Suck(R1) loop goal goal loop Suck(R2) loop Left Suck(R1) Suck(R1) goal goal

  40. Does this always work? • No ! We must be more careful • For a cyclic plan to be correct, it should be possible to reach a goal node from every non-goal node in the plan •  The node labeling algorithm must be slightly modified

  41. Planning with Uncertainty in Sensing and Action

  42. Sensing Feedback • In previous, we assumed the state of the world was available to the agent after every step (feedback), and uncertainty lies in the future state of the world • “Playing a game against nature” • Sensing uncertainty demands a fundamental change to the problem setup

  43. Belief State • A belief state is the set of all states that an agent think are possible at any given time or at any stage of planning a course of actions, e.g.: • To plan a course of actions, the agent searches a space of belief states, instead of a space of states

  44. Sensor Model • State space S • The sensor model is a function SENSE: S  2Sthat maps each state s  S to a belief state (the set of all states that the agent would think possible if it were actually observing state s) • Example: Assume our vacuum robot can perfectly sense the room it is in and if there is dust in it. But it can’t sense if there is dust in the other room SENSE( ) = SENSE( ) =

  45. Righteither moves the robot right, or does nothing Leftalways moves the robot to the left, but it may occasionally deposit dust in the right room Suck picks up the dirt in the room, if any, and always does the right thing • The robot perfectly senses the room it is in and whether there is dust in it • But it can’t sense if there is dust in the other room Vacuum Robot Action Model

  46. Transition Between Belief States • Suppose the robot is initially in state: • After sensing this state, its belief state is: • Just after executing Left, its belief state will be: • After sensing the new state, its belief state will be: or if there is no dust if there is dust in R1 in R1

  47. Left Clean(R1) Clean(R1) Transition Between Belief States • Playing a “game against nature” After receiving an observation, the robot will have one of these two belief states

  48. Left Right Suck loop Suck goal goal AND/OR Tree of Belief States An action is applicable to a belief state B if its preconditions are achieved in all states in B A goal belief state is one in which all states are goal states

  49. AND/OR Tree of Belief States Left Right loop Suck goal goal Right Suck loop

  50. AND/OR Tree of Belief States Left goal Right Right Suck Suck goal

More Related