1 / 43

Efficient algorithms for online decinsion problems

Efficient algorithms for online decinsion problems. Adam Kalai , Santosh Vempala. Seminar on Experts and Bandits, Fall 17/18. Ran Hochshtet. Contents. Online decision problem N Experts Online shortest paths Tree update problem. Introduction. Online decision problem

cutshaw
Download Presentation

Efficient algorithms for online decinsion problems

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Efficient algorithms foronline decinsion problems Adam Kalai, Santosh Vempala Seminar on Experts and Bandits, Fall 17/18 Ran Hochshtet

  2. Contents • Online decision problem • N Experts • Online shortest paths • Tree update problem

  3. Introduction • Online decision problem • No knowledge of the future • Each period we pick choice • Pay • Goal: Minimize the regret

  4. Linear generalization • Series of decisions from infinite set • leads to state • Making decision in state costs • Total cost =

  5. Linear generalization • - state • - decision • M computes the best single decision in hindsight

  6. Predicting from expert advice • experts • Each period we pick expert • Pay • Goal: Minimize the regret

  7. Predicting from expert advice • experts problem • - the costs vector

  8. Motivation • Consider example with two experts • The costs are: • Follow the leader always incurs cost of 1The total cost is • Using perturbations we can achieve

  9. On each period t: • Choose uniformly at random from the cube • Use

  10. On each period t: • Choose at random according to the exponential distribution • Use

  11. Notations • for • for all • for all • for all

  12. Theorem 1.1 • Let be a state sequence • (a) Running with gives:

  13. Theorem 1.1 • Let be a state sequence • (b) For nonnegative , gives:

  14. Theorem 1.1 • If or are known: • For FPL:

  15. Experts problem • It seems that , • In our algorithm the worst case is wheneach period only one expert incurs cost Min-cost is b After we choose b, there is a chance we choose c

  16. Experts problem • , • By Theorem (b):

  17. Online shortest paths • Input • Directed Graph - • Pair of nodes • Each period pick path from to • Then times on all edges are revealed • The cost is the sum of times on the chosen path

  18. Online shortest paths • is the number of edges • - the times vector

  19. Online shortest paths • Use • On each period • For each edge pick from exp. Distribution (same as ) • = the total times on edge so far • Use shortest path in the graph with weights

  20. Online shortest paths • , • By Theorem :

  21. Proof of Theorem - • “Be the leader” – use instead of • “Be the leader” has no regret • Prove by induction

  22. Proof of Theorem - • We show that: • For – trivial • Induction step from to :

  23. Lemma • We want to show that perturbations do not hurt too much • Still “be the leader” algorithm • For any state sequence , any and any vectors

  24. Proof of Lemma • Pretend

  25. Proof of Lemma

  26. Proof of Theorem - • Use , for all • No need to choose new each period • Applying Lemma :

  27. Proof of Theorem - • Now we return to use instead of • We need to show that:

  28. Proof of Theorem - • Key idea: the distributions over and are similar • If the cubes are identical, i.e. , then • If they overlap on fraction of their volume:

  29. Lemma • For any , the cubes and overlap in at least a fraction

  30. Proof of Lemma • Take a random point ,if , then for some , • With union bound we get:

  31. Proof of Theorem - • By lemma : • Each period the difference between using and is at most • We get:

  32. The tree update problem • Maintain a binary search tree over items • There is an unknown sequence of accesses • The cost is the number of comparisons • Equals to the depth in the tree

  33. The tree update problem • We can solve the problem with • Each period we find the best tree so far, and use it • The problem: • For each access we do expensive computation

  34. The tree update problem • Follow the lazy leading tree: • For , let and choose randomly from • Start with best tree as if there were accesses to node • After each access to item :(a) (b) if theni. ii. Compute best tree as if there were accesses to node

  35. Calling the oracle can be a computationally expensive • We want to minimize the numbers of times we use • Trick: use as often as possible

  36. is equivalent to in terms of expected cost • rarely calls the oracle • rarely changes decision from one period to the next

  37. Once, choose uniformly • Determine a grid: • On period . Use where is the unique point in • If - no need to re-evaluate

  38. Lemma • For any fixed sequence of states and (also and ) have identical expectations on each period . • The probability of (or ) performing an update is at most .

  39. Proof of Lemma • chooses a uniformly random grid of spacing • There is exactly one grid point inside • By symmetry is uniformly distributed over • Same as - uniform over

  40. Lemma 3.2:For any , the cubes and overlap in at least a fraction Proof of Lemma • In each update: • The grid point of is not in the cube • By lemma 3.2:

  41. Summary • Online decision problem • N Experts • Online shortest paths • Tree update problem

  42. THANKS! ANY QUESTIONS?

More Related