1 / 9

On Playing Games without Knowing the Rules

On Playing Games without Knowing the Rules. Advisor : Dr. Hsu Presenter : Jia-Hao Yang Author :Denis V. Batalov and B. John Oommen. Outline. Motivation Objective Method Experience Conclusion. Motivation.

lysa
Download Presentation

On Playing Games without Knowing the Rules

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. On Playing Games without Knowing the Rules Advisor : Dr. Hsu Presenter : Jia-Hao Yang Author :Denis V. Batalov and B. John Oommen

  2. Outline • Motivation • Objective • Method • Experience • Conclusion

  3. Motivation • We know that one of the interesting areas in AI is to teach machine to play a game against an educated opponent. • But if the machines don’t know the rule of the game?

  4. Objective • This paper will show that the machine will learns the rules of the game, tic-tac-toe, and strategy just as paper’s title say.

  5. Method • To accomplish this goal, we assume that the LM interacts with an environment. • Sense-act-learn procedure • Agent-Environment Interaction Protocol (AEip) • AEip • Because we use JAVA to implement this platform, so we call it JAGUAR

  6. EX Method • AEip specification of Tic-tac-toe • Reinforcement • Doesn’t end the game : -1 • Win & Lose & Tie: + 10 & -10 & +5 • Learning algorithm • Q-learning • Select mathod : If t = 0.1=> P = ∞ (greedy) If t = ∞ => p = 1/j (random)

  7. Experiment • This paper just underscore two set of results • The agents were selecting their actions simultaneously

  8. Experiment • How much faster the agents learn to play when they allowed to make a move on their own turn

  9. Conclusion • In this paper we have specified a novel framework and show how an agent can learn to play a new game without any prior knowledge.

More Related