Applied machine learning in game theory

Dmitrijs Rutko • Faculty of Computing, University of Latvia • Joint Estonian-Latvian Theory Days at Rakari, 2010

Topic outline • Game theory • Game Tree Search • Fuzzy approach • Machine learning • Heuristics • Neural networks • Adaptive / Reinforcement learning • Card games

Research overview • Deterministic / stochastic games • Perfect / imperfect information games

Finite zero-sum games

Game trees

Classical algorithms • MiniMax • $O(w^d)$ • Alpha-Beta • $O(w^{d/2})$

[Figure: game tree. Root (max): 8; min level: 2, 8; max level: 2, 7, 8, 9; leaves: 1 2 7 4 3 6 8 9 5 4; leaf marks: √ √ √ Χ Χ √ √ √ Χ Χ]
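The two classical algorithms can be sketched in Python as follows. This is a minimal illustration, not the author's code. The nested-list tree is an assumption reconstructed from the figure values: the leaf grouping (1 2 | 7 4 3 | 6 8 | 9 5 4) is inferred from the node values and the leaf marks, and reading √ as "visited" and Χ as "cut off" is likewise an assumption. The names `TREE`, `minimax`, and `alphabeta` are illustrative.

```python
import math

# Assumed tree: max root, two min nodes, four max nodes, integer leaves.
TREE = [[[1, 2], [7, 4, 3]], [[6, 8], [9, 5, 4]]]


def minimax(node, maximizing, visited):
    """Plain minimax: evaluates every leaf and records each one in `visited`."""
    if isinstance(node, int):
        visited.append(node)
        return node
    values = [minimax(child, not maximizing, visited) for child in node]
    return max(values) if maximizing else min(values)


def alphabeta(node, maximizing, alpha, beta, visited):
    """Alpha-beta search: stops expanding a node once alpha >= beta."""
    if isinstance(node, int):
        visited.append(node)
        return node
    best = -math.inf if maximizing else math.inf
    for child in node:
        value = alphabeta(child, not maximizing, alpha, beta, visited)
        if maximizing:
            best = max(best, value)
            alpha = max(alpha, best)
        else:
            best = min(best, value)
            beta = min(beta, best)
        if alpha >= beta:
            break  # remaining children cannot change the result
    return best


seen_all, seen_ab = [], []
print(minimax(TREE, True, seen_all), len(seen_all))
print(alphabeta(TREE, True, -math.inf, math.inf, seen_ab), seen_ab)
```

Under these assumptions both searches return the same root value, while alpha-beta skips the leaves 4, 3, 5, and 4.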

Advanced search techniques • Transposition tables • Time efficiency / high cost of space • PVS • Negascout • NegaC* • SSS* / DUAL* • MTD(f)

Fuzzy approach • $O(w^{d/2})$ • More cut-offs

[Figure: game tree with fuzzy bounds. Root (max): ≥5; min level: <5, ≥5; max level: <5, ?, ≥5, ≥5; leaves: 1 2 7 4 3 6 8 9 5 4; leaf marks: √ √ Χ Χ Χ √ Χ √ Χ Χ]
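One way to read the fuzzy approach is as a yes/no question asked of each node: "is the value at least 5?". The sketch below illustrates that reading under stated assumptions. The tree layout is inferred from the figure values and marks, and the function name `at_least` is hypothetical. A max node satisfies the test as soon as one child does, and a min node fails it as soon as one child fails, which is where the extra cut-offs come from.

```python
# Assumed tree: max root, two min nodes, four max nodes, integer leaves.
TREE = [[[1, 2], [7, 4, 3]], [[6, 8], [9, 5, 4]]]


def at_least(node, threshold, maximizing, visited):
    """Return True if the minimax value of `node` is >= threshold.

    any() and all() short-circuit on generators, so children that cannot
    change the answer are never visited.
    """
    if isinstance(node, int):
        visited.append(node)
        return node >= threshold
    if maximizing:
        return any(at_least(c, threshold, False, visited) for c in node)
    return all(at_least(c, threshold, True, visited) for c in node)


seen = []
print(at_least(TREE, 5, True, seen), seen)
```

With the assumed tree and a threshold of 5, only the leaves 1, 2, 6, and 9 are examined, compared with six leaves for a full alpha-beta search of the same tree.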

Geometric interpretation • 1) X2 - successful separation • 2) X1 or X3 - reduced search window • α = X1 • β = X3

[Figure: number line showing α, β, the values 2 and 8, and the test values X1, X2, X3]
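The geometric picture can be turned into a small search loop. The sketch below is an illustration under assumptions, not the author's implementation: it assumes integer leaf values, a half-open window [α, β) that contains the best root value, and a midpoint guess for the test value X (the slides do not say how X is chosen). The names `at_least` and `separate` are hypothetical.

```python
def at_least(node, threshold, maximizing):
    """Return True if the minimax value of `node` is >= threshold."""
    if isinstance(node, int):
        return node >= threshold
    if maximizing:
        return any(at_least(c, threshold, False) for c in node)
    return all(at_least(c, threshold, True) for c in node)


def separate(root, alpha, beta):
    """Return the index of the best move at a max root.

    Assumes the best sub-tree value v satisfies alpha <= v < beta.
    Each child of `root` is treated as a min node.
    """
    candidates = list(range(len(root)))
    while beta - alpha >= 2:
        test = (alpha + beta) // 2  # assumed guess: midpoint of the window
        better = [i for i in candidates if at_least(root[i], test, False)]
        if len(better) == 1:
            return better[0]        # case X2: successful separation
        if better:
            alpha = test            # case X1: several sub-trees reach X
            candidates = better
        else:
            beta = test             # case X3: no sub-tree reaches X
    # Window collapsed: every best move has value alpha (ties are possible).
    return next(i for i in candidates if at_least(root[i], alpha, False))


# Two sub-trees with values 2 and 8, as in the figure.
print(separate([[[1, 2], [7, 4, 3]], [[6, 8], [9, 5, 4]]], 0, 10))
```

In the two-sub-tree example the first test value, 5, falls between 2 and 8, so the second sub-tree is separated immediately without computing its exact value.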

BNS enhancement through self-training • Traditional statistical approach

Two dimensional game sub-tree distribution

Statistical sub-tree separation

Experimental results. 2-width trees

Experimental results. 3-width trees

Future research directions in game tree search • Multi-dimensional self-training • Wider trees • Real domain games

Games with an element of chance

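Expectiminimax algorithm

\[
\mathrm{Expectiminimax}(n) =
\begin{cases}
\mathrm{Utility}(n) & \text{if } n \text{ is a terminal state} \\
\max_{s \in \mathrm{Successors}(n)} \mathrm{Expectiminimax}(s) & \text{if } n \text{ is a max node} \\
\min_{s \in \mathrm{Successors}(n)} \mathrm{Expectiminimax}(s) & \text{if } n \text{ is a min node} \\
\sum_{s \in \mathrm{Successors}(n)} P(s) \cdot \mathrm{Expectiminimax}(s) & \text{if } n \text{ is a chance node}
\end{cases}
\]

• $O(w^d c^d)$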
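The four cases of the definition map directly onto a recursive function. The sketch below is a minimal illustration. The node representation is an assumption made for the example: leaves are plain numbers, inner nodes are `(kind, children)` tuples, and the children of a chance node are `(probability, child)` pairs.

```python
def expectiminimax(node):
    """Evaluate a game tree that contains max, min, and chance nodes."""
    # Terminal state: the leaf itself holds Utility(n).
    if isinstance(node, (int, float)):
        return node
    kind, children = node
    if kind == "max":
        return max(expectiminimax(child) for child in children)
    if kind == "min":
        return min(expectiminimax(child) for child in children)
    if kind == "chance":
        # Probability-weighted average over the chance outcomes.
        return sum(p * expectiminimax(child) for p, child in children)
    raise ValueError(f"unknown node kind: {kind}")


# Hypothetical example: the max player chooses between two chance nodes.
example = ("max", [
    ("chance", [(0.5, ("min", [2, 6])), (0.5, 4)]),
    ("chance", [(0.5, 0), (0.5, 10)]),
])
print(expectiminimax(example))
```

In this example the first chance node is worth 0.5 · 2 + 0.5 · 4 = 3 and the second 0.5 · 0 + 0.5 · 10 = 5, so the max player prefers the second.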

Performance in Backgammon • Source: *-Minimax Performance in Backgammon, Thomas Hauk, Michael Buro, and Jonathan Schaeffer

Backgammon • Evaluation methods • Static – pip count • Heuristic – key points • Neural Networks
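As an illustration of the static method, the pip count is the total distance a player's checkers still have to travel to be borne off. The sketch below assumes a hypothetical board representation: a dictionary from point number (1 to 24, numbered from the player's own side) to checker count, with checkers on the bar counted as 25 pips each. The names `pip_count` and `START` are illustrative.

```python
def pip_count(checkers_on_point, checkers_on_bar=0):
    """Sum of point number times checker count, plus 25 per checker on the bar."""
    total = 25 * checkers_on_bar
    for point, count in checkers_on_point.items():
        if not 1 <= point <= 24:
            raise ValueError(f"invalid point number: {point}")
        total += point * count
    return total


# Standard starting position seen from one player's side.
START = {24: 2, 13: 5, 8: 3, 6: 5}
print(pip_count(START))
```

A lower pip count means the player is ahead in the race; a static evaluation can compare the two players' counts.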

Temporal difference (TD) learning • Reinforcement learning • Prediction method
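As a prediction method, TD learning moves the value estimate of a state toward the reward plus the estimate of the following state. The sketch below shows the simplest tabular form, TD(0). This is an assumption chosen for brevity: the slides do not state which TD variant was used, and the experiments use a neural network rather than a table. The function name and the parameters `alpha` (learning rate) and `gamma` (discount factor) are illustrative.

```python
def td0_update(values, state, reward, next_state, alpha=0.1, gamma=1.0):
    """Apply one TD(0) update to the table `values` and return the TD error.

    V(s) <- V(s) + alpha * (reward + gamma * V(s') - V(s))
    Unseen states are treated as having value 0.0.
    """
    current = values.get(state, 0.0)
    target = reward + gamma * values.get(next_state, 0.0)
    error = target - current
    values[state] = current + alpha * error
    return error


values = {"won": 1.0}  # hypothetical terminal state with a known value
td0_update(values, "near_win", 0.0, "won")
print(values["near_win"])
```

Repeating such updates over many self-play games propagates the final outcomes back to earlier positions.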

Experimental setup • Multi-layer perceptron • Representation encoding • Raw data (27 inputs) • Unary (157 inputs) • Extended unary (201 inputs) • Binary (201 inputs) • Training game series – 400 000 games
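The difference between unary and binary input encodings can be shown on a single checker count. The sketch below is only an illustration of the general idea. The slides do not give the actual input layouts behind the 157 or 201 inputs, so the widths and the function names `unary_encode` and `binary_encode` are assumptions.

```python
def unary_encode(count, width):
    """Thermometer code: the first `count` inputs are 1, the rest are 0.

    Counts larger than `width` saturate at all ones.
    """
    count = min(count, width)
    return [1 if i < count else 0 for i in range(width)]


def binary_encode(count, width):
    """Fixed-width binary code, most significant bit first.

    Bits above `width` are dropped, so `width` must be large enough.
    """
    return [(count >> i) & 1 for i in reversed(range(width))]


print(unary_encode(3, 5))
print(binary_encode(5, 4))
```

A unary code uses more inputs per value, but neighbouring counts differ in only one input, which can be easier for a network to learn from than a compact binary code.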

Learning results

Program “DMBackgammon”

Artificial Intelligence and Poker* * Joint work with Annija Rupeneite

Questions? dim_rut@inbox.lv
