Game Theoretic Approaches to Solve H  Problems

Alex Feng Joint work with Brian D. O. Anderson, Alexander Lanzon and Michael Rotkowitz Game Theoretic Approaches to Solve H Problems

Global Motivation • Some H problems give Riccati equations which cause standard solvers to break down • We give a cure • The cure is extendable to linear periodic H problems and nonlinear game theory problems • This may be useful if there are numerical problems • Solution procedures are very few anyway, and numerically not well understood.

Outline • Global Motivation • Solving Continuous-Time H Riccati equations • Nonlinear Extension • Solving Discrete-Time H Riccati equations • Conclusions and Future Work

Outline • Global Motivation • Solving H Riccati equations • Detailed Motivation • Solving Riccati Equations, Kleinman and its repair • Algorithm Convergence • Game Theoretic Interpretation • Nonlinear Extension • Solving Discrete-Time Riccati Equations • Conclusions and Future Work

Detailed Motivation • Software to solve Asymptotic Riccati Equations arising in H2 problems is standard • The software can collapse on certain problems • The Kleinman algorithm will save the day: • Recursive • Requires Lyapunov equation solutions • Requires stabilizing gain to initialize • Converges quadratically (it is a Newton algorithm) • It does not extend to H equations (indefinite quadratic term) • What can we do?

Solving Riccati Equations Direct methods The example on the next transparency shows what can go wrong with an H-infinity Riccati equation if we use some direct methods. Computational disadvantages A numerical example

Numerical Problem Direct methods Too Large !! Direct methods problems are an old difficulty! It is a motivation for using the Kleinman algorithm in the LQ case.

Solving Riccati Equations Direct methods Iterative methods H2 control problem : Definite R Traditional Newton methods Computational disadvantages A numerical example LQ problem Solve AREs with R>0 Difficult to choose an initial condition Kleinman algorithm

Kleinman algorithm for H2 Question: Can we use Kleinman algorithm to solve H Riccati equations directly?

Diverge !! Kleinman algorithm for H The Kleinman algorithm cannot be used to solve H Riccati equations directly!

Solving Riccati Equations Direct methods Iterative methods H2 control problem : Definite R H control problem : Indefinite R Traditional Newton methods Computational disadvantages New algorithm ?? A numerical example LQ problem Solve AREs with R>0 Difficult to choose an initial condition Kleinman algorithm

Sign-indefinite Quadratic Term Problem setting

CARE Algorithm Sign indefinite quadratic term Simple initial choice Sign definite quadratic term Recursive algorithm using LQ Riccati equations , not Lyapunov equations!

Algorithm: A Summary Convert A sequence of H2 CAREs An H CARE Our algorithm Convert A sequence of less difficult problem A more difficult problem

Convergence • Global convergence is guaranteed provided the H control problem is solvable A monotone increasing matrix sequence is constructed to approximate the stabilizing solution of an H ARE. • Local quadratic rate of convergence A typical feature of Newton’s method

Game theoretic interpretation • Recall • Player u: minimize J; player w: maximize J. is the monotone increasing matrix sequence in our algorithm • Strategies for player u and w: uk+1 solves LQ, not game theory problem, when fixed wk is being used

Our algorithm Reduce an ARE with an indefinite quadratic term to a series of AREs with a negative quadratic term; Simple choice of the initial condition; A monotone non-decreasing matrix sequence. Easy Newton method (Kleinman) Reduce an ARE with an positive definite quadratic term to a series of Lyapunov equations; Difficult to choose an initial condition; A monotone non-increasing matrix sequence. Comparison of results For LQ game problems For LQ H2 problems

Periodic Equations--H2 • Some problems are periodic, e.g. satellite control • H2 periodic Riccati equations potentially have stabilizing periodic solution • Computational procedures are current research topic • Kleinman-likealgorithm for time-varying Riccati equations over a finite interval predates Kleinman algorithm • Kleinman-like algorithm for periodic time-varying Riccati equations over infinite interval yields stabilizing periodic solution as limit of solution of periodic Lyapunov differential equations

Periodic Equations--H • H periodic Riccati equations potentially have stabilizing periodic solution • Solution can be found by solving a sequence of H2 periodic Riccati equations (Kleinman-like algorithm will work for each one of these) • Game theoretic interpretation exists • No surprises in relation to the time-invariant case: • Local Quadratic Rate of Convergence. The algorithm carries over for the periodic case!!

Outline • Global Motivation • Solving H Riccati equations • Nonlinear Extension • Isaacs and HJB equations • Recursive solution of Isaacs via HJB equations • Quadratic convergence and game theoretic interpretation • Solving DARE • Conclusions and Future Work

Disturbance input HJB Isaacs One player game Nonlinear H-infinity control problem LQ Nonlinear Nonlinear optimal control LQ Game Riccati equations Linear H-infinity control problem LQ problem LQ and generalization

Summary of nonlinear result Hproblem solution H2 problem solution Iteration Linear-quadratic to Nonlinear-nonquadratic Linear-quadratic to Nonlinear-nonquadratic Isaacs problem solution HJBproblem solution Iterationcan be found 22

Problem Setting Sign indefinite term

Isaacs Nice initial condition HJB Nonlinear algorithm Question: How to solve HJB?

Solving HJB Hproblem solution H2 problem solution Kleinman iteration Isaacsproblem solution HJBproblem solution Iteration Linear PDE Iteration • Linear PDE iteration to give HJB solution stems from 1967 approx, though not carefully done for infinite time problem. Iteration exists • Other methods to solve HJB do exist

Convergence • Local convergence is guaranteed provided the H infinity control problem is locally solvable A monotone non-decreasing function sequence is constructed to approximate the stabilizing solution of an Isaacs equation. • Local quadratic rate of convergence

Game theoretic interpretation • Recall • Player u: minimize J; player w: maximize J. • Vk: The monotone increasing functionsequence in algorithm • Strategies for players u and w:

Our algorithm • Local quadratic rate of convergence • Simple initial choice • A natural game theoretic interpretation • Supposed to have a higher numerical accuracy and reliability Comparison Results Existing methods • Taylor Expansion Method • Stability not guaranteed • Converges slowly • Galerkin Approximation Method • Difficult to initialize • The Method of Characteristics • Converges slowly • Other Methods

Example (van der Schaft) Figure compares exact solution, iterations from method of characteristics, and iterations from our method.

A numerical example (continued)

Outline • Global Motivation • Solving H Riccati equations • Nonlinear Extension • Solving H Discrete-Time Riccati Equations • Discrete game problem • Mappings between generalized DARE and generalized CARE • The algorithm to solve DARE • Conclusions and Future Work

Discrete game problem

Discrete game problem Matrix inverse includes unknown variable P !!!

Mappings between DARE and CARE Generalized DARE • Why mapping?? • Stem from approximate 1960’s • Mappings between generalizedCARE and generalized DARE Generalized CARE

Mappings between DARE and CARE Assumption: Matrix A has no eigenvalue at -1 Other mappings exist !! LHS: parameters and solution of CARE RHS: parameters and solution of DARE

A Summary for Mappings Parameters of DARE Parameters of CARE Mapping Solution of DARE Solution of CARE

Mappings between DARE and CARE DARE we want to solve recursively Mappings exist !! CARE we have solved recursively

Algorithm to Solve DARE Given a DARE Mapping DARE into CARE Obtain the monotone sequence of the CARE Generate the monotone sequence of the DARE Compute the stabilizing solution of the DARE

Algorithm to solve DARE Monotone sequence in DARE Monotone sequence in CARE Mapping Game Theory Interpretation Game Theory Interpretation

Property of DARE Algorithm • Recursive • Simple initialization • Global convergence • Local quadratic rate of convergence • Game Theoretic Interpretation Properties of CARE algorithm Carry over !!

Conclusions and future work Conclusions • We developed a new algorithm to solve Riccati equations(Isaacs equations) arising in H control • We proved the global (local) convergence and local quadratic rate of convergence of our algorithm; • Our algorithm has a natural game theoretic interpretation.

Conclusions and future work Future work • More general Isaacs equations • Time-varying, periodic, other system structure • Viscosity case • Nonsmooth solutions • Stochastic case • Random noise in system, modifies Isaacs • Zero(Nonzero)-sum Multi-player game • Coupled HJB/Isaacs, but iteration style similar in existing algorithms • Discrete Isaacs equations

Acknowledgement • Andras Varga • Marco Lovera • Matt James • Minyi Huang • Weitian Chen

Game Theoretic Approaches to Solve H  Problems

Game Theoretic Approaches to Solve H  Problems

Presentation Transcript

Approaches to Psychology

Game-theoretic analysis tools

Game-Theoretic Analysis of Network Quality-of-Service Pricing

Review of: Detecting Network Intrusions via Sampling: A Game Theoretic Approach

Strategic Speech: Evidence for a Game-theoretic Model of Indirect Speech

Gamification Design and Analytics

Modeling Resource Sharing Dynamic of VoIP users Over a WLAN Using a Game-Theoretic Approach

Solve:

A Game-Theoretic Perspective on Registration and Recognition of 3D Shapes

Applied Game Theory Lecture 3

3.4 Using Equations to Solve Problems

Research Objectives

A Game Theoretic Approach to Geographic Profiling

A Game-theoretic Analysis of Catalog Optimization

How can drawing pictures help you solve tough word problems?

On the Optimal Placement of Mix Zones: a Game- Theoretic Approach

Game-Theoretic Selection of Paths in Motion-based Games

Selfish Caching in Distributed Systems A Game-Theoretic Analysis

Gamification in JCPS

A Game-Theoretic Analysis of Clausal Linkage in English and Japanese

Towards a Game-Theoretic Framework for Information Retrieval