1 / 24

Frank L. Lewis and Murad Abu-Khalaf Advanced Controls, Sensors, and MEMS (ACSM) group

Automation & Robotics Research Institute (ARRI). Nonlinear Network Structures for Optimal Control. Frank L. Lewis and Murad Abu-Khalaf Advanced Controls, Sensors, and MEMS (ACSM) group. System. Cost. The Usual Suspects. NONLINEAR QUADRATIC REGULATOR. Generalized HJB Equation.

eunice
Download Presentation

Frank L. Lewis and Murad Abu-Khalaf Advanced Controls, Sensors, and MEMS (ACSM) group

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Automation & Robotics Research Institute (ARRI) Nonlinear Network Structures for Optimal Control Frank L. Lewis and Murad Abu-Khalaf Advanced Controls, Sensors, and MEMS (ACSM) group

  2. System Cost The Usual Suspects

  3. NONLINEAR QUADRATIC REGULATOR Generalized HJB Equation Optimal Control (SVFB) Hamilton-Jacobi-Bellman (HJB) Equation

  4. PROBLEM- HJB usually has no analytic solution SOLUTION- Successive Approximation a stabilizing control A contraction map (Saridis) Saridis and Beard used Galerkin Approx to allow for GHJB solution Converges to optimal solution Gives u(x) in SVFB form

  5. For Constrained Controls NONLINEAR NONQUADRATIC REGULATOR with Nonquadratic form- Lyshevsky PD if u

  6. New GHJB is Natural, exact, no approximation u(t) constrained if f(.) is a saturation function! tanh(p) 1 p -1

  7. Problem- cannot solve HJB Solution- Use Successive Approximation on GHJB Iterate: a stabilizing control

  8. 1 VT WT (.) x1 (.) y1 2 (.) x2 (.) y2 3 (.) xn (.) ym L inputs outputs (.) hidden layer Two-Layer Neural Network with adjustable output weights Problem- Cannot solve GHJB! Solution- Neural Network to approximate V(i)(x) Select basis set

  9. Cost gradient approximation Let Nonzero residual! Then GHJB is

  10. Neural-network-based nearly optimal saturated control law.

  11. To minimize the residual error in a LS sense Evaluate the GHJB at a number of points on Note, if Then, GHJB is

  12. Evaluating this at N points gives L x N coefficient matrix Solve by LS NN Training Set!

  13. Select the N sample points xk Uniform Mesh Grid in Random selection- Montecarlo Approximation error is (Barron) Approximation error is Montecarlo overcomes NP-complexity problems!

  14. ASIDE- Useful for reducing complexity of fuzzy logic systems? Uniform grid of Separable Gaussian activation functions for RBF NN

  15. NN Training Set must be PE

  16. Algorithm and Proofs work for any Q(x) in Constrained input given by CONSTRAINED STATE CONTROL k large and even MINIMUM-TIME CONTROL For small R and this is approx.

  17. Example: Linear system

  18. Region of asymptotic stability for the initial controller,

  19. Region of asymptotic stability for the nearly optimal controller,

  20. Example: Nonlinear oscillator

More Related