1 / 40

Silvia Ferrari and Mark Jensenius Department of Mechanical Engineering Duke University

Robust and Reconfigurable Flight Control by Neural Networks. Silvia Ferrari and Mark Jensenius Department of Mechanical Engineering Duke University. Infotech@Aerospace Crystal City, VA, September 28, 2005. On-line Control Identification Planning. Routing Scheduling. e.

jara
Download Presentation

Silvia Ferrari and Mark Jensenius Department of Mechanical Engineering Duke University

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Robust and Reconfigurable Flight Control by Neural Networks Silvia Ferrari and Mark Jensenius Department of Mechanical Engineering Duke University Infotech@Aerospace Crystal City, VA, September 28, 2005

  2. On-line • Control • Identification • Planning Routing Scheduling ... e A Multiphase Learning Approach for Automated Reasoning Supervised Learning: Reinforcement Learning: The same performance metric is optimized during both phases!

  3. Introduction • Stringent operational requirements introduce • Complexity • Nonlinearity • Uncertainty • Classical/neural synthesis of control systems • A-priori control knowledge • Adaptive neural networks • Dual heuristic programming adaptive critic architecture: Action network takes immediate control action Critic network evaluates the action network performance

  4. Motivation • Sigmoidal neural networks for control: coping with complexity • Applicability to nonlinear systems • Applicability to multivariable systems • Batch and incremental training • Closed-loop stability and robustness by IQCs • Constrained training for robust adaptation on line

  5. Modeling Linearizations Initialization Linear Control Full Envelope Control! On-line Training Design Approach

  6. Lift Drag mg q a p r XB YB b V ZB Thrust Nonlinear Dynamical System Full-scale Aircraft Simulation: State vector: Control vector: Vector of parameters: Output vector: In particular:

  7. k  ( ) Classical Control Design Linearizations: Flight envelope and design points: (g = m = b = 0) Altitude (m) Classical linear designs: • Multivariable control (PI) • Multi-objective synthesis (LMI) Velocity (m/s)

  8. Input-to-node variable n1 w11 1 p1 p2 . . . pq v1 d1 1 z v2 n2 2 . . . d2 b vs . . . 1 1 ns s wsq Gradient equations: ds 1 vis'(ni)wij, j = 1, ..., q One-hidden Layer Sigmoidal Neural Network Output:z = NN(p) Input: p Adjustable parameters: W,d, v Output equations: z = vTs[Wp + d] s - Hidden nodes

  9. Known neural network.. Gradient Output Input uk = vT s[Wxk + d] u = Sv ck = BkW (ck)T = WT {v s[Wxk + d]} General Algebraic Training Approach Training set: Requirements: Output and Gradient Initialization Equations:

  10. Assume each input-to-node variable,is a known constant Then, n is known and the initialization equations can be written as: Gradient-based Algebraic Training Linear algebraic initialization equations: where: ; to be solved for wa ; to be solved for v ~ ; to be solved for wx Vec operator n: vector of all input-to-node constants, nik c: vector of feedback gains b: output bias vector

  11. A: (p2 3s) sparse matrix of scheduling variables S: (ps) matrix of sigmoidal functions of n X: (npns) sparse matrix evaluated from v and n k = 1, 2, ..., p where: Initialization Matrices

  12. Comparison of Initialized PI NN and Linear Controllers Aircraft Response to Climb-Angle Command Input, at Interpolating Conditions (H0, V0) = (2Km, 95 m/s) Velocity (m/s) Linear Control Initialized Neural Network Control Large-Angle Maneuver Small-Angle Maneuver Climb Angle (deg) Time (sec)

  13. w G(s) v  Stability Analysis via Integral Quadratic Constraints (IQCs) Standard feedback interconnection between a transfer matrix G(s) or LTI system, and a causal bounded operator : IQC Stability Theorem: If there exists  > 0, such that, , then the interconnection is stable. Equivalent LMI feasibility problem with positive, real parameters pi and symmetric matrix P:

  14. BN = BV, CN = WCa Closed-loop Stability of Neural Network Controller Closed-loop system comprised of NN controller and LTI model, Lure-type System Applying the IQC Stability Theorem:  is a bdd, causal diagonal operator with repeated nonlinearities that are monotonically non-decreasing, slope-restricted, and belong to [0, 0.5]. Thus, the stability of the NN controlled system is guaranteed if there exists constant symmetric matrices M, Pss that satisfy the following LMIs: i = 1, …, s

  15. Adaptive Critic On-line Adaptation ys(t) NNC l = dV/dxa + x(t) + _ NNA _ + x(t) yc + uc u(t) NNF e a SVG xc CSG

  16. J* V* terminal cost terminal cost t0 t tf t0 t tf Dynamic Programming Approach By The Principle of Optimality, b V*abc = Vab + V*bc c a the minimization of J can be imbedded in the minimization of V(t): Time

  17. = NNC Target at t NNA Target at t Dual Heuristic Programming Recurrence relation [Howard, 1960] : Action network criterion (optimality condition): Critic network criterion:

  18. NN Target Generation e + w(t) = w0 NN wl+1 = wl +Dwl wl wl+1 RProp w(t + 1) Action/Critic Network On-line Learning, at Time t The (action/critic) network must meet its target, E Network performance e  Network error w  Network weights Modified Resilient Backpropagation (RProp)minimizes E w.r.t. w:

  19. Adaptive Critic Neural Control: Fixed Neural Control: Command Input: Adaptive vs. Fixed NN ControllersDuring a Coupled Maneuver Aircraft response, (H0, V0) = (2 Km, 95 m/s) Velocity (m/s) Climb Angle (deg) Roll Angle (deg) Sideslip Angle (deg) Time (sec)

  20. Adaptive Critic Neural Control: Fixed Neural Control: Command Input: Adaptive vs. Fixed NN ControllersDuring a Large-Angle Maneuver Aircraft response, (H0, V0) = (7 Km, 160 m/s) Velocity (m/s) Climb Angle (deg) Roll Angle (deg) Sideslip Angle (deg) Time (sec)

  21. Adaptive vs. Fixed NN Controllers During a Large-Angle Maneuver Control history, (H0, V0) = (7 Km, 160 m/s) Fixed Neural Control T (%) Adaptive Critic Neural Control S (deg) Trajectory Altitude (m) A (deg) R (deg) Time (sec) East (m) North (m)

  22. Command Input Fixed Neural Control Fixed Neural Controller Performancein the Presence of Control Failures • Control Failures: • T = 0, 0 t 10 sec • S = 0, 5 t 10 sec • R = –34o, t 5 • R = 0, 5 t 10 sec Aircraft response, (H0, V0) = (3 Km, 100 m/s) V (m/s)  (deg) Control history  (deg) T (%)  (deg) S (deg)  (deg) A (deg) Time (sec) R (deg) Time (sec)

  23. Adaptive vs. Fixed NN Controllersin the Presence of Control Failures Control history, (H0, V0) = (7 Km, 160 m/s) • Control Failures: • (10 t 15 sec) • Tmax = 50% • R = – 15o T (%) S (deg) Fixed Neural Control A (deg) Adaptive Critic Neural Control R (deg) Time (sec)

  24. Adaptive Critic Neural Control: Fixed Neural Control: Command Input: Adaptive vs. Fixed NN Controllersin the Presence of Control Failures Aircraft response after t = 10 sec, (H0, V0) = (3 Km, 100 m/s) Velocity (m/s) Climb Angle (deg) Roll Angle (deg) Sideslip Angle (deg) Angle of Attack (deg) Yaw Angle (deg) Time (sec)

  25. V M1 WR ~ ~ or xa u ~ ~ b a1 a2 a WA M2 1 Robust Adaptation: Constrained Algebraic Training

  26. Neural Network Weights Partitioning , b, A, WA constrained weights unconstrained weights construction functions Zero Randomized Design points Hyperspherical initialization

  27. Interpolation Point

  28. Linear Non-adapting Neural Adapting Neural Controller Performance at Interpolation Point

  29. Linear Non-adapting Neural Adapting Neural Controller Performance at Interpolation Point

  30. Linear Non-adapting Neural Adapting Neural On-line Cost Optimization through Adaptation

  31. Adaptive NN Controller Performance at Design Points

  32. Extrapolation Point

  33. Linear Non-adapting Neural Adapting Neural Controller Performance at Extrapolation Point

  34. Linear Non-adapting Neural Adapting Neural Controller Performance at Extrapolation Point

  35. Summary of Results • Properties of learning control system: • Improves global performance • Lends itself to stability and robustness analysis via IQCs • Preserves prior knowledge through constrained training • Suspends and resumes adaptation, as appropriate Future work: • Computational complexity • Aircraft system identification by neural networks • Stochastic effects • Optimal estimation Acknowledgment: This research is funded by the National Science Foundation.

  36. Robust and Reconfigurable Flight Control by Neural Networks Silvia Ferrari Department of Mechanical Engineering Duke University Many Thanks to: Mark Jensenius

  37. Backup Slides

  38. NNC P NNI CI CF, f[•] = 0 NNB SVG CB : Algebraic Initialization, : On-line Training. Proportional-Integral Neural Network Controller ys(t) + x(t) uI(t) - + yc(t) x(t) uc(t) u(t) NNF + + + uB(t) a(t) - e(t) xc(t) CSG

  39. zB(t) NNB a Feedback Neural Network Initialization Linear optimal control law: Initialization Requirements: At each design point (k), (R1) (R2)

  40. j = 1, 2, ..., q Network gradient: where is the lth-row of the matrix Development of Feedback Initialization Equations Network output: where and Feedback Neural Network Initialization Equations: l = 1, 2, ..., m

More Related