Abstract: The missile interception problem can be regarded as a two-person zero-sum differential game, whose solution depends on solving the Hamilton-Jacobi-Isaacs (HJI) equation. It has been proved impossible to obtain a closed-form solution because of the nonlinearity of the HJI equation, and many iterative algorithms have been proposed to solve it. The simultaneous policy updating algorithm (SPUA) is an effective algorithm for solving the HJI equation, but it is an on-policy integral reinforcement learning (IRL) method. Online implementation of SPUA requires the disturbance signals to be adjustable, which is unrealistic. In this paper, an off-policy IRL algorithm based on SPUA is proposed that does not use any knowledge of the system dynamics. A neural-network-based online adaptive critic implementation scheme of the off-policy IRL algorithm is then presented. Based on the online off-policy IRL method, a computational intelligence interception guidance (CIIG) law is developed for intercepting highly maneuvering targets. Because the method is model-free, interception can be achieved by measuring system data online. The effectiveness of the CIIG law is verified through two missile-target engagement scenarios.
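For orientation, the HJI equation mentioned in the abstract can be written in its common textbook form for an input-affine system. The abstract does not give the paper's notation, so every symbol here is an assumption: dynamics $\dot{x} = f(x) + g(x)u + k(x)w$, control $u$ (the interceptor), disturbance $w$ (the target maneuver), state penalty $Q(x) \ge 0$, control weight $R > 0$, and attenuation level $\gamma > 0$.

```latex
% Zero-sum game value function (assumed standard form, not taken from the paper)
V(x_0) = \min_{u}\max_{w} \int_0^{\infty}
  \left( Q(x) + u^{\top} R\, u - \gamma^{2} w^{\top} w \right) dt

% Saddle-point policies from the stationarity conditions of the Hamiltonian
u^{*} = -\tfrac{1}{2} R^{-1} g(x)^{\top} \nabla V , \qquad
w^{*} = \tfrac{1}{2\gamma^{2}} k(x)^{\top} \nabla V

% HJI equation obtained by substituting u^* and w^* back into the Hamiltonian
0 = Q(x) + \nabla V^{\top} f(x)
  - \tfrac{1}{4} \nabla V^{\top} g(x) R^{-1} g(x)^{\top} \nabla V
  + \tfrac{1}{4\gamma^{2}} \nabla V^{\top} k(x) k(x)^{\top} \nabla V
```

The two quadratic terms in $\nabla V$ enter with opposite signs, which is the nonlinearity that rules out a closed-form solution in general and motivates iterative schemes such as SPUA.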
Funding: Supported by the National Natural Science Foundation of China (60874114).
Abstract: The relationship between the state-dependent Riccati equation (SDRE) technique and the Hamilton-Jacobi-Isaacs (HJI) equation for nonlinear H∞ control design is investigated. By establishing Lyapunov matrix equations for the partial derivatives of the solution of the SDRE and introducing a symmetry measure for some related matrices, a method is proposed for examining whether the SDRE method admits a globally optimal control equivalent to that obtained by solving the HJI equation. Two simulation examples illustrate the effectiveness of the method.
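To make the SDRE side of this comparison concrete, the following is a minimal sketch of a pointwise SDRE solve for nonlinear H∞ control. It is not the paper's method: it does not include the Lyapunov-equation or symmetry-measure test described in the abstract. The example system, the function names `sdc_matrices` and `sdre_hinf_gain`, and the values of `Q` and `gamma` are all hypothetical. The Riccati equation is solved through the stable invariant subspace of the associated Hamiltonian matrix, a standard technique swapped in here for illustration.

```python
import numpy as np


def sdc_matrices(x):
    """Hypothetical state-dependent coefficient (SDC) form of the toy system
    x1' = x2,  x2' = -x1 + x1^2 * x2 + u + w,  written as x' = A(x) x + B2 u + B1 w."""
    A = np.array([[0.0, 1.0],
                  [-1.0, x[0] ** 2]])
    B1 = np.array([[0.0], [1.0]])  # disturbance input matrix (assumed)
    B2 = np.array([[0.0], [1.0]])  # control input matrix (assumed)
    return A, B1, B2


def sdre_hinf_gain(A, B1, B2, Q, gamma):
    """Solve A'P + PA - P (B2 B2' - B1 B1' / gamma^2) P + Q = 0 at one state.

    Returns the stabilizing solution P and the gain K such that u = -K x.
    """
    n = A.shape[0]
    S = B2 @ B2.T - (B1 @ B1.T) / gamma ** 2
    # Hamiltonian matrix whose stable invariant subspace is spanned by [I; P].
    H = np.block([[A, -S],
                  [-Q, -A.T]])
    eigvals, eigvecs = np.linalg.eig(H)
    stable = eigvecs[:, eigvals.real < 0]
    if stable.shape[1] != n:
        raise ValueError("no stabilizing solution: wrong number of stable eigenvalues")
    V1, V2 = stable[:n, :], stable[n:, :]
    P = np.real(V2 @ np.linalg.inv(V1))
    P = 0.5 * (P + P.T)  # remove round-off asymmetry
    K = B2.T @ P
    return P, K


# Usage: evaluate the SDRE feedback at a single state (illustrative values).
x = np.array([0.5, -0.2])
A, B1, B2 = sdc_matrices(x)
Q = np.eye(2)
gamma = 2.0
P, K = sdre_hinf_gain(A, B1, B2, Q, gamma)
u = -K @ x  # SDRE control at this state
```

In a closed-loop simulation this solve would be repeated at every sampled state. Whether the resulting feedback coincides with the HJI-optimal control is exactly the question the abstract addresses, and this sketch does not settle it.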