In the air combat process,confrontation position is the critical factor to determine the confrontation situation,attack effect and escape probability of UAVs.Therefore,selecting the optimal confrontation position beco...In the air combat process,confrontation position is the critical factor to determine the confrontation situation,attack effect and escape probability of UAVs.Therefore,selecting the optimal confrontation position becomes the primary goal of maneuver decision-making.By taking the position as the UAV’s maneuver strategy,this paper constructs the optimal confrontation position selecting games(OCPSGs)model.In the OCPSGs model,the payoff function of each UAV is defined by the difference between the comprehensive advantages of both sides,and the strategy space of each UAV at every step is defined by its accessible space determined by the maneuverability.Then we design the limit approximation of mixed strategy Nash equilibrium(LAMSNQ)algorithm,which provides a method to determine the optimal probability distribution of positions in the strategy space.In the simulation phase,we assume the motions on three directions are independent and the strategy space is a cuboid to simplify the model.Several simulations are performed to verify the feasibility,effectiveness and stability of the algorithm.展开更多
To solve the problem that multiple missiles should simultaneously attack unmeasurable maneuvering targets,a guidance law with temporal consistency constraint based on the super-twisting observer is proposed.Firstly,th...To solve the problem that multiple missiles should simultaneously attack unmeasurable maneuvering targets,a guidance law with temporal consistency constraint based on the super-twisting observer is proposed.Firstly,the relative motion equations between multiple missiles and targets are established,and the topological model among multiple agents is considered.Secondly,based on the temporal consistency constraint,a cooperative guidance law for simultaneous arrival with finite-time convergence is derived.Finally,the unknown target maneuver-ing is regarded as bounded interference.Based on the second-order sliding mode theory,a super-twisting sliding mode observer is devised to observe and track the bounded interfer-ence,and the stability of the observer is proved.Compared with the existing research,this approach only needs to obtain the sliding mode variable which simplifies the design process.The simulation results show that the designed cooperative guidance law for maneuvering targets achieves the expected effect.It ensures successful cooperative attacks,even when confronted with strong maneuvering targets.展开更多
Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles(UAVs).In this paper,aiming to address the control proble...Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles(UAVs).In this paper,aiming to address the control problem of maneuvering target tracking and obstacle avoidance,an online path planning approach for UAV is developed based on deep reinforcement learning.Through end-to-end learning powered by neural networks,the proposed approach can achieve the perception of the environment and continuous motion output control.This proposed approach includes:(1)A deep deterministic policy gradient(DDPG)-based control framework to provide learning and autonomous decision-making capability for UAVs;(2)An improved method named MN-DDPG for introducing a type of mixed noises to assist UAV with exploring stochastic strategies for online optimal planning;and(3)An algorithm of taskdecomposition and pre-training for efficient transfer learning to improve the generalization capability of UAV’s control model built based on MN-DDPG.The experimental simulation results have verified that the proposed approach can achieve good self-adaptive adjustment of UAV’s flight attitude in the tasks of maneuvering target tracking with a significant improvement in generalization capability and training efficiency of UAV tracking controller in uncertain environments.展开更多
The unpowered high-speed vehicle experiences a significant coupling between the disciplines of aerodynamics and control due to its characteristics of high flight speed and extensive maneuverability within large airspa...The unpowered high-speed vehicle experiences a significant coupling between the disciplines of aerodynamics and control due to its characteristics of high flight speed and extensive maneuverability within large airspace.The conventional aircraft conceptual design process follows a sequential design approach,and there is an artificial separation between the disciplines of aerodynamics and control,neglecting the coupling effects arising from their interaction.As a result,this design process often requires extensive iterations over long periods when applied to high-speed vehicles,and may not be able to effectively achieve the desired design objectives.To enhance the overall performance and design efficiency of high-speed vehicles,this study integrates the concept of Active Control Technology(ACT)from modern aircraft into the philosophy of aerodynamic/control integrated optimization.Two integrated optimization strategies,with differences in coupling granularity,have been developed.Subsequently,these strategies are put into action on a biconical vehicle that operates at Mach 5.The results reveal that the comprehensive performance of the synthesis optimal model derived from the aerodynamic/control integrated optimization strategy is improved by 31.76%and 28.29%respectively compared to the base model under high-speed conditions,demonstrating the feasibility and effectiveness of the method and optimization strategies employed.Moreover,in comparison to the single-stage strategy,the multi-stage strategy takes into deeper consideration the impact of control capacity.As a result,the control performance of the synthesis opti-mal model derived from the multi-stage strategy improves by 13.99%,whereas the single-stage strategy only achieves a 5.79%improvement.This method enables a fruitful interaction between aerodynamic configuration design and control system design,leading to enhanced overall performance and design efficiency.Furthermore,it improves the controllability of high-speed vehicles,mitigating the risk of mission failure resulting from an ineffective control system.展开更多
基金National Key R&D Program of China(Grant No.2021YFA1000402)National Natural Science Foundation of China(Grant No.72071159)to provide fund for conducting experiments。
文摘In the air combat process,confrontation position is the critical factor to determine the confrontation situation,attack effect and escape probability of UAVs.Therefore,selecting the optimal confrontation position becomes the primary goal of maneuver decision-making.By taking the position as the UAV’s maneuver strategy,this paper constructs the optimal confrontation position selecting games(OCPSGs)model.In the OCPSGs model,the payoff function of each UAV is defined by the difference between the comprehensive advantages of both sides,and the strategy space of each UAV at every step is defined by its accessible space determined by the maneuverability.Then we design the limit approximation of mixed strategy Nash equilibrium(LAMSNQ)algorithm,which provides a method to determine the optimal probability distribution of positions in the strategy space.In the simulation phase,we assume the motions on three directions are independent and the strategy space is a cuboid to simplify the model.Several simulations are performed to verify the feasibility,effectiveness and stability of the algorithm.
基金supported by the Funds for the Central Universities。
文摘To solve the problem that multiple missiles should simultaneously attack unmeasurable maneuvering targets,a guidance law with temporal consistency constraint based on the super-twisting observer is proposed.Firstly,the relative motion equations between multiple missiles and targets are established,and the topological model among multiple agents is considered.Secondly,based on the temporal consistency constraint,a cooperative guidance law for simultaneous arrival with finite-time convergence is derived.Finally,the unknown target maneuver-ing is regarded as bounded interference.Based on the second-order sliding mode theory,a super-twisting sliding mode observer is devised to observe and track the bounded interfer-ence,and the stability of the observer is proved.Compared with the existing research,this approach only needs to obtain the sliding mode variable which simplifies the design process.The simulation results show that the designed cooperative guidance law for maneuvering targets achieves the expected effect.It ensures successful cooperative attacks,even when confronted with strong maneuvering targets.
基金The authors would like to acknowledge National Natural Science Foundation of China(Grant No.61573285,No.62003267)Aeronautical Science Foundation of China(Grant No.2017ZC53021)+1 种基金Open Fund of Key Laboratory of Data Link Technology of China Electronics Technology Group Corporation(Grant No.CLDL-20182101)Natural Science Foundation of Shaanxi Province(Grant No.2020JQ-220)to provide fund for conducting experiments.
文摘Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles(UAVs).In this paper,aiming to address the control problem of maneuvering target tracking and obstacle avoidance,an online path planning approach for UAV is developed based on deep reinforcement learning.Through end-to-end learning powered by neural networks,the proposed approach can achieve the perception of the environment and continuous motion output control.This proposed approach includes:(1)A deep deterministic policy gradient(DDPG)-based control framework to provide learning and autonomous decision-making capability for UAVs;(2)An improved method named MN-DDPG for introducing a type of mixed noises to assist UAV with exploring stochastic strategies for online optimal planning;and(3)An algorithm of taskdecomposition and pre-training for efficient transfer learning to improve the generalization capability of UAV’s control model built based on MN-DDPG.The experimental simulation results have verified that the proposed approach can achieve good self-adaptive adjustment of UAV’s flight attitude in the tasks of maneuvering target tracking with a significant improvement in generalization capability and training efficiency of UAV tracking controller in uncertain environments.
基金supported by the National Natural Science Foundation of China(Nos.92371201,52192633)the Natural Science Foundation of Shaanxi Province(No.2022JC-03)Chinese Aeronautical Foundation(No.ASFC-20220019070002)。
文摘The unpowered high-speed vehicle experiences a significant coupling between the disciplines of aerodynamics and control due to its characteristics of high flight speed and extensive maneuverability within large airspace.The conventional aircraft conceptual design process follows a sequential design approach,and there is an artificial separation between the disciplines of aerodynamics and control,neglecting the coupling effects arising from their interaction.As a result,this design process often requires extensive iterations over long periods when applied to high-speed vehicles,and may not be able to effectively achieve the desired design objectives.To enhance the overall performance and design efficiency of high-speed vehicles,this study integrates the concept of Active Control Technology(ACT)from modern aircraft into the philosophy of aerodynamic/control integrated optimization.Two integrated optimization strategies,with differences in coupling granularity,have been developed.Subsequently,these strategies are put into action on a biconical vehicle that operates at Mach 5.The results reveal that the comprehensive performance of the synthesis optimal model derived from the aerodynamic/control integrated optimization strategy is improved by 31.76%and 28.29%respectively compared to the base model under high-speed conditions,demonstrating the feasibility and effectiveness of the method and optimization strategies employed.Moreover,in comparison to the single-stage strategy,the multi-stage strategy takes into deeper consideration the impact of control capacity.As a result,the control performance of the synthesis opti-mal model derived from the multi-stage strategy improves by 13.99%,whereas the single-stage strategy only achieves a 5.79%improvement.This method enables a fruitful interaction between aerodynamic configuration design and control system design,leading to enhanced overall performance and design efficiency.Furthermore,it improves the controllability of high-speed vehicles,mitigating the risk of mission failure resulting from an ineffective control system.