期刊文献+

UAV maneuvering decision-making algorithm based on deep reinforcement learning under the guidance of expert experience

下载PDF
导出
摘要 Autonomous umanned aerial vehicle(UAV) manipulation is necessary for the defense department to execute tactical missions given by commanders in the future unmanned battlefield. A large amount of research has been devoted to improving the autonomous decision-making ability of UAV in an interactive environment, where finding the optimal maneuvering decisionmaking policy became one of the key issues for enabling the intelligence of UAV. In this paper, we propose a maneuvering decision-making algorithm for autonomous air-delivery based on deep reinforcement learning under the guidance of expert experience. Specifically, we refine the guidance towards area and guidance towards specific point tasks for the air-delivery process based on the traditional air-to-surface fire control methods.Moreover, we construct the UAV maneuvering decision-making model based on Markov decision processes(MDPs). Specifically, we present a reward shaping method for the guidance towards area and guidance towards specific point tasks using potential-based function and expert-guided advice. The proposed algorithm could accelerate the convergence of the maneuvering decision-making policy and increase the stability of the policy in terms of the output during the later stage of training process. The effectiveness of the proposed maneuvering decision-making policy is illustrated by the curves of training parameters and extensive experimental results for testing the trained policy.
出处 《Journal of Systems Engineering and Electronics》 SCIE CSCD 2024年第3期644-665,共22页 系统工程与电子技术(英文版)
基金 supported by the Key Research and Development Program of Shaanxi (2022GXLH-02-09) the Aeronautical Science Foundation of China (20200051053001) the Natural Science Basic Research Program of Shaanxi (2020JM-147)。
  • 相关文献

参考文献4

二级参考文献50

  • 1李建勋,佟明安,金德琨.协商微分对策理论及其在多机空战分析中的应用[J].系统工程理论与实践,1997,17(6):68-72. 被引量:14
  • 2J. Moore. UAV Fire-fighting System, U.S. Patent 20130134254 A1, May 2013.
  • 3M. B. Wang, A. Chu, L. A. Bush, B. C. Williams. Active detection of drivable surfaces in support of robotic disas- ter relief missions. In Proceedings of Aerospace Conference, IEEE, Big Sky, MT, USA, pp. 1-13, 2013.
  • 4D. Erdos, A. Erdos, S. E. Vatkins. An experimental UAV system for search and rescue challenge. IEEE Aerospace and Electronic Systems Magazine, vol. 28, no. 5, pp. 32-37, 2013.
  • 5S. Q. Zhu, D. W. Wang, C. B. Low. Ground target tracking using UAV with input constraints. Journal of Intelligent g Robotic Systems, vol. 69, no. 1-4, pp. 417-429, 2013.
  • 6Y. G. Fu, M. Y. Ding, C. P. Zhou. Phase angle-encoded and quantum-behaved particle swarm optimization applied to three-dimensional route planning for UAV. IEEE Transac- tions on Systems, Man, and Cybernetics - Part A: Systems and Humans, vol. 42, no. 2, pp. 511-526, 2012.
  • 7O. Hachour. Path planning of autonomous mobile robot. International Journal of Systems Applications, Engineering & Development, vol. 2, no. 4, pp. 178 190, 2008.
  • 8T. Stoyanov, M. Magnusson, H. Andreasson, A. J. Lilien- thal. Path planning in 3D environments using the normal distributions transform. In Proceedings of 2010 IEEE/RSJ InternationM Conference on Intelligent Robots and Sys- tems, IEEE, Taipei, Taiwan, China, pp. 3263 3268, 2010.
  • 9Z. Qi, Z. H. Shao, Y. S. Ping, L. M. Hiot, Y. K. Leong. An improved heuristic algorithm for UAV path planning in 3D environment. In Proceedings of the 2nd International Conference on Intelligent Human-machine System and Cy- bernetics, IEEE, Nanjing, China, pp. 258-261, 2010.
  • 10Z. N. Dong, Z. J. Chen, R. Zhou, R. L. Zhang. A hybrid approach of virtual force and A* search algorithm for UAV path re-planning. In Proceedings of the 6th IEEE Inter- national Conference on Industrial Electronics and Applica- tions, IEEE, Beijing, China, pp. 1140-1145, 2011.

共引文献54

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部