Multi-robot path planning based on a deep reinforcement learning DQN algorithm 被引量：35

下载PDF

导出

摘要 The unmanned warehouse dispatching system of the‘goods to people’model uses a structure mainly based on a handling robot,which saves considerable manpower and improves the efficiency of the warehouse picking operation.However,the optimal performance of the scheduling system algorithm has high requirements.This study uses a deep Q-network(DQN)algorithm in a deep reinforcement learning algorithm,which combines the Q-learning algorithm,an empirical playback mechanism,and the volume-based technology of productive neural networks to generate target Q-values to solve the problem of multi-robot path planning.The aim of the Q-learning algorithm in deep reinforcement learning is to address two shortcomings of the robot path-planning problem:slow convergence and excessive randomness.Preceding the start of the algorithmic process,prior knowledge and prior rules are used to improve the DQN algorithm.Simulation results show that the improved DQN algorithm converges faster than the classic deep reinforcement learning algorithm and can more quickly learn the solutions to path-planning problems.This improves the efficiency of multi-robot path planning.

作者 Yang Yang Li Juntao Peng Lingling

机构地区 School of Management School of Information

出处《CAAI Transactions on Intelligence Technology》 2020年第3期177-183,共7页 智能技术学报（英文）

基金 This research has been supported by Yueqi Youth Scholar Funding of China University of Mining and Technology(Beijing) the Major Programme of the National Natural Science Foundation of China(No.71831001).

关键词 ALGORITHM robot SHORTCOMINGS

分类号 TN9 [电子电信—信息与通信工程]

引文网络
相关文献

参考文献11

1胡俊,朱庆保.未知环境下基于有先验知识的滚动Q学习机器人路径规划[J].控制与决策,2010,25(9):1364-1368. 被引量：11
2李维刚,王肖,赵云涛,李梓响.基于栅格法的钢厂无人天车调度系统[J].系统仿真学报,2020,32(4):687-699. 被引量：6
3黄立新,耿以才.基于动态人工势场法移动机器人路径规划研究[J].计算机测量与控制,2017,25(2):164-166. 被引量：10
4魏彤,龙琛.基于改进遗传算法的移动机器人路径规划[J].北京航空航天大学学报,2020,46(4):703-711. 被引量：104
5孙炜,吕云峰,唐宏伟,薛敏.基于一种改进A*算法的移动机器人路径规划[J].湖南大学学报（自然科学版）,2017,44(4):94-101. 被引量：67
6周飞燕,金林鹏,董军.卷积神经网络研究综述[J].计算机学报,2017,40(6):1229-1251. 被引量：1737
7张丹露,孙小勇,傅顺,郑彬.智能仓库中的多机器人协同路径规划方法[J].计算机集成制造系统,2018,24(2):410-418. 被引量：56
8马磊,张文旭,戴朝华.多机器人系统强化学习研究综述[J].西南交通大学学报,2014,49(6):1032-1044. 被引量：14
9刘艳红,陈田田,张方方.基于改进粒子群算法的移动机器人路径规划[J].郑州大学学报（理学版）,2020,52(1):114-119. 被引量：16
10方敏,李浩.基于状态回溯代价分析的启发式Q学习[J].模式识别与人工智能,2013,26(9):838-844. 被引量：9

二级参考文献168

1Laura RAY.Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning[J].控制理论与应用（英文版）,2011,9(3):440-450. 被引量：2
2胡小兵,黄席樾.基于蚁群算法的三维空间机器人路径规划[J].重庆大学学报（自然科学版）,2004,27(8):132-135. 被引量：22
3张捍东,郑睿,岑豫皖.移动机器人路径规划技术的现状与展望[J].系统仿真学报,2005,17(2):439-443. 被引量：120
4魏英姿 ,赵明扬 .强化学习算法中启发式回报函数的设计及其收敛性分析[J].计算机科学,2005,32(3):190-193. 被引量：13
5朱庆保,张玉兰.基于栅格法的机器人路径规划蚁群算法[J].机器人,2005,27(2):132-136. 被引量：123
6朱庆保.动态复杂环境下的机器人路径规划蚂蚁预测算法[J].计算机学报,2005,28(11):1898-1906. 被引量：50
7宋清昆,胡子婴.基于经验知识的Q-学习算法[J].自动化技术与应用,2006,25(11):10-12. 被引量：7
8王景存,张晓彤,陈彬,陈和平.一种基于Dijkstra算法的启发式最优路径搜索算法[J].北京科技大学学报,2007,29(3):346-350. 被引量：27
9Ahuh D J, Park J H. Path planning and navigation for autonomous mobile robot[C]. IEEE 28th the Annual Conf of the Industrial Electronics Society. Seville: IEEE Press, 2002: 1538-1542.
10Cabin I, Land S. Adaptation of the A* algorithm for the computation of fastest paths in deterministic discrete- time dynamic networks[J]. IEEE Trans on Intelligent Transportation Systems, 2002, 3(1): 60-74.

共引文献2103

1陆文超,崔海朋.一种基于融合自编码与神经网络的协同过滤算法[J].中国水运（下半月）,2022,22(3):18-20.
2杜佳峰,王景松,杨宝军,薛勇新,郑春华.基于卷积神经网络的船舶水尺字符识别方法研究[J].中国水运（下半月）,2020(3):1-3. 被引量：1
3陆天和,刘莉,贺云涛,杨盾.多无人机航迹规划算法及关键技术[J].战术导弹技术,2020(1):85-90. 被引量：7
4徐雪松,曾智,邵红燕,杨胜杰,李想.基于个体-协同触发强化学习的多机器人行为决策方法[J].仪器仪表学报,2020(5):66-75. 被引量：11
5林桢哲,王桂棠,陈建强,符秦沈.基于残差网络深度学习的肺部CT图像结节良恶性分类模型[J].仪器仪表学报,2020,41(3):248-256. 被引量：22
6陈仁祥,张勇,杨黎霞,陈才,徐向阳.基于整周期数据和卷积神经网络的谐波减速器健康状态评估[J].仪器仪表学报,2020,41(2):245-252. 被引量：20
7鲍光海,林善银,徐林森.基于改进型卷积网络的汽车高度调节器缺陷检测方法[J].仪器仪表学报,2020,41(2):157-165. 被引量：13
8谭宇辰,蔡晶晶,倪辰.基于深度学习的Web攻击检测技术研究[J].信息网络安全,2020(S02):122-126.
9任杰,李钢,赵燕姣,姚琼辛,田培辰.基于改进Faster RCNN的城市道路货车检测[J].计算机系统应用,2022,31(12):316-321. 被引量：3
10胡伟,文武,魏敏.改进U-Net的高分辨率遥感图像轻量化分割[J].计算机系统应用,2022,31(12):135-146. 被引量：2

同被引文献304

1马随阳,余永周,吕英豪.动态规划算法对航道岸线中无人机测绘路径的优化[J].中国水运（下半月）,2022,22(10):76-78. 被引量：1
2王苏彧,张铃炜,齐佳丽,盖禹成.自适应导向蚁群算法优化移动机器人路径规划[J].计算机应用研究,2020,37(S01):116-117. 被引量：9
3李辉,祁宇明.一种复杂环境下基于深度强化学习的机器人路径规划方法[J].计算机应用研究,2020,37(S01):129-131. 被引量：13
4周瑶瑶,李烨.基于排序优先经验回放的竞争深度Q网络学习[J].计算机应用研究,2020,37(2):486-488. 被引量：7
5董一群,艾剑良.自主空战技术中的机动决策:进展与展望[J].航空学报,2020(S02):4-12. 被引量：12
6刘洋,李建军.深度确定性策略梯度算法优化[J].辽宁工程技术大学学报（自然科学版）,2020(6):545-549. 被引量：2
7DUAN HaiBin & LIU SenQi National Key Laboratory of Science and Technology on Holistic Flight Control,School of Automation Science and Electrical Engineering,Beijing University of Aeronautics and Astronautics,Beijing 100191,China.Unmanned air/ground vehicles heterogeneous cooperative techniques:Current status and prospects[J].Science China(Technological Sciences),2010,53(5):1349-1355. 被引量：18
8韩松臣,秦俊奇,韩品尧,邵成勋.马尔可夫决策过程在目标分配中的应用[J].哈尔滨工业大学学报,1996,28(2):32-36. 被引量：12
9孙斌,韩大鹏,韦庆.基于滚动窗口算法的机器人路径规划应用研究[J].计算机仿真,2006,23(6):159-162. 被引量：9
10陈英武,蔡怀平,邢立宁.动态武器目标分配问题中策略优化的改进算法[J].系统工程理论与实践,2007,27(7):160-165. 被引量：14

引证文献35

1Tianyun Qiu,Yaxuan Cheng.Applications and Challenges of Deep Reinforcement Learning in Multi-robot Path Planning[J].Journal of Electronic Research and Application,2021,5(6):25-29. 被引量：1
2赵国庆,徐君明,刘爱东.降低方差的深度确定性策略梯度算法[J].兵工自动化,2022,41(6):41-46. 被引量：2
3王涛,黎玉康,刘文学.无人车辆路径规划算法发展现状[J].舰船电子工程,2022,42(5):15-22. 被引量：2
4曾斌,张鸿强,李厚朴.针对无人潜航器的反潜策略研究[J].系统工程与电子技术,2022,44(10):3174-3181. 被引量：2
5Zheng Zhang,Juan Chen,Qing Guo.Application of Automated Guided Vehicles in Smart Automated Warehouse Systems:A Survey[J].Computer Modeling in Engineering & Sciences,2023(3):1529-1563. 被引量：5
6禹鑫燚,杜丹枫,欧林林.不确定环境下的深度强化学习编队避障控制[J].高技术通讯,2022,32(8):836-844. 被引量：2
7李生,李明宇,方舟,黄俊杰,王桢,易朋兴.基于改进LPA*算法的虚拟人路径规划[J].装备制造技术,2022(8):82-84. 被引量：2
8高万博,朱俊武,章永龙,章小卫.基于选择交叉烟花算法的无人车路径规划[J].计算机工程,2022,48(11):314-320. 被引量：2
9张鑫菠,李乐,冀海军,彭星光.基于Q学习的水下滑翔机路径规划方法[J].计算机测量与控制,2022,30(11):192-198.
10Zhang Xin,Lou Haoran,Jiang Li,Xiao Qianhao,Cai Zhuwen.Vehicle-following system based on deep reinforcement learning in marine scene[J].The Journal of China Universities of Posts and Telecommunications,2022,29(5):10-20.

二级引证文献53

1阚保强.防疫巡逻机器人系统[J].齐齐哈尔大学学报（自然科学版）,2023,39(4):6-10.
2任志伟,胡平,闫方,曲富柱.基于蚁群-改进人工势场法的移动机器人路径规划[J].河南科技学院学报（自然科学版）,2023,51(4):52-63. 被引量：3
3张铁监.大数据与深度学习技术在驾驶员培训中研究与应用[J].长江信息通信,2023,36(7):135-137.
4黄振华,肖银宝.移动机器人障碍感知的应用研究分析[J].科技风,2023(27):7-9. 被引量：1
5倪建云,李浩,谷海青,杜合磊,吴杰,薛晨阳.基于改进VSRB-RRT算法的机器人路径规划仿真实验[J].实验技术与管理,2023,40(9):172-178. 被引量：1
6梁晓龙,王宁,王维佳,可唯一.海上跨域无人集群研究进展综述[J].空军工程大学学报,2023,24(5):2-15. 被引量：3
7郭政堃,姚志广,王颜辉,韩振华.室内自主移动机器人的设计与实现[J].移动信息,2023,45(9):212-214. 被引量：1
8高甲博,肖玮,何智杰.P3C-MADDPG算法的多无人机协同追捕对抗策略研究[J].指挥控制与仿真,2023,45(6):7-18.
9李国飞,汤清璞,吴云洁.从飞行器无导引头的主-从式多飞行器协同制导方法[J].兵工学报,2023,44(11):3436-3446.
10Wenbing Zhao,Chenxi Huang,Yizhang Jiang.Introduction to the Special Issue on ComputerModeling for Smart Cities Applications[J].Computer Modeling in Engineering & Sciences,2024,138(2):1015-1017.

1徐坤,方阳,宫梦,陈应泉,陈旭,王贤华,杨海平,陈汉平.葡萄糖催化热解制备左旋葡萄糖酮特性研究[J].化工学报,2020,71(8):3594-3601. 被引量：2
2Yongtao ZHANG,Xuyuan ZHOU,Peng LENG,Zhaoshuai GU,Deqiang CAO.Current Development Status of Cucumber Industry in Linyi City and Countermeasures for Improving Quality and Benefits[J].Asian Agricultural Research,2020,12(5):21-24.
3Zheng Guichu.Working Together[J].Beijing Review,2020,63(28):22-25.
4Ke Wang,Jiping Zhou.Kinematical Analysis and Simulation of High-Speed Plate Carrying Manipulator Based on Matlab[J].Engineering（科研）,2012,4(12):850-856. 被引量：6
5WORLD[J].Beijing Review,2020,63(33):8-9.
6Jiechao Ma,Yang Song,Xi Tian,Yiting Hua,Rongguo Zhang,Jianlin Wu.Survey on deep learning for pulmonary medical imaging[J].Frontiers of Medicine,2020,14(4):450-469. 被引量：7
7格桑.权衡码头自动化的利弊[J].海运情报,2020(7):28-29.
8Yun-Peng Xiao,Yu-Kun Lai,Fang-Lue Zhang,Chunpeng Li,Lin Gao.A survey on deep geometry learning:From a representation perspective[J].Computational Visual Media,2020,6(2):113-133. 被引量：14
9Tian-bao DU,Guo-hua SHEN,Zhi-qiu HUANG,Yao-shen YU,De-xiang WU.Automatic traceability link recovery via active learning[J].Frontiers of Information Technology & Electronic Engineering,2020,21(8):1217-1225. 被引量：3
10Ying ZHOU,Lingling WANG,Lieyun DING,Zhouping TANG.Intelligent technologies help operating mobile cabin hospitals effectively cope with COVID-19[J].Frontiers of Engineering Management,2020,7(3):459-460. 被引量：3

CAAI Transactions on Intelligence Technology

2020年第3期

浏览历史

内容加载中请稍等...