
Optimization of Multi-Agent Handover Based on Team Model in C-V2X Environments
Abstract: Cellular vehicle-to-everything (C-V2X) communication technology is a crucial component of future intelligent transportation systems (ITS). Millimeter wave (mmWave), as one of the primary carriers for C-V2X technology, offers high bandwidth to users. However, due to its limited propagation distance and sensitivity to obstructions, mmWave base stations must be densely deployed to maintain reliable communication. This forces intelligent connected vehicles (ICVs) to frequently switch communications during travel, which easily causes local resource shortages and in turn degrades quality of service (QoS) and quality of experience (QoE). To address these challenges, we treat each ICV as an agent and model the ICV communication handover problem as a cooperative multi-agent game. To solve this problem, we propose a cooperative reinforcement learning framework based on a teammate model. Specifically, we first design a teammate model that quantifies the interdependencies among agents in complex, dynamic environments. We then propose a dynamic weight allocation scheme that generates weighted mutual information among teammates as input to the mixing network, helping teammates switch to base stations that can provide satisfactory QoS and QoE, and thereby achieving high throughput and a low handover frequency. During training, we design an incentive-compatible training algorithm that aligns the agents' individual goals with the collective goal and improves communication throughput. Experimental results show that the proposed method performs well in scenarios with different numbers of vehicles, achieving a 13.8% to 38.2% throughput improvement over existing communication benchmark methods.
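As a rough illustration of the kind of value-decomposition architecture the abstract describes, the sketch below shows per-agent Q-values scaled by weights produced from a teammate model before being fed to a monotonic (QMIX-style) mixing network. This is a minimal sketch under assumed design choices: the class names, network shapes, and the softmax weighting rule are hypothetical and are not taken from the paper.

```python
# Hypothetical sketch: teammate-weighted value mixing in PyTorch.
# Not the authors' implementation; shapes and weighting rule are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class TeammateWeightModel(nn.Module):
    """Stand-in for the paper's dynamic weight allocation: maps the agents'
    local observations to a non-negative weight per teammate."""

    def __init__(self, obs_dim: int, n_agents: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.ReLU(), nn.Linear(hidden, n_agents)
        )

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        # obs: (batch, n_agents, obs_dim) -> weights: (batch, n_agents)
        logits = self.net(obs).mean(dim=1)     # pool each agent's view
        return torch.softmax(logits, dim=-1)   # weights sum to 1


class MonotonicMixer(nn.Module):
    """QMIX-style mixer: combines teammate-weighted per-agent Q-values into
    Q_tot, with state-conditioned mixing weights kept non-negative so that
    Q_tot is monotonic in each agent's Q-value."""

    def __init__(self, n_agents: int, state_dim: int, embed: int = 32):
        super().__init__()
        self.n_agents, self.embed = n_agents, embed
        self.w1 = nn.Linear(state_dim, n_agents * embed)
        self.b1 = nn.Linear(state_dim, embed)
        self.w2 = nn.Linear(state_dim, embed)
        self.b2 = nn.Linear(state_dim, 1)

    def forward(self, agent_qs, state, teammate_w):
        # agent_qs, teammate_w: (batch, n_agents); state: (batch, state_dim)
        q = (agent_qs * teammate_w).unsqueeze(1)                   # (B,1,N)
        w1 = torch.abs(self.w1(state)).view(-1, self.n_agents, self.embed)
        h = F.elu(torch.bmm(q, w1) + self.b1(state).unsqueeze(1))  # (B,1,E)
        w2 = torch.abs(self.w2(state)).unsqueeze(-1)               # (B,E,1)
        return torch.bmm(h, w2).squeeze(-1) + self.b2(state)       # (B,1)


if __name__ == "__main__":
    B, N, OBS, STATE = 4, 3, 16, 48
    obs, state = torch.randn(B, N, OBS), torch.randn(B, STATE)
    agent_qs = torch.randn(B, N)
    w = TeammateWeightModel(OBS, N)(obs)
    print(MonotonicMixer(N, STATE)(agent_qs, state, w).shape)  # torch.Size([4, 1])
```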
Authors: Liu Bingyi (刘冰艺), Wang Dongdong (王东东), Shi Haiyong (施海勇), Wang Enshu (王恩澍), Wu Libing (吴黎兵), Wang Jianping (汪建平) (School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan 430070; Sanya Science and Education Innovation Park, Wuhan University of Technology, Sanya, Hainan 572000; School of Cyber Science and Engineering, Wuhan University, Wuhan 430070; Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong, China)
Source: Journal of Computer Research and Development (《计算机研究与发展》, indexed in EI and CSCD, Peking University Core Journal), 2024, No. 11, pp. 3806-3820 (15 pages)
Funding: National Natural Science Foundation of China (62272357, 62302326, 62202348, U20A20177); Key Research and Development Program of Hubei Province (2022BAA052); NSFC/RGC joint project of the Hong Kong Research Grants Council (N_CityU140/20).
Keywords: C-V2X; resource allocation; communication handover; multi-agent reinforcement learning; cooperative multi-agent reinforcement learning