78,191 articles found
Event-Triggered Asymmetric Bipartite Consensus Tracking for Nonlinear Multi-Agent Systems Based on Model-Free Adaptive Control
1
Authors: Jiaqi Liang, Xuhui Bu, Lizhi Cui, Zhongsheng Hou. Journal: IEEE/CAA Journal of Automatica Sinica. Indexed in: SCIE, EI, CSCD. 2023, Issue 3, pp. 662-672 (11 pages).
In this paper, an asymmetric bipartite consensus problem for nonlinear multi-agent systems with cooperative and antagonistic interactions is studied under an event-triggered mechanism. For agents described by a structurally balanced signed digraph, the asymmetric bipartite consensus objective is first defined, assigning the agents' outputs different signs and modulus values. Because the agents' dynamics are completely unknown, a novel event-triggered model-free adaptive bipartite control protocol is designed based on the agents' triggered outputs and an equivalent compact-form data model. The threshold of the triggering condition is obtained by Lyapunov analysis. Subsequently, the asymptotic convergence of the tracking error is deduced and a sufficient condition is obtained based on the contraction mapping principle. Finally, a simulation example further demonstrates the effectiveness of the protocol.
Keywords: Asymmetric bipartite consensus tracking; event-triggered; model-free adaptive control (MFAC); nonlinear systems; signed digraph
Human-in-the-Loop Consensus Control for Nonlinear Multi-Agent Systems With Actuator Faults (Cited by: 7)
2
Authors: Guohuai Lin, Hongyi Li, Hui Ma, Deyin Yao, Renquan Lu. Journal: IEEE/CAA Journal of Automatica Sinica. Indexed in: SCIE, EI, CSCD. 2022, Issue 1, pp. 111-122 (12 pages).
This paper considers the human-in-the-loop leader-following consensus control problem of multi-agent systems (MASs) with unknown matched nonlinear functions and actuator faults. It is assumed that a human operator controls the MASs by sending a command signal to a non-autonomous leader, which generates the desired trajectory. Moreover, the leader's input is nonzero and not available to all followers. By using neural networks and fault estimators to approximate the unknown nonlinear dynamics and identify the actuator faults, respectively, a neighborhood observer-based neural fault-tolerant controller with dynamic coupling gains is designed. It is proved that the state of each follower can synchronize with the leader's state under a directed graph and that all signals in the closed-loop system are cooperatively uniformly ultimately bounded. Finally, simulation results are presented to verify the effectiveness of the proposed control method.
Keywords: Actuator faults; distributed control; human-in-the-loop; neighborhood observer; nonlinear multi-agent systems (MASs)
Adaptive Memory Event-Triggered Observer-Based Control for Nonlinear Multi-Agent Systems Under DoS Attacks (Cited by: 7)
3
Authors: Xianggui Guo, Dongyu Zhang, Jianliang Wang, Choon Ki Ahn. Journal: IEEE/CAA Journal of Automatica Sinica. Indexed in: SCIE, EI, CSCD. 2021, Issue 10, pp. 1644-1656 (13 pages).
This paper investigates the event-triggered security consensus problem for nonlinear multi-agent systems (MASs) under denial-of-service (DoS) attacks over an undirected graph. A novel adaptive memory observer-based anti-disturbance control scheme is presented that improves observer accuracy by adding a buffer for the system output measurements. This control scheme can also provide more reasonable control signals when DoS attacks occur. To save network resources, an adaptive memory event-triggered mechanism (AMETM) is also proposed, and Zeno behavior is excluded. Notably, the AMETM's updates do not require global information. The observer and controller gains are then obtained by using the linear matrix inequality (LMI) technique. Finally, simulation examples show the effectiveness of the proposed control scheme.
Keywords: Adaptive memory event-triggered mechanism (AMETM); compensation mechanism; denial-of-service (DoS) attacks; nonlinear multi-agent systems (MASs); observer-based anti-disturbance control
Consensus of second-order nonlinear multi-agent systems via sliding mode observer and controller (Cited by: 1)
4
Authors: Xiaolei Li, Xiaoyuan Luo, Shaobao Li, Jianjin Li, Xinping Guan. Journal: Journal of Systems Engineering and Electronics. Indexed in: SCIE, EI, CSCD. 2017, Issue 4, pp. 756-765 (10 pages).
This paper investigates the consensus problem of second-order nonlinear multi-agent systems (MASs) via the sliding mode control (SMC) approach. The velocity of each agent is assumed to be unmeasurable. A second-order sliding mode observer is designed to estimate the velocity. Then a distributed discontinuous control law based on first-order SMC is presented to solve the consensus problem. Moreover, to overcome the chattering problem, two controllers are presented, based on the boundary layer method and the super-twisting algorithm, respectively. It is shown that the MASs achieve consensus under some given conditions. Some examples are provided to demonstrate the effectiveness of the proposed control laws.
Keywords: nonlinear multi-agent system; sliding mode observer; consensus; sliding mode controller
Adaptive Consensus Quantized Control for a Class of High-Order Nonlinear Multi-Agent Systems With Input Hysteresis and Full State Constraints (Cited by: 2)
5
Authors: Guoqiang Zhu, Haoqi Li, Xiuyu Zhang, Chenliang Wang, Chun-Yi Su, Jiangping Hu. Journal: IEEE/CAA Journal of Automatica Sinica. Indexed in: SCIE, EI, CSCD. 2022, Issue 9, pp. 1574-1589 (16 pages).
For a class of high-order nonlinear multi-agent systems with input hysteresis, an adaptive consensus output-feedback quantized control scheme with full state constraints is investigated. The major properties of the proposed control scheme are: 1) According to the different hysteresis input characteristics of each agent in the multi-agent system, a hysteresis quantization inverse compensator is designed to eliminate the influence of hysteresis on the system while ensuring that the quantized signal maintains the desired value. 2) A barrier Lyapunov function is introduced for the first time in the hysteretic multi-agent system. A state-constraint control strategy constructed for the hysteretic multi-agent system ensures that all system states always remain within a predetermined range. 3) The designed adaptive consensus output-feedback quantized control scheme allows the hysteretic system to have unknown parameters and unknown disturbances, ensures that the input signal transmitted between agents is the quantized value, and requires only the sector-bound property of the introduced quantizer. The stability analysis proves that all signals of the closed-loop system are semi-globally uniformly bounded. A StarSim hardware-in-the-loop simulation verifies the effectiveness of the proposed adaptive quantized control scheme.
Keywords: Adaptive quantized control; barrier Lyapunov function; input hysteresis; multi-agent systems
Distributed Fault-Tolerant Containment Control for Nonlinear Multi-Agent Systems Under Directed Network Topology via Hierarchical Approach (Cited by: 3)
6
Authors: Shuyi Xiao, Jiuxiang Dong. Journal: IEEE/CAA Journal of Automatica Sinica. Indexed in: SCIE, EI, CSCD. 2021, Issue 4, pp. 806-816 (11 pages).
This paper investigates the distributed fault-tolerant containment control (FTCC) problem of nonlinear multi-agent systems (MASs) under a directed network topology. The proposed control framework, which is independent of global information about the communication topology, consists of two layers. Unlike most existing distributed fault-tolerant control (FTC) protocols, where a fault in one agent may propagate over the network, the developed control method eliminates fault propagation. Based on the hierarchical control strategy, the FTCC problem with a directed graph is simplified to distributed containment control in the upper layer and fault-tolerant tracking control in the lower layer. Finally, simulation results are given to demonstrate the effectiveness of the proposed control protocol.
Keywords: Adaptive fault-tolerant control; directed network topology; distributed control; hierarchical control; multi-agent systems (MASs)
Adaptive Third-Order Leader-Following Consensus of Nonlinear Multi-agent Systems with Perturbations (Cited by: 1)
7
Authors: SUN Mei, CHEN Ying, CAO Long, WANG Xiao-Fang. Journal: Chinese Physics Letters. Indexed in: SCIE, CAS, CSCD. 2012, Issue 2, pp. 37-40 (4 pages).
We investigate the third-order leader-following consensus problem of nonlinear multi-agent systems in undirected network topologies. Based on graph theory and Lyapunov stability theory, the adaptive control method is employed to achieve leader-following consensus in an undirected network of agents with nonlinear third-order dynamics subject to perturbations. Simulation examples validate the correctness of the results and show that the control gains have a great influence on the convergence performance of the errors over a short time.
Keywords: undirected; nonlinear
Adaptive Containment Control for Fractional-Order Nonlinear Multi-Agent Systems With Time-Varying Parameters (Cited by: 1)
8
Authors: Yang Liu, Huaguang Zhang, Yingchun Wang, Hongjing Liang. Journal: IEEE/CAA Journal of Automatica Sinica. Indexed in: SCIE, EI, CSCD. 2022, Issue 9, pp. 1627-1638 (12 pages).
This paper investigates adaptive containment control for a class of fractional-order multi-agent systems (FOMASs) with time-varying parameters and disturbances. The bounded estimation method is used to overcome the difficulty caused by the time-varying parameters and disturbances. A command filter is introduced to address the complexity problem inherent in adaptive backstepping control. Meanwhile, to eliminate the effect of filter errors, a novel distributed error-compensating scheme is constructed that uses only local information from neighboring agents. Then, a distributed adaptive containment control scheme for FOMASs is developed based on backstepping to guarantee that the outputs of all followers are steered to the convex hull spanned by the leaders. Based on an extension of Barbalat's lemma to fractional-order integrals, it is proven that the containment errors and the compensating signals converge asymptotically. Finally, three simulation examples are given to show the feasibility and effectiveness of the proposed control method.
Keywords: Adaptive backstepping control; command filter; fractional-order multi-agent system; time-varying parameters
Neural Network Based Adaptive Tracking of Nonlinear Multi-Agent System
9
Authors: Bo-Xian Lin, Wei-Hao Li, Kai-Yu Qin, Xi Chen. Journal: Journal of Electronic Science and Technology. Indexed in: CAS, CSCD. 2021, Issue 2, pp. 144-154 (11 pages).
In this paper, the problems of robust consensus tracking control for second-order multi-agent systems with uncertain model parameters and nonlinear disturbances are considered. An adaptive control strategy is proposed to smooth the agents' trajectories, and a neural network is constructed to estimate the system's unknown components. The consensus conditions are demonstrated for tracking a leader with nonlinear dynamics under an adaptive control algorithm in the absence of model uncertainties. The results are then extended to systems with unknown time-varying disturbances by applying the neural network estimate to compensate for the uncertain parts of the agents' models. Update laws are designed based on the Lyapunov function terms to ensure the effectiveness of robust control. Finally, the theoretical results are verified by numerical simulations, and a comparative experiment shows that the trajectories generated by the proposed method exhibit less oscillation and converge faster.
Keywords: Coordinated tracking; leader following consensus; neural network based adaptive control; robust control; uncertain nonlinear system
Neural-network-based fully distributed formation control for nonlinear multi-agent systems with event-triggered communication (Cited by: 1)
10
Authors: ZHU GuoLiang, LIU KeXin, GU HaiBo, LÜ JinHu. Journal: Science China (Technological Sciences). Indexed in: SCIE, EI, CAS, CSCD. 2024, Issue 1, pp. 209-220 (12 pages).
This paper investigates the consensus-based formation control problem for multi-agent systems with unknown nonlinear dynamics. To achieve the desired formation, we propose two formation controllers, one based on system states and the other on system outputs. The proposed controllers use adaptive gains to avoid the need for global information and neural networks to estimate and compensate for nonlinearities. The proposed event-triggered schemes avoid continuous communication among agents and exclude Zeno behavior. Stability analysis reveals that formation errors are bounded, and numerical simulations validate the effectiveness of the proposed approaches.
Keywords: formation control; neural network; adaptive control; event-triggered communication; multi-agent systems
Distributed Cooperative Anti-Disturbance Control for High-Order MIMO Nonlinear Multi-Agent Systems
11
Authors: JIN Feiyu, CHEN Longsheng, LI Tongshuai, SHI Tongxin. Journal: Journal of Shanghai Jiaotong University (Science). Indexed in: EI. 2024, Issue 4, pp. 656-666 (11 pages).
To solve the synchronization and tracking problems, a cooperative control scheme is proposed for a class of higher-order multi-input and multi-output (MIMO) nonlinear multi-agent systems (MASs) subject to uncertainties and external disturbances. First, the coupled relationships among the Laplacian matrix, the leader-following adjacency matrix, and the consensus error are analyzed based on an undirected graph. Furthermore, nonlinear disturbance observers (NDOs) are designed to estimate compounded disturbances in MASs, and a distributed cooperative anti-disturbance control protocol is proposed for high-order MIMO nonlinear MASs based on the outputs of the NDOs and the dynamic surface control approach. Finally, the feasibility and effectiveness of the proposed scheme are established through Lyapunov stability theory and simulation experiments.
Keywords: nonlinear disturbance observer (NDO); higher-order multi-input and multi-output (MIMO) system; multi-agent system; cooperative control; disturbance suppression
Event-Triggered Fixed-Time Consensus of Second-Order Nonlinear Multi-Agent Systems with Delay and Switching Topologies
12
Authors: XING Youjing, GAO Jinfeng, LIU Xiaoping, WU Ping. Journal: Journal of Shanghai Jiaotong University (Science). Indexed in: EI. 2024, Issue 4, pp. 625-639 (15 pages).
To address fixed-time consensus problems of a class of leader-follower second-order nonlinear multi-agent systems with uncertain external disturbances, an event-triggered fixed-time consensus protocol is proposed. First, the virtual velocity is designed based on the backstepping control method to achieve system consensus, with a bound on the convergence time that depends only on the system parameters. Second, an event-triggered mechanism is presented to avoid frequent communication between agents, and a triggering condition based on state information is given for each follower. This saves communication resources, and Zeno behavior is excluded. Then, the delay and switching topologies of the system are also discussed. Next, system stability is analyzed by Lyapunov stability theory. Finally, simulation results demonstrate the validity of the presented method.
Keywords: event-triggered mechanism; fixed-time consensus; multi-agent systems; switching topologies
Design of a Multi-Agent-Based Autonomous Combat System for UAV Swarm Systems of Systems
13
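Authors: 张堃, 华帅, 袁斌林, 杜睿怡. Journal: 《系统工程与电子技术》. Indexed in: EI, CSCD, Peking University Core Journals. 2024, Issue 4, pp. 1273-1286 (14 pages).
To address key problems in designing autonomous combat systems of systems for unmanned swarms, a multi-agent-based design method for an unmanned swarm autonomous combat system is proposed. Agent models and their inference rules are established for each node of the unmanned swarm. To meet the simulation system's requirements for modularity and generality, interoperable system interfaces and the interaction relationships of unmanned swarm autonomous combat are designed. Simulation-based deduction and verification of the unmanned swarm system are then carried out. The simulation results show that the proposed design can not only effectively carry out and complete dynamic demonstration and verification of the whole process of autonomous combat network generation, swarm evolution, and effectiveness evaluation, but can also further evaluate the cooperative combat effectiveness of the unmanned swarm through repeated randomized trials. Finally, strategies and lessons for swarm cooperative combat are summarized.
Keywords: multi-agent; unmanned swarm; system-of-systems design; cooperative combat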
Multi-Agent-Based Fault Diagnosis System for Hydropower Station Transformers
14
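Authors: 乔丹, 马鹏, 王琦. Journal: 《自动化技术与应用》. 2024, Issue 7, pp. 58-61, 65 (5 pages).
To diagnose hydropower station transformer faults accurately and quickly, a multi-agent-based transformer fault diagnosis system is designed. The transformer status monitoring agent sends detected fault information to the system management agent, which forwards it through the communication agent to the transformer fault diagnosis agent. The fault diagnosis agent extracts fault features using the wavelet transform and feeds them into an IFOA-SVM model; once the faults are classified, the diagnosis result is obtained and displayed to the user through the communication agent. Experiments show that the system diagnoses transformer faults effectively: its diagnosis success rate is high and only slightly affected by the rate of fault information loss, and diagnosis consumes little time and energy.
Keywords: multi-agent; hydropower station; transformer; fault diagnosis; wavelet transform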
Dynamic event-triggered leader-follower consensus of nonlinear multi-agent systems under directed weighted topology
15
Authors: Wu Yue, Chen Xiangyong, Qiu Jianlong, Hu Shunwei, Zhao Feng. Journal: The Journal of China Universities of Posts and Telecommunications. Indexed in: EI, CSCD. 2023, Issue 6, pp. 3-10, 21 (9 pages).
This paper studies the dynamic event-triggered leader-follower consensus of nonlinear multi-agent systems (MASs) under a directed weighted graph containing a directed spanning tree, and also considers the effects of disturbances and of a leader with non-zero control inputs. First, a novel distributed control protocol is designed to handle uncertain disturbances and a leader with non-zero control inputs in MASs. Second, a novel dynamic event-triggered control (DETC) strategy is proposed, which eliminates the need for continuous communication between agents and thus reduces the communication resources consumed. Introducing dynamic thresholds reduces the complexity of excluding Zeno behavior within the system. Finally, the effectiveness of the proposed theory is validated through numerical simulation.
Keywords: nonlinear multi-agent systems (MASs); leader-follower consensus; dynamic event-triggered control (DETC); leader of non-zero control input
UAV-Assisted Dynamic Avatar Task Migration for Vehicular Metaverse Services: A Multi-Agent Deep Reinforcement Learning Approach (Cited by: 1)
16
Authors: Jiawen Kang, Junlong Chen, Minrui Xu, Zehui Xiong, Yutao Jiao, Luchao Han, Dusit Niyato, Yongju Tong, Shengli Xie. Journal: IEEE/CAA Journal of Automatica Sinica. Indexed in: SCIE, EI, CSCD. 2024, Issue 2, pp. 430-445 (16 pages).
Avatars, as promising digital representations and service assistants of users in Metaverses, can enable drivers and passengers to immerse themselves in 3D virtual services and spaces of UAV-assisted vehicular Metaverses. However, avatar tasks include a multitude of human-to-avatar and avatar-to-avatar interactive applications, e.g., augmented reality navigation, which consume intensive computing resources. It is inefficient and impractical for vehicles to process avatar tasks locally. Fortunately, migrating avatar tasks to the nearest roadside units (RSUs) or unmanned aerial vehicles (UAVs) for execution is a promising solution to decrease computation overhead and reduce task processing latency, while the high mobility of vehicles makes it challenging for vehicles to independently make avatar migration decisions that depend on current and future vehicle status. To address these challenges, in this paper, we propose a novel avatar task migration system based on multi-agent deep reinforcement learning (MADRL) to execute immersive vehicular avatar tasks dynamically. Specifically, we first formulate the problem of avatar task migration from vehicles to RSUs/UAVs as a partially observable Markov decision process that can be solved by MADRL algorithms. We then design the multi-agent proximal policy optimization (MAPPO) approach as the MADRL algorithm for the avatar task migration problem. To overcome slow convergence resulting from the curse of dimensionality and non-stationary issues caused by shared parameters in MAPPO, we further propose a transformer-based MAPPO approach via sequential decision-making models for the efficient representation of relationships among agents.
Finally, to motivate terrestrial or non-terrestrial edge servers (e.g., RSUs or UAVs) to share computation resources and ensure traceability of the sharing records, we apply smart contracts and blockchain technologies to achieve secure sharing management. Numerical results demonstrate that the proposed approach outperforms the MAPPO approach by around 2% and reduces the latency of avatar task execution in UAV-assisted vehicular Metaverses by approximately 20%.
Keywords: Avatar; blockchain; metaverses; multi-agent deep reinforcement learning; transformer; UAVs
Finite-time Prescribed Performance Time-Varying Formation Control for Second-Order Multi-Agent Systems With Non-Strict Feedback Based on a Neural Network Observer (Cited by: 1)
17
Authors: Chi Ma, Dianbiao Dong. Journal: IEEE/CAA Journal of Automatica Sinica. Indexed in: SCIE, EI, CSCD. 2024, Issue 4, pp. 1039-1050 (12 pages).
This paper studies the problem of time-varying formation control with finite-time prescribed performance for non-strict-feedback second-order multi-agent systems with unmeasured states and unknown nonlinearities. To deal with the unknown nonlinearities, neural networks are applied to approximate the inherent dynamics of the system. In addition, due to the limitations of actual working conditions, each follower agent can obtain only the locally measurable partial state information of the leader agent. To address this problem, a neural network state observer based on the leader state information is designed. Then, a finite-time prescribed performance adaptive output feedback control strategy is proposed by restricting the sliding mode surface to a prescribed region, which ensures that the closed-loop system has practical finite-time stability and that the formation errors of the multi-agent systems converge to the prescribed performance bound in finite time. Finally, a numerical simulation is provided to demonstrate the practicality and effectiveness of the developed algorithm.
Keywords: Finite-time control; multi-agent systems; neural network; prescribed performance control; time-varying formation control
Research on Maneuver Decision-Making of Multi-Agent Adversarial Game in a Random Interference Environment
18
Authors: Shiguang Hu, Le Ru, Bo Lu, Zhenhua Wang, Xiaolin Zhao, Wenfei Wang, Hailong Xi. Journal: Computers, Materials & Continua. Indexed in: SCIE, EI. 2024, Issue 10, pp. 1879-1903 (25 pages).
The strategy evolution process of game players is highly uncertain due to random emergent situations and other external disturbances. This paper investigates strategy interaction and behavioral decision-making among game players in simulated confrontation scenarios within a random interference environment. It considers the possible risks that random disturbances may pose to the autonomous decision-making of game players, as well as the impact of participants' manipulative behaviors on the state changes of the players. A nonlinear mathematical model is established to describe the strategy decision-making process of the participants in this scenario. Subsequently, the strategy selection interaction relationship, strategy evolution stability, and dynamic decision-making process of the game players are investigated and verified by simulation experiments. The results show that maneuver-related parameters and random environmental interference factors have different effects on the selection and evolutionary speed of the agent's strategies. In a highly uncertain environment especially, even small information asymmetry or miscalculation may have a significant impact on decision-making. This also confirms the feasibility and effectiveness of the proposed method, which can better explain the agent's behavioral decision-making during interaction. This study provides feasibility analysis ideas and theoretical references for improving multi-agent interactive decision-making and the interpretability of the game system model.
Keywords: Behavior decision-making; stochastic evolutionary game; nonlinear mathematical modeling; multi-agent; maneuver
Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning
19
Authors: Kun Jiang, Wenzhang Liu, Yuanda Wang, Lu Dong, Changyin Sun. Journal: IEEE/CAA Journal of Automatica Sinica. Indexed in: SCIE, EI, CSCD. 2024, Issue 7, pp. 1591-1604 (14 pages).
Efficient exploration in complex coordination tasks has been considered a challenging problem in multi-agent reinforcement learning (MARL). It is significantly more difficult for tasks with latent variables that agents cannot directly observe. However, most existing latent variable discovery methods lack a clear representation of latent variables and an effective evaluation of the influence of latent variables on the agent. In this paper, we propose a new MARL algorithm based on the soft actor-critic method for complex continuous control tasks with confounders. It is called the multi-agent soft actor-critic with latent variable (MASAC-LV) algorithm, which uses variational inference theory to infer a compact latent variable representation space from a large amount of offline experience. In addition, we derive the counterfactual policy, whose input has no latent variables, and quantify the difference between the actual policy and the counterfactual policy via a distance function. This quantified difference is treated as an intrinsic motivation that gives additional rewards based on how much the latent variable affects each agent. The proposed algorithm is evaluated on two collaboration tasks with confounders, and the experimental results demonstrate the effectiveness of MASAC-LV compared to other baseline algorithms.
Keywords: Latent variable model; maximum entropy; multi-agent reinforcement learning (MARL); multi-agent system
Development of Multi-Agent-Based Indoor 3D Reconstruction
20
Authors: Hoi Chuen Cheng, Frederick Ziyang Hong, Babar Hussain, Yiru Wang, Chik Patrick Yue. Journal: Computers, Materials & Continua. Indexed in: SCIE, EI. 2024, Issue 10, pp. 161-181 (21 pages).
Large-scale indoor 3D reconstruction with multiple robots faces challenges in core enabling technologies. This work contributes a framework addressing localization, coordination, and vision processing for multi-agent reconstruction. A system architecture fusing visible light positioning, multi-agent path finding via reinforcement learning, and 360° camera techniques for 3D reconstruction is proposed. Our visible light positioning algorithm leverages existing lighting for centimeter-level localization without additional infrastructure. Meanwhile, a decentralized reinforcement learning approach is developed to solve the multi-agent path finding problem, with communications among agents optimized. Our 3D reconstruction pipeline utilizes equirectangular projection from 360° cameras to facilitate depth-independent reconstruction from posed monocular images using neural networks. Experimental validation demonstrates the centimeter-level indoor navigation and 3D scene reconstruction capabilities of our framework. The challenges and limitations stemming from the above enabling technologies are discussed at the end of each corresponding section. In summary, this research advances fundamental techniques for multi-robot indoor 3D modeling, contributing to automated, data-driven applications through coordinated robot navigation, perception, and modeling.
Keywords: multi-agent system; multi-robot human collaboration; visible light communication; visible light positioning; 3D reconstruction; reinforcement learning; multi-agent path finding