A Method for Selecting Best Splitting Point of Abstract State in Continuous U-Tree (一种连续U-树抽象状态最佳分裂点选取方法)

Cited by: 1


Abstract: The classical continuous U-tree algorithm uses distribution tests (e.g. the Kolmogorov-Smirnov test and the information gain ratio test) to determine the best splitting points of abstract states, but choosing a suitable confidence threshold for these tests is very difficult. This paper proposes an optimization-based method for selecting the best splitting points of abstract states in a continuous U-tree. The method recasts the selection of the best splitting point as an optimization problem. As a result, it avoids the difficulty of setting an appropriate confidence threshold in the classical algorithm and, in theory, reduces the time complexity of the continuous U-tree algorithm. Experiments on a complex learning task, resolving negotiation deadlocks, show that the method is effective and that the performance of the continuous U-tree algorithm using it is improved.
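The contrast drawn in the abstract, thresholding a distribution test versus picking an optimum, can be illustrated with a minimal sketch. The abstract does not state the paper's actual objective function, so the sketch below makes an assumption: it scores each candidate split by the two-sample Kolmogorov-Smirnov statistic and simply takes the candidate with the highest score, so no confidence threshold is involved. The names `ks_statistic`, `best_split`, and `min_side` are hypothetical and are not taken from the paper.

```python
from bisect import bisect_right


def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between empirical CDFs."""
    a, b = sorted(a), sorted(b)
    gap = 0.0
    for v in a + b:
        cdf_a = bisect_right(a, v) / len(a)
        cdf_b = bisect_right(b, v) / len(b)
        gap = max(gap, abs(cdf_a - cdf_b))
    return gap


def best_split(samples, min_side=2):
    """Pick a split point for one abstract state (one leaf of the tree).

    samples: list of (x, q) pairs, where x is the value of one continuous state
    variable and q is the value estimate recorded for that sample.
    Returns (split_point, score), or None if no candidate split exists.

    Assumption (not from the paper): the objective is the KS statistic between
    the q-values on each side of the split, maximised over all candidates.
    """
    samples = sorted(samples)
    best = None
    for i in range(min_side, len(samples) - min_side + 1):
        lo_x, hi_x = samples[i - 1][0], samples[i][0]
        if lo_x == hi_x:
            continue  # equal x values cannot be separated by a threshold
        left = [q for _, q in samples[:i]]
        right = [q for _, q in samples[i:]]
        score = ks_statistic(left, right)
        if best is None or score > best[1]:
            best = ((lo_x + hi_x) / 2, score)
    return best
```

Because the split is chosen by an argmax over candidates, the only tunable quantity left in this sketch is `min_side`, the minimum number of samples required on each side. Whether and when a leaf should be split at all is a separate decision that the sketch does not address.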

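Authors: Peng Zhiping (彭志平), Ke Wende (柯文德)

Affiliation: Department of Computer Science and Technology, Maoming University (茂名学院)

Source: Journal of Shanghai Jiaotong University (《上海交通大学学报》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2008, No. 2, pp. 279-284 (6 pages)

Funding: Supported by the Natural Science Foundation of Guangdong Province (06029281)

Keywords: continuous U-tree; state abstraction; best splitting point; negotiation deadlock

Classification: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]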


