Abstract
In statistical learning theory, and especially for classification problems, the VC dimension plays a central role, yet it is unknown for most commonly used algorithms. This paper calculates an upper bound on the VC dimension of binary decision tree algorithms (Theorem 2), which rises as the complexity of the decision tree and the number of adjustable parameters per node increase. As a supplement, an upper bound on the VC dimension of a single non-leaf node of univariate decision trees is also calculated (Theorem 3). To assess the numerical results of Theorem 2, related empirical conclusions are verified by experiment and found to agree with practice when the complexity of the decision tree is large. A comparison between Theorem 2 and the empirical conclusions shows a large numerical gap but the same trend.
The VC dimension plays a central role in statistical learning theory, especially for classification problems. Since no universal method exists for calculating this combinatorial dimension, its value remains unknown for most classification algorithms. Binary decision tree algorithms for classification constitute a large family in the fields of machine learning, pattern recognition, and data mining, and investigating their VC dimension is a helpful step towards further improving their generalization performance. To this end, the paper derives, in Theorem 2, an upper bound on the VC dimension of binary decision tree algorithms, which rises as the complexity of the resulting tree and the number of adjustable parameters at each node increase. To facilitate the calculation, only continuous attributes are considered, and the hypothesis class of decision tree algorithms is treated as a continuously expanding family of uniform Boolean formulas. As a supplement, an upper bound for a single non-leaf node of univariate decision tree algorithms is also calculated in Theorem 3, which reveals its distinctive classification capability compared with its multivariate counterpart. For comparison, empirical conclusions on the VC dimension of univariate decision tree algorithms are evaluated by experiment and found to be accurate when the complexity of the resulting tree is sufficiently large. Based on this observation, the empirical VC dimension values are compared numerically with the calculated upper bound; although the discrepancy between them is large, they display the same trend. Possible sources of the looseness of the upper bound are discussed. The conclusions of the paper help explain the essential purpose of improvement techniques such as pruning a grown decision tree or imposing early-stopping criteria during training, which are assumed to limit the VC dimension of the algorithm to a reasonable range and hence alleviate the serious problem of overtraining.
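The theorems themselves are not reproduced in this record, but the qualitative behavior the abstract describes matches standard results in the statistical learning literature. The following is an illustrative sketch only, assuming each internal node tests d continuous attributes; the exact bounds in the paper's Theorems 2 and 3 may differ in form and constants.

% Single multivariate linear node (halfspaces on R^d):
\[ \mathrm{VCdim}\bigl(\{\, x \mapsto \operatorname{sign}(w^{\top}x + b) \,\}\bigr) = d + 1 \]

% Single univariate (axis-parallel) threshold node, cf. Theorem 3's
% univariate/multivariate contrast:
\[ \mathrm{VCdim}\bigl(\{\, x \mapsto \operatorname{sign}(x_i - t) : 1 \le i \le d,\ t \in \mathbb{R} \,\}\bigr) = \Theta(\log d) \]

% A fixed tree structure with N internal nodes is a fixed Boolean formula
% over N node predicates; the standard bound on Boolean combinations of
% concept classes (Blumer et al., 1989) then gives, for a node class of
% VC dimension V,
\[ \mathrm{VCdim}(\text{trees with } N \text{ internal nodes}) = O\bigl(NV \log N\bigr) \]

The last form grows with both the tree complexity N and the per-node parameter count (through V), which is the qualitative behavior the abstract attributes to Theorem 2.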
Source
《计算机仿真》 (Computer Simulation), CSCD, 2005, No. 2, pp. 74-78 and 126 (6 pages in total)
Keywords
Decision tree
VC dimension
Statistical learning theory