摘要
提出一种基于双支持向量机的偏二叉树多类分类算法,偏二叉树双支持向量机多类分类算法.该算法综合了二叉树支持向量机和双支持向量机的优势,实现了在不降低分类性能的前提下,大大缩短训练时间.理论分析和UCI(University of California Irvine)机器学习数据库数据集上的实验结果共同证明,偏二叉树双支持向量机多类分类算法在训练时间上具有绝对的优势,尤其在处理稍大数据集的多类分类问题时,这一优势尤为突出;实验仿真结果还证明,在采用非线性核时,该算法取得了比基于经典支持向量机的一对其余多类分类算法及二叉树支持向量机更好的分类效果;同时该算法还解决了后两种算法可能存在的样本不平衡问题,以及基于经典支持向量机的一对其余多类分类算法可能存在的不可分区域问题.
A new algorithm for multiclass classification problem is presented in this paper.This algorithm,referred here as PBT-TSVM(partial binary tree and twin support vector machines),is a combination of the advantages of binary tree support vector machines(BT-SVM) with those of twin support vector machines(TSVM).Theoretical analysis and experimental results on UCI datasets prove that our PBT-TSVM algorithm not only significantly reduces the training time especially on large datasets,but also gets better classification accuracy via nonlinear kernel functions than 1-v-r SVM(one-versus-rest support vector machines) and BT-SVM.At the same time,our algorithm solves the potential problem of unbalanced dataset of 1-v-r SVM and BT-SVM when dealing with multiclass classifications.Furthermore,it solves the potential problem existing in 1-v-r SVM that some samples cannot be classified.
出处
《南京大学学报(自然科学版)》
CAS
CSCD
北大核心
2011年第4期354-363,共10页
Journal of Nanjing University(Natural Science)
基金
中央高校基本科研业务费专项资金(GK200901006)
中央高校基本科研业务费专项资金(GK201001003)
陕西省自然科学基础研究计划(2010JM3004)
关键词
双支持向量机
偏二叉树支持向量机
支持向量机
多类分类
multiclass classification
binary tree support vector machines
twin support vector machines
support vector machines