期刊文献+

基于D-S证据理论的不完整数据混合分类算法 被引量:14

A D-S Evidence Reasoning Based Hybrid Classification Algorithm for Incomplete Data
原文传递
导出
摘要 针对传统不完整数据插补聚类算法未考虑插补值对类中心的影响以及不完整样本建模带来的不确定性等问题,提出了一种基于D-S证据理论的不完整数据混合分类算法.首先,利用经典软聚类算法对数据集中的完整样本进行聚类并选择训练样本,再根据剩余样本已知属性构建若干训练集,并利用基础分类器分类;然后在D-S证据理论下,将属于若干个类别概率相近的样本划分到相应复合类以降低误分类率;最后,对处于复合类中的不完整样本,分别在构成其复合类的单类中进行K近邻插补并分类,将若干个分类结果自适应融合以决定这些样本的最终类别.模拟数据集和UCI数据集验证表明,算法能够合理地表征由缺失值引起的不确定性,降低了误分率. To address the problems of the traditional incomplete data imputation clustering algorithm,which does not consider the influence of imputation on the class center and the uncertainty caused by incomplete sample modeling,a hybrid classification algorithm for incomplete data based on the D-S evidence theory(HCA)is proposed.First,the classical soft clustering algorithm is used to cluster the complete samples in the dataset and select the training samples.Then,several training sets are constructed on the basis of the known attributes of the remaining samples,and the basic classifiers are used to classify them.Under the D-S evidence theory,samples belonging to several classes with similar probability are divided into corresponding metaclasses to reduce the misclassification rate.Finally,the incomplete samples in the metaclasses are classified after imputing by K-nearest neighbor to their hard-to-distinguish classes,and several classification results are adaptively fused to determine the final class of these samples.The validation of the simulated datasets and UCI standard datasets show that the algorithm can reasonably represent the uncertainty caused by missing values and reduce the error rate.
作者 段中兴 毕瀚元 张作伟 DUAN Zhongxing;BI Hanyuan;ZHANG Zuowei(School of Information&Control Engineering,Xi'an University of Architecture and Technology,Xi'an 710055,China;State Key Laboratory of Green Building in Western China,Xi'an 710055,China;School of Automation,Northwestern Polytechnical University,Xi'an 710072,China)
出处 《信息与控制》 CSCD 北大核心 2020年第4期455-463,471,共10页 Information and Control
基金 国家自然科学基金资助项目(51678470)。
关键词 不完整数据 聚类 D-S证据理论 不确定性 多源信息融合 incomplete data clustering D-S evidence theory uncertainty multi-source information fusion
  • 相关文献

参考文献11

二级参考文献98

共引文献640

同被引文献149

引证文献14

二级引证文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部