期刊文献+

嵌套删失数据期望最大化的高斯混合聚类算法 被引量:5

Adapted Expectation Maximization Algorithm for Gaussian Mixture Clustering With Censored Data
下载PDF
导出
摘要 针对聚类问题中的非随机性缺失数据,本文基于高斯混合聚类模型,分析了删失型数据期望最大化算法的有效性,并揭示了删失数据似然函数对模型算法的作用机制.从赤池弘次信息准则、信息散度等指标,比较了所提出方法与标准的期望最大化算法的优劣性.通过删失数据划分及指示变量,推导了聚类模型参数后验概率及似然函数,调整了参数截尾正态函数的一阶和二阶估计量.并根据估计算法的有效性理论,通过关于得分向量期望的方程得出算法估计的最优参数.对于同一删失数据集,所提出的聚类算法对数据聚类中心估计更精准.实验结果证实了所提出算法在高斯混合聚类的性能上优于标准的随机性缺失数据期望最大化算法. To provide a solution for clustering with data of missing not at random, this paper provided the efficiency analysis on the adapted expectation-maximization(EM) algorithm for Gaussian mixture clustering model with censored data. We also revealed the impact mechanism of the likelihood function of censored data on the clustering model and its estimation algorithm. With Akaike′s information criterion and Kullback-Leibler divergence,the performance of the proposed algorithm was compared with the standard EM algorithm. Based on data partition and the indicating variables of the censored data set, the paper proposed derived the posterior and likelihood function of the parameters, and adjusted its first and second moments of the truncated normal functions. According to the principles of efficient influence function, the optimal parameters of the algorithm are obtained by the equation of the expectation of the score vector. For the censored data, the proposed clustering algorithm is more accurate in estimating its centroids. The experimental results demonstrated that the proposed algorithm in Gaussian mixture clustering outperformed the standard EM algorithm, which was designed for the data of missing at random.
作者 余海燕 陈京京 邱航 王永 王若凡 YU Hai-Yan;CHEN Jing-Jing;QIU Hang;WANG Yong;WANG Ruo-Fan(Chongqing Key Laboratory of Electronic Commerce and Mod-ern Logistics,Chongqing University of Posts and Telecomms.,Ch-ongqing 404615;School of Computer Science and Engineering,University of Electronic Science and Technology,Chengdu 611731;Big Data Research Center,University of Electronic Science and Technology,Chengdu 611731;School of Information Technology Engineering,Tianjin University of Technology a nd Education,Tianjin 300222)
出处 《自动化学报》 EI CAS CSCD 北大核心 2021年第6期1302-1314,共13页 Acta Automatica Sinica
基金 国家自然科学基金(71601026,61601331,71571105) 重庆市产业类重大主题专项(cstc2017zdcy-zdzxX0013) 四川省重点研发项目(2018SZ0114,2019YFS0271) 天津市自然科学基金青年项目(18JCQNJC04700)资助。
关键词 高斯混合聚类 删失数据 期望最大化算法 截尾正态函数 二阶估计量 Gaussian mixture clustering censored data expectation-maximization truncated normal function second order moment
  • 相关文献

参考文献7

二级参考文献72

  • 1SUN R, Robust reasoning: integrating rule-based and similarity- based reasoning[J]. Artificial Intelligence, 1995, 75(2): 241-295.
  • 2MARLING C R, PFTOT G J, STERLING g S. Integrating case-based and rule-based reasoning to meet multiple design constraints[J]. Computational Intelligence, 1999, 15(3): 308-332.
  • 3BAKER J W, SCHUBERT M, FABER M H. On the assessment ofrobustness[J]. Structural SafeO,, 2008, 30(3): 253-267.
  • 4GOLDING A, RROSENBLOOM P S. Improving accuracy by combining rule-based and case-based reasoning[J]. Artificial Intelligence, 1996, 87(1-2): 215-254.
  • 5ROSSILLE D, LAURENT J F, BURGUN A. Modeling a decision- support system for oncology using rule-based and case-based reasoning methodologies[J]. International Journal of Medical Infbrmatics, 2005, 74(2[): 299-306.
  • 6PRENTZAS J, HATZILYGEROUDIS I. Categorizing approaches combining rule-based and case-based reasoning[J]. Expert Systems, 2007, 24(2): 97-122.
  • 7KUMAR K A, SINGH Y, SANYAL S. Hybrid approach using case-based reasoning and rule-based reasoning for domain independent clinical decision support in ICU[J]. Expert Systems with Applications, 2009, 36(1 ): 65-71.
  • 8TUNG Y H, TSENG S S, WENG J F, et al. A rule-based CBR approach for expert finding and problem diagnosis[J]. Expert System with Applications, 2010, 37(3): 2 427-2 438.
  • 9LUENGO J, HERRERA F. Domains of competence of fuzzy rule based classification systems with data complexity measures: A case of study using a fuzzy hybrid genetic based machine learning method[J]. Fuzzy Sets and Systems, 2010, 16l(1): 3-19.
  • 10COUDRAY N, BUESSLER J I_, URBAN J P. Robust threshold estimation for images with unimodal histograms[J]. Pattern Recognition Letters, 2010, 31(9): 1 010-1 019.

共引文献26

同被引文献50

引证文献5

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部