Manifold structure based multi-exemplar affinity propagation (基于流形结构的多聚类中心近邻传播聚类算法)

Abstract: Multi-exemplar affinity propagation (MEAP) often fails to produce good clustering results on arbitrarily shaped data sets with a manifold distribution structure. To address this, the paper designs a new similarity measure based on the idea of manifold learning. The measure increases the similarity between data points lying on the same manifold and decreases the similarity between data points on different manifolds, so that the similarity matrix accurately reflects the intrinsic manifold structure of the data set. Combining this measure with MEAP yields MS-MEAP (Manifold Structure based Multi-Exemplar Affinity Propagation), which extends the algorithm's ability to handle arbitrarily shaped data sets with manifold structure and also improves its running efficiency. Experiments were carried out on artificial data sets and the USPS handwritten digits data set. The simulation results and the analysis of the algorithm's effectiveness show that MS-MEAP clusters such data better than the original MEAP algorithm.
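The abstract does not give the formula for the new similarity measure, so the following is only an illustrative sketch of the general idea, not the paper's definition. It assumes a minimax (bottleneck) path distance as a stand-in: the distance between two points is the smallest "largest hop" over all paths connecting them, so points joined by a chain of close neighbours on one manifold become similar, while points on separate manifolds stay dissimilar. The function names (`euclidean_distances`, `minimax_path_distances`, `manifold_similarity`) and the toy data are hypothetical.

```python
import numpy as np


def euclidean_distances(X):
    """Pairwise Euclidean distance matrix for the rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def minimax_path_distances(D):
    """Bottleneck (minimax) path distance on the complete graph with weights D.

    The distance between i and j is the smallest achievable largest hop
    over all paths from i to j, computed with a Floyd-Warshall style update.
    """
    M = D.copy()
    for k in range(M.shape[0]):
        # A path routed through k costs the larger of its two halves.
        via_k = np.maximum(M[:, k:k + 1], M[k:k + 1, :])
        M = np.minimum(M, via_k)
    return M


def manifold_similarity(X):
    """Hypothetical manifold-aware similarity: negative minimax path distance."""
    return -minimax_path_distances(euclidean_distances(X))


# Toy data (an assumption): two parallel line segments stand in for two manifolds.
t = np.arange(21) * 0.2
line_a = np.column_stack([t, np.zeros_like(t)])  # indices 0..20
line_b = np.column_stack([t, np.ones_like(t)])   # indices 21..41
X = np.vstack([line_a, line_b])

s_euclid = -euclidean_distances(X)   # plain negative-distance similarity
s_manifold = manifold_similarity(X)  # manifold-aware similarity
```

With plain Euclidean similarity, the first point of `line_a` is closer to the first point of `line_b` than to the far end of its own segment. Under the sketched measure the ordering flips, which is the behaviour the abstract describes. A matrix such as `s_manifold` could then be passed to an affinity-propagation style algorithm in place of the usual similarity matrix.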

Authors: Chen Leilei (陈雷雷), Ge Hongwei (葛洪伟), Yang Jinlong (杨金龙), Yuan Yunhao (袁运浩)

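Affiliation: Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), School of Internet of Things Engineering, Jiangnan University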

Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list, 2016, No. 6, pp. 67-73 (7 pages)

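Funding: National Natural Science Foundation of China (No. 61305017, No. 60975027); Natural Science Foundation of Jiangsu Province (No. BK20130154); Priority Academic Program Development of Jiangsu Higher Education Institutions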

Keywords: affinity propagation; multi-exemplar affinity propagation; density-based clustering; manifold structure; similarity measure

Classification: TP301.6 [Automation and Computer Technology: Computer System Architecture]

