分类数据的聚类边界检测技术被引量：5

Cluster boundary detection technology for categorical data

下载PDF

导出

摘要随着分类属性数据集的应用越来越广泛,获取含有分类属性数据集的聚类边界的需求也越来越迫切。为了获取聚类的边界,在定义分类数据的边界度和聚类边界的基础上,提出了一种带分类属性数据的聚类边界检测算法——CBORDER。该算法首先利用随机分配初始聚类中心和边界度对类进行划分并获取记录边界点的证据,然后运用证据积累的思想多次执行该过程来获取聚类的边界。实验结果表明,CBORDER算法能有效地检测出高维分类属性数据集中聚类的边界。 With the wide application of categorical-attribute dataset,the demand of obtaining the cluster boundary of categorical-attribute dataset becomes more and more urgent.In order to get cluster boundaries,a categorical-attribute data boundary detection algorithm： CBORDER（Categorical dataset BORDER detection algorithm） was proposed.In this algorithm,firstly,this paper initialized the center of cluster by using random allocation and utilizing boundary-degree to partition clusters;at the same time,the evidence of captured boundary records was got.Then,based on the evidence accumulation,the above procedure was executed repeatedly to acquire the boundaries of clusters at the end.The experimental results demonstrate that CBORDER can effectively detect the boundaries of the high-dimensional categorical data.

作者邱保志王波

机构地区郑州大学信息工程学院

出处《计算机应用》 CSCD 北大核心 2012年第6期1654-1656,1669,共4页 journal of Computer Applications

基金河南省重点科技攻关项目(112102310073) 河南省教育厅自然科学研究计划项目(2009A520028)

关键词边界度证据积累聚类边界分类数据 boundary-degree evidence accumulation cluster boundary categorical data

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献12

1ESTER M, KRIEGEL H P, SANDER J. A density-based algorithm for discovering clusters in large spatial databases with noise[ C]// Proceedings of the 2nd International Conference on Knowledge Dis- covery and Data Mining. Oregon, Portland: AAAI Press, 1996: 226 - 231.
2XIA CHENYI, HSU WYNNE, LEE MONGLI, et al. BORDER: ef- ficient computation of boundary points[ J]. Knowledge and Data En- gineering, 2006, 18(3) : 289 -303.
3QIU BAOZHI, YUE FENG, SHEN JUNYI. BRIM: A efficient boundary points detecting algorithm[ C]// Proceedings of Advances in Knowledge Discovery and Data Mining. Berlin: Springer-Verlag, 2007:761-768.
4邱保志,刘洋,陈本华.基于网格熵的边界点检测算法[J].计算机应用,2008,28(3):732-734. 被引量：7
5邱保志,岳峰.基于引力的边界点检测算法[J].小型微型计算机系统,2008,29(2):279-282. 被引量：3
6邱保志,曹鹤玲.一种高效的基于联合熵的边界点检测算法[J].控制与决策,2011,26(1):71-74. 被引量：6
7NOSOVSKIY G V, LIU DONGQUAN, SOURINA O. Automatic clustering and boundary detection algorithm based on adaptive influ- ence function[ J]. Pattern Recognition, 2008, 41 (9) : 2757 - 2776.
8BARBARA D, COUTO J, LI YI. COOLCAT: an entropy-based algorithm for categorical clustering [ C ]// Proceedings of the eleventh International Conference on Information and Knowledge Management. New York: ACM Press, 2002:4-9.
9FRED A L N, JAIN A K. Data clustering using evidence accumulation [ C ]// Proceedings of the 16th International Conference on Pattern Recognition. Washington, DC : IEEE Computer Society, 2002 : 276 - 280.
10FRED A L N, JAIN A K. Evidence accumulation clustering based on the k-means algorithm [ C ]// Proceedings of the Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition. London : Springer-Verlag, 2002:303 - 333.

二级参考文献16

1邱保志,沈钧毅.网格聚类中的边界处理技术[J].模式识别与人工智能,2006,19(2):277-280. 被引量：13
2邱保志,张西芝.基于网格的参数自动化聚类算法[J].郑州大学学报（工学版）,2006,27(2):91-93. 被引量：14
3邱保志,沈钧毅.基于扩展和网格的多密度聚类算法[J].控制与决策,2006,21(9):1011-1014. 被引量：25
4Han J W, Kamber M. Data mining: Concepts and techniques[M]. 2nd ed. New York: Morgan Kaufmann, 2006: 384.
5Xia C Y, Hsu W, Lee M L, et al. Border: Efficient computation of boundary point[J]. IEEE Trans on Knowledge and Data Engineering, 2006, 18(3): 289-303.
6Ester M, Kriegel H P, Sander J. A density-based algorithm for discovering clusters in large spatial databases with noise[C]. Proc of the 2nd Int Conf on Knowledge Discovery and Data Mining. Portland: AAAI Press, 1996: 226-231.
7Qiu B Z, Yue F, Shen J Y. BRIM: An efficient boundary points detecting algorithm[C], Proc of Advances in Knowledge Discovery and Data Mining. Heidelberg: SDrin~,er, 2007: 761-768.
8Karypis G, Han E H, Kumar V. Chameleon: A hierarchical clustering algorithm using dynamic modeling[J]. IEEE Computer, 1999, 32(8): 68-75.
9Hsu C M, Chen M S. Subspace clustering of high dimensional spatial data with noises[C]. Proc of Advances in Knowledge Discovery and Data Mining. Heidelberg: Springer, 2004: 31-40.
10Han J W Kamber M 范明孟小峰译.数据挖掘概念与技术[M].北京:机械工业出版杜,2001.147-158.

共引文献10

1杨竹苹,黄琦志,梁海珍,陈琪.一种可用于数据集优化的网格相似度聚类算法研究[J].军事交通学院学报,2010,12(3):77-80.
2何佃伟,杨承志,张荣,吴宏超.一种基于改进网格聚类的雷达信号分选算法[J].雷达与对抗,2011,31(2):43-45. 被引量：11
3江先伟.基于网格聚类中边界点的处理[J].科技视界,2012(34):67-67.
4邱磊,杨承志,何佃伟.一种新的基于网格聚类的雷达信号预分选算法[J].现代防御技术,2013,41(2):167-172. 被引量：5
5邱保志,王有为.基于二路生成树的聚类边界检测算法[J].计算机应用与软件,2013,30(10):130-132. 被引量：1
6王桂芝,王广亮.基于闭合曲线边界的聚类算法研究[J].河南科学,2013,31(9):1391-1395. 被引量：1
7张西芝,李涛,刘敏娟.基于距离和密度的多阶段聚类[J].现代计算机（中旬刊）,2014(1):15-18.
8李向丽,耿鹏,邱保志.混合属性数据集的聚类边界检测技术[J].控制与决策,2015,30(1):171-175. 被引量：5
9张勇强,汤建龙.基于数字信道化接收机的聚类分选算法[J].中国电子科学研究院学报,2017,12(2):143-148. 被引量：4
10周传华,任太娇,罗岚,周昊.基于联合熵的非平衡数据边界混合重采样[J].计算机与现代化,2024(9):95-100.

同被引文献41

1於跃成,刘彩生,生佳根.分布式约束一致高斯混合模型[J].南京理工大学学报,2013,37(6):799-806. 被引量：3
2蒋盛益,李庆华.一种基于引力的聚类方法[J].计算机应用,2005,25(2):286-288. 被引量：9
3FAHIM A.M,SALEM A.M,TORKEY F.A,RAMADAN M.A.An efficient enhanced k-means clustering algorithm[J].Journal of Zhejiang University-Science A(Applied Physics & Engineering),2006,7(10):1626-1633. 被引量：30
4Tan P N, Michael Steinbach, Vipin K'umar. Introduction to data mining[M]. New Jersey: Pearson Education, 2007: 305-402.
5Xia C, Hsu W, Lee M L, et al. BORDER: An efficient computation of boundary points[J]. IEEE Trans onKnowledge and Data Engineering, 2006, 18(3): 289-303.
6Ester M, Kriegel H P, Sander J, et al. A density-based algorithm for discovering clusters in large spatial databases with noise[C]. Int Conf on Knowledge Discovery and Data Mining. Portland: ACM, 1996: 226-231.
7Qiu B Z, Yue F, Shen J Y. BRIM: An efficient boundary points detecting algorithm[C]. Advances in Knowledge Discovery and Data Mining. Berlin: Springer, 2007: 761- 768.
8Qiu B Z, Wang S. A boundary detection algorithm of clusters based on dual threshold segmentation[C]. The 7th Int Conf on Computational Intelligence and Security(CIS). Sanya: IEEE, 2011: 1246-1250.
9Desoer C A. Slowly varying system z = A(t)z[J]. IEEE Trans on Automatic Control, 1969, 14(6): 780-781.
10Datta Souptik,Giannella Chris,Kargupta Hillol.Approximate Distributed K-Means Clustering over a Peer-to-Peer Network[J].IEEE Transactions on Knowledge and Data Engineering,2009,21(10):1372-1388.

引证文献5

1韩海.逼近法确定球形簇的球心与半径[J].江汉大学学报（自然科学版）,2013,41(5):62-64. 被引量：3
2刘子龙,胡少凯,蒋辰飞,韩光鲜.基于ARM和计算机视觉的餐厅快速结算系统设计[J].信息技术,2013,37(11):80-84. 被引量：5
3李向丽,耿鹏,邱保志.混合属性数据集的聚类边界检测技术[J].控制与决策,2015,30(1):171-175. 被引量：5
4盛华,张桂珠.一种融合K-means和快速密度峰值搜索算法的聚类方法[J].计算机应用与软件,2016,33(10):260-264. 被引量：13
5邢瑞康,李成海.改进的聚类算法在入侵检测系统中的应用[J].火力与指挥控制,2019,44(2):124-128. 被引量：8

二级引证文献34

1吕敬.基于RFID技术的智能餐盘在高校食堂中的应用[J].信息系统工程,2014,27(9):85-85. 被引量：4
2崔桢,宋欢,郑亚楠,张晓晞.最优节约智能餐盘设计[J].价值工程,2015,34(3):321-322. 被引量：1
3张震宇,汪洋,张家龙.基于OpenCV的餐厅自动化结算研究[J].浙江科技学院学报,2017,29(3):189-194. 被引量：2
4杨佳润.数据挖掘之聚类分析算法综述[J].通讯世界,2017,23(16):291-291. 被引量：9
5李向丽,曹晓锋,邱保志.基于矩阵模型的高维聚类边界模式发现[J].自动化学报,2017,43(11):1962-1972. 被引量：4
6张晓栋,董宝田,陈光伟.基于BIRCH-LKD的在站车辆中时异常检测算法[J].北京理工大学学报,2017,37(11):1122-1128. 被引量：1
7杜洪波,白阿珍,朱立军.改进的K-means融合微粒群优化的基因选择方法[J].沈阳工程学院学报（自然科学版）,2018,14(1):66-70. 被引量：1
8操小伟,曹燕,汪文明,钱萌,黄师化,陈子燕.基于物联网技术的食堂智能结算与信息管理系统设计[J].安庆师范大学学报（自然科学版）,2017,23(4):53-56.
9邹臣嵩,杨宇.基于最大距离积与最小距离和协同K聚类算法[J].计算机应用与软件,2018,35(5):297-301. 被引量：15
10刘絮雨,张相芬,马燕,李传江,杨燕勤.基于改进空间模糊聚类的DTI图像分割算法[J].中国生物医学工程学报,2018,37(4):394-403. 被引量：9

1张显全,苏勤,蒋联源,李国祥.一种快速的随机Hough变换圆检测算法[J].计算机工程与应用,2008,44(22):62-64. 被引量：22
2蒋联源,苏勤,祝英俊.快速随机Hough变换多圆检测算法[J].计算机工程与应用,2009,45(17):163-166. 被引量：20
3罗钧,李锐,陈伟民.基于全局搜索和证据积累的多圆检测方法[J].光学学报,2010,30(9):2608-2612. 被引量：1
4刘斌.一种可还原的口令加密方法[J].现代计算机,1997(3):47-48.
5赖思渝,李明东.一种可逆加密算法在 MIS 中的应用[J].中国民航飞行学院学报,2005,16(5):25-27.
6方菲,耿春明.基于Hough变换运用形状角及梯度检测圆[J].机械工程与自动化,2015(1):135-137. 被引量：6
7蒋联源.随机圆检测快速算法[J].光电工程,2010,37(1):70-75. 被引量：12
8赵训坡,胡占义.一种实用的基于证据积累的图像曲线粗匹配方法[J].计算机学报,2005,28(3):357-367. 被引量：11
9邓峰.多跳网络中分类属性数据模糊聚类仿真[J].计算机仿真,2017,34(1):292-295. 被引量：12
10李向丽,耿鹏,邱保志.混合属性数据集的聚类边界检测技术[J].控制与决策,2015,30(1):171-175. 被引量：5

计算机应用

2012年第6期

浏览历史

内容加载中请稍等...

分类数据的聚类边界检测技术被引量：5

参考文献12

二级参考文献16

共引文献10

同被引文献41

引证文献5

二级引证文献34

相关作者

相关机构

相关主题

浏览历史

分类数据的聚类边界检测技术 被引量：5

参考文献12

二级参考文献16

共引文献10

同被引文献41

引证文献5

二级引证文献34

相关作者

相关机构

相关主题

浏览历史

分类数据的聚类边界检测技术被引量：5