期刊文献+

Stable Initialization Scheme for K-Means Clustering 被引量:15

Stable Initialization Scheme for K-Means Clustering
原文传递
导出
摘要 Though K-means is very popular for general clustering, its performance, which generally converges to numerous local minima, depends highly on initial cluster centers. In this paper a novel initialization scheme to select initial cluster centers for K-means clustering is proposed. This algorithm is based on reverse nearest neighbor (RNN) search which retrieves all points in a given data set whose nearest neighbor is a given query point. The initial cluster centers computed using this methodology are found to be very close to the desired cluster centers for iterative clustering algorithms. This procedure is applicable to clustering algorithms for continuous data. The application of the proposed algorithm to K-means clustering algorithm is demonstrated. An experiment is carried out on several popular datasets and the results show the advantages of the proposed method. Though K-means is very popular for general clustering, its performance, which generally converges to numerous local minima, depends highly on initial cluster centers. In this paper a novel initialization scheme to select initial cluster centers for K-means clustering is proposed. This algorithm is based on reverse nearest neighbor (RNN) search which retrieves all points in a given data set whose nearest neighbor is a given query point. The initial cluster centers computed using this methodology are found to be very close to the desired cluster centers for iterative clustering algorithms. This procedure is applicable to clustering algorithms for continuous data. The application of the proposed algorithm to K-means clustering algorithm is demonstrated. An experiment is carried out on several popular datasets and the results show the advantages of the proposed method.
出处 《Wuhan University Journal of Natural Sciences》 CAS 2009年第1期24-28,共5页 武汉大学学报(自然科学英文版)
基金 Supported by the National Natural Science Foundation of China (60503020, 60503033, 60703086) the Natural Science Foundation of Jiangsu Province (BK2006094) the Opening Foundation of Jiangsu Key Labo-ratory of Computer Information Processing Technology in Soochow University ( KJS0714) the Research Foundation of Nanjing University of Posts and Telecommunications (NY207052, NY207082)
关键词 CLUSTERING unsupervised learning K-MEANS INITIALIZATION clustering unsupervised learning K-means initialization
  • 相关文献

参考文献10

  • 1Thiesson B,Meck C,Chickering D M, et al.Learning Mix- tures of Bayesian Networks[]..1997
  • 2Su T.Another Look at Non-Random Methods for Initializing K-Means Clustering[]..2003
  • 3Greene D,Cunningham P.Producing Accurate Interpretable Clusters from High-Dimensional Data[].Proceedings of the th European Conference on Principles and Practice of Knowledge Discovery in Databases.2005
  • 4Blake C L,Merz C J.UCI Repository of Machine Learning Database[].htpp://wwwicsuci edu/ MLRepositoryhtml.2008
  • 5MacQueen J.Some methods for classification and analysis of multivariate observations[].In: Proceedings of the th Berkeley Symposium on Mathematical Statistics and Probability.1967
  • 6Fukunaga K.Introduction to statistical pattern recognition[]..1990
  • 7Jain,R Dubes.Algorithms for Clustering Data[]..1988
  • 8Duda RO,Hart PE.Pattern Classification and Scene Analysis[]..1973
  • 9Bradley,PS,Mangasarian,OL,Street,WN,Mozer,MC,Jordan,MI,Petsche,T.Clustering via concave minimization[].Advances in Neural Information Processing Systems.1997
  • 10Bradley,P.S.,and Fayyad,U.M.Refining initial points for k-means clustering[].Proceedings of the th international conference on machine learning.1998

同被引文献98

引证文献15

二级引证文献94

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部