
Optimization of Node Threshold in Text Clustering Based on Dynamic Indexing Tree
Abstract: Text clustering is an important research branch of clustering and is widely used in text processing. After describing the text clustering methods based on the clustering feature tree and the dynamic index tree, this paper changes the merge threshold of the original dynamic index tree method from a single linear dependency to one that depends on the radius of the cluster node. Experiments show that the improved algorithm clearly improves both the precision of the clustering results and the clustering time.
Author: Wang Lifeng (王利峰)
Affiliation: Donghua University
Source: Computer Development & Applications (《电脑开发与应用》), 2010, No. 9, pp. 62-65 (4 pages)
Keywords: dynamic index tree; threshold; hierarchical clustering; node radius
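
The abstract's central idea, a merge threshold that depends on the radius of a cluster node, can be illustrated with a minimal sketch. This record does not give the paper's actual formula, so everything below is an assumption made for illustration: the `ClusterFeature` class follows the standard BIRCH-style clustering feature (count, linear sum, sum of squared norms), and the `should_merge` rule with its `base` and `scale` parameters is hypothetical, not the author's method.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class ClusterFeature:
    """BIRCH-style clustering feature: point count, linear sum, sum of squared norms."""
    n: int
    ls: List[float]
    ss: float

    @classmethod
    def from_point(cls, point: Sequence[float]) -> "ClusterFeature":
        return cls(1, list(point), sum(x * x for x in point))

    def merged(self, other: "ClusterFeature") -> "ClusterFeature":
        # Clustering features are additive, so merging is a component-wise sum.
        return ClusterFeature(
            self.n + other.n,
            [a + b for a, b in zip(self.ls, other.ls)],
            self.ss + other.ss,
        )

    def centroid(self) -> List[float]:
        return [x / self.n for x in self.ls]

    def radius(self) -> float:
        # Root-mean-square distance of member points from the centroid.
        variance = self.ss / self.n - sum(x * x for x in self.centroid())
        return math.sqrt(max(variance, 0.0))  # guard against rounding below zero


def should_merge(a: ClusterFeature, b: ClusterFeature,
                 base: float = 0.5, scale: float = 1.0) -> bool:
    """Hypothetical rule: the merge threshold grows with the larger node radius."""
    threshold = base + scale * max(a.radius(), b.radius())
    return math.dist(a.centroid(), b.centroid()) <= threshold


if __name__ == "__main__":
    node = ClusterFeature.from_point([0.0, 0.0]).merged(ClusterFeature.from_point([2.0, 0.0]))
    print(node.centroid(), node.radius())
    print(should_merge(node, ClusterFeature.from_point([2.0, 0.0])))
    print(should_merge(node, ClusterFeature.from_point([10.0, 0.0])))
```

Setting `scale=0` reduces the rule to a fixed threshold, which gives a simple baseline for comparing against the radius-dependent variant.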
