摘要
聚类作为一种自动化程度较高的无监督机器学习方法,近年来在信息检索、多文档自动文摘、智能搜索引擎、短文本信息处理等领域获得了广泛的应用。本文首先讨论了文本聚类(Text clustering)的应用,然后对文本聚类算法、聚类关键技术进行了综述。
As an unsupervised machine learning method, text clustering has been widely used in some applications such as information retrieval, automatic multi -document summarization, intelligent search engine, and the short texts information processing etc,. In this paper the application of text clustering is discussed firstly, and then some related problems, including clustering algorithm and clustering key technologies, are surveyed.
出处
《河池学院学报》
2008年第2期86-91,共6页
Journal of Hechi University
基金
广西教育科学"十一五"规划课题资助项目(编号:2006A-E004)
河池学院科研基金资助项目(编号:2007B-N004)
关键词
文本聚类
综述
降维
聚类算法
聚类评价
text clustering
survey
dimension reduction
clustering algorithm
clustering evaluation-