期刊文献+

基于ESA的文本分类算法研究

Text Categorization Research Based on ESA
下载PDF
导出
摘要 本文借助中文维基百科知识库,提出基于ESA算法的文本分类算法.并选取2015年3月5日在中文维基百科网站下载的主题文章,对其进行适当处理,将处理结果作为该算法使用的语义概念知识库.在复旦大学中文文本分类语料上显示的实验结果表明,该方法比纯粹的词袋模型方法效果要好. On the basis of the Chinese Wikipedia, text categorization based on ESA is studied in this paper. We used Chinese Wikipedia snapshot as of March 5, 2015, and processed Wikipedia XML dump as the semantic knowledge base of concept. Experimental results on the corpus of Chinese text categorization of Fudan University show that, this method is better than BOW-based methods.
作者 刘海静
出处 《洛阳师范学院学报》 2016年第2期68-71,共4页 Journal of Luoyang Normal University
基金 太原工业学院科学基金项目(2015LQ17)
关键词 ESA 文本分类 特征生成 ESA text categorization feature generation
  • 相关文献

参考文献6

二级参考文献27

  • 1王元珍,钱铁云,冯小年.基于关联规则挖掘的中文文本自动分类[J].小型微型计算机系统,2005,26(8):1380-1383. 被引量:13
  • 22008年第二次手机短信息状况调查报告[EB/OL].http://www.12321.cn/viewnews.php?id=10753.
  • 3Healy,M Delany,S,Zamolotskikh,A.An Assessment of Case Base Reasoning for Short Text Message Classification[C].In:Norman Creaney (ed.) Proceedings of the 16th Irish Conference on Artificial Intelligence & Cognitive Science (AICS'05),257-266,2005.
  • 4Zelikovitz,S,Marquez,F.Transductive Learning for Short-Text Classification Problems using Latent Semantic Indexing[J].International Journal of Pattern Recognition and Artificial Intelligence,Vol.19(2),143-163,2005.
  • 5Zelikovitz,S.Transductive LSI for Short Text Classification Problems[C].In:Proceedings of the 17th International Flairs Conference,556-561,2004.
  • 6Han Jia-wei,Pei Jian,Yin Yi-wen.Minning Frequent Patterns Without Candidate Generation[C].In:Chen Wei-dong,Jeffrey F M,Philip A B.Proceedings of the 2000 ACM Sigmod Internal Conference on Management of Data.Dallas,Texas:ACM Press,2000.1-12.
  • 7中文停用词表[EB/OL].http://download.csdn.net/source.
  • 8Leacock C,Chodorow M.Combining Local Context and WordNet Similarity for Word Sense Identification[EB/OL].(1998-05-18).http://www.bibsonomy.org/bibtex/2087c974c471792ddlfa536aa6a 75eobc/asalber.
  • 9Resnik P Using Information Content to Evaluate Semantic Similarity in a Taxonomy[C]//Proc.of the 14th International Joint Conference on Artificial Intelligence.[S.l.]:Springer,1995:448-453.
  • 10Struve M,Ponzetto S P.WikiRelate!Computing Semantic Relatedness Using Wikipedia[C]//Proc.of Association for the Advancement of Artificial Intelligence.Boston,USA:IEEE Press,2006:1419-1424.

共引文献54

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部