期刊文献+

VSM的权重改进对文档相似度的影响研究 被引量:3

Research of Documents Similarity Influence Based on Improved VSM Weight
下载PDF
导出
摘要 向量空间模型是以索引项权重为核心的模型,索引项权重对文本分类、检索等的效果起着重要的作用。文中使用了一个基于关键词的权重,并利用它改进传统向量空间模型的权重算法。改进后的模型综合考虑原有索引项权重和文档中关键词的权重。在特定领域FAQ的检索中作测试实验,结果表明,改进的方法提高了检索的查准率、查全率。 The terms weight is the core in VSM ,it plays the important role in text classification,text retrieval,etc. A new weight based on key is put forward, so as to improve the weight formula of VSM. Further more, original characteristic terms weight is also combined in the new VSM. With the test based on special domain FAQ, Experiment results show that the improved method raised the precision, recall and the F test value.
作者 苏小虎 SU Xiao-Hu (School of Computer, Anhui University of Technology, Ma'anshan 243002, China)
出处 《电脑知识与技术》 2008年第4期135-137,共3页 Computer Knowledge and Technology
关键词 向量空间模型 关键词权重 查准率 查全率 VSM key-weight precision recall
  • 相关文献

参考文献4

二级参考文献21

  • 1黄萱青 吴立德.独立于语种的文本分类方法[M].,2000.37-43.
  • 2鲁松 白硕 等.文本中词语权重计算方法的改进[M].,2000.31-36.
  • 3卜东波.聚类/分类理论研究及其在大模型文本挖掘的应用:博士论文[M].,2000..
  • 4[1]Studer R,Benjamins V R,Fensel D,et al. Knowledge engineering,principles and methods [J]. Data and Knowledge Engineering, 1998,25 (1-2): 161 - 197.
  • 5[2]Manola Frank, Miller Eric. RDF primer [EB/OL]. http ://www. w3. org/TR/rdf-primer/,2004 - 08 - 27.
  • 6[3]Genest D,Chein M. An experiment in document retrieval using conceptual graphs [A]. Proc of the Fifth International Conference on Conceptual Structures: Fulfilling Peirce′s Dream table of Contents [C]. London: SpringerVerlag, 1997. 489 - 504.
  • 7[4]Myaeng Sung H. Conceptual graph matching as a plausible inference technique for text retrieval [A]. Proc of the 5th Conceptual Structures Workshop [C]. Boston:Ma,1990. 117 - 120.
  • 8[6]Melnik Sergey. RDF API draft [EB/OL]. http://wwwdb. stanford. edu/~ melnik/rdf/api. html,2004 - 08 - 27.
  • 9[7]McBride Brian. An introduction to RDF and the Jena RDF API [EB/OL]. http ://jena. sourceforge. net/tutorial/RDF_APL/index. html,2004 - 08 - 27.
  • 10黄萱菁,2000 International Conference on Multilingual Information Processing,2000年,37页

共引文献345

同被引文献23

引证文献3

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部