期刊文献+

基于GVSM的文本相似度算法研究 被引量:4

Research on similarity algorithm of text based on GVSM
下载PDF
导出
摘要 提出了一种基于WordNet和GVSM的文本相似度算法,通过语义的路径长度和路径深度计算两个词的语义相似度,结合改进的GVSM模型计算文本相似度,并对基于TFIDF-VSM模型和本文方法进行了比较。实验结果表明,该算法取得了更好的准确率和效率。 This paper presents a text similarity algorithm based on WordNet and GVSM,computing the similarity of two words by semantics of path length and depth,combined with the improved GVSM model.Then compare the TFIDF-VSM-based model with this method.The experimental results show that this algorithm can achieve a better precision and efficiency.
出处 《微型机与应用》 2011年第3期9-11,共3页 Microcomputer & Its Applications
关键词 文本相似度 语义相似度 词网 广义向量空间模型 text similarity semantic relatedness WordNet GVSM
  • 相关文献

参考文献9

  • 1WILLETT P. Recent trends in hierarchical document clustering: a critical review. Inf Process and Manage, 1988 : 577-597.
  • 2夏天.汉语词语语义相似度计算研究[J].计算机工程,2007,33(6):191-194. 被引量:63
  • 3李峰,李芳.中文词语语义相似度计算——基于《知网》2000[J].中文信息学报,2007,21(3):99-105. 被引量:106
  • 4WONG, S. K. M. Generalized vector SIGIR ACM, 1985 Wojciech Ziarko, spaces model in Patrick C. N. Wong. information retrieval.
  • 5TSATSARONIS G, PANAGIOTOPOULOU V. A generalized vector space model for text retrieval based on semantic relatedness. Proceedings of the EACL 2009 Student Research Workshop, 2009:70-78.
  • 6SALTON, MCGILL M J. Introduction to modem information retrieval. McGraw-Hill, 1983.
  • 7VAZIRGIANNIS T M. Word sense disambiguation with spreadingactivation networks generated from thesauri [C]. In Proc. of the 20th IJCAI, 2007:1725-1730.
  • 8HALL P, PARK B U, SAMWORTH R J. Choice of neighbor order in nearest-neighbor classification. Annals of Statistics : 2008 : 2135-2152.
  • 9Qinglin Guo. The similarity computing of documents based on VSM. IEEE International Computer Software and Applications Conference. 2008:585-586.

二级参考文献15

  • 1刘亚军,徐易.一种基于加权语义相似度模型的自动问答系统[J].东南大学学报(自然科学版),2004,34(5):609-612. 被引量:36
  • 2夏天,樊孝忠,刘林,骆正华.基于ALICE的汉语自然语言接口[J].北京理工大学学报,2004,24(10):885-889. 被引量:11
  • 3吴健,吴朝晖,李莹,邓水光.基于本体论和词汇语义相似度的Web服务发现[J].计算机学报,2005,28(4):595-602. 被引量:218
  • 4刘群 李素建.基于《知网》的词汇语义相似度计算[C]..第三界汉语词汇语义研讨会[C].台北,2002..
  • 5刘群 李素建.基于《知网》的词汇语义相似度的计算[A]..第三届汉语词汇语义学研讨会[C].台北,2002..
  • 6Eneko Agirre, German Rigau. A Proposal for Word Sense Disambiguation using Conceptual Distance [A].In: Proceedings of the First International Conference on Recent Advanced in NLP [C]. 1995.
  • 7Dekang Lin. An Information-Theoretic Definition of Similarity Semantic distance in WordNet [A]. In:Proceedings of the Fifteenth International Conference on Machine Learning [C]. 1998.
  • 8HowNet [R]. HowNet's Home Page. http://www.keenage. com.
  • 9BUDANITSKY, A. AND HIRST, G. Semantic distance in WordNet : An experimental, application-oriented evaluation of five measures [A]. In: Workshop on WordNet and Other Lexical Resources, Second meeting of the North American Chapter of the Association for Computational Linguistics[C]. 2001.
  • 10同义词词林[R].http://www.ir—lab.org/.

共引文献157

同被引文献57

  • 1周晓英.论信息集合的信息构建(IA)[J].情报学报,2004,23(4):456-462. 被引量:4
  • 2张振亚,王进,程红梅,王煦法.基于余弦相似度的文本空间索引方法研究[J].计算机科学,2005,32(9):160-163. 被引量:54
  • 3张敏,耿焕同,王煦法.一种利用BC方法的关键词自动提取算法研究[J].小型微型计算机系统,2007,28(1):189-192. 被引量:19
  • 4Corley C, Mihalcea R.Measuring the semantic similarity of texts[C]//Proceedings of the ACL Workshop on Empir- ical Modeling of Semantic Equivalence and Entailment, 2005 : 13-18.
  • 5Coelho T A S, Caladl P P, Souza L V, et al.lmage retrieval using multiple evidence ranking[J].IEEE Trans on Knowl- edge and Data Engineering,2004, 16(4) :408-417.
  • 6Ko Y,Park J, Seo J.Improving text categorization using the importance of sentences[J].Information Processing and Management, 2004,40 ( 1 ) : 65-79.
  • 7Kumar N.Approximate string matching algorithm[J].Inter- national Journal on Computer Science and Engineering, 2010,2(3) :641-644.
  • 8Erkan G, Radev D.Lexrank: graph-based lexical centrality as salience in text summarization[J].Journal of Artificial Intelligence Research, 2004,22 (7) : 457-479.
  • 9Mitra M,Hadi A,Man L,et al.Sense sentiment similarity: an analysis[C]//Proceedings of the 26th AAAI Confer- ence on Artificial Intelligence,2012: 1706-1712.
  • 10Salton G, Mcgill M J.Introduction to modem informa- tion retrieval[M].New York : McGraw-Hill, 1983.

引证文献4

二级引证文献31

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部