Document Retrieval Method Based on Statistical Model and Wavelet Transform
Abstract: To address the shortcomings of several commonly used document retrieval methods, this paper proposes a new document retrieval method based on a statistical model and the wavelet transform. It differs from traditional methods in two main ways: (1) it uses the wavelet transform to move the input signal into the frequency domain for processing, which eliminates the heavy computational cost of cross-comparison operations; (2) when computing relevance, it considers both the number of occurrences and the positions of the query terms, which improves retrieval precision. Theoretical analysis and experimental results indicate that the method improves on traditional methods in both precision and query speed.
Authors: WEI Bin (魏彬), ZHANG Jun (张军), XIANG Ying (项颖); Department of Information Engineering, Guangdong University of Technology, Guangzhou 510006, China
Source: Computer Knowledge and Technology (《电脑知识与技术》), 2009, No. 3, pp. 1686-1687, 1698 (3 pages)
Keywords: document retrieval; term vector; spectral vector; wavelet transform; similarity
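
The abstract does not give the paper's formulas, so the following is only a rough sketch of the general idea it describes: turn each query term's positions in a document into a signal, transform that signal with a wavelet, and score the document from both how often and where the terms occur. Everything specific here is an assumption made for illustration, not the authors' method: the Haar wavelet, the 8 position bins, the sign-agreement weighting, and the hypothetical function names `term_signal`, `haar_dwt` and `score`.

```python
import math

def term_signal(tokens, term, bins=8):
    """Count occurrences of `term` in each of `bins` equal-width position bins."""
    signal = [0.0] * bins
    n = len(tokens)
    for i, tok in enumerate(tokens):
        if tok == term:
            signal[i * bins // n] += 1.0
    return signal

def haar_dwt(signal):
    """Full orthonormal Haar decomposition; len(signal) must be a power of two."""
    approx = list(signal)
    details = []
    while len(approx) > 1:
        pairs = list(zip(approx[0::2], approx[1::2]))
        detail = [(a - b) / math.sqrt(2) for a, b in pairs]
        approx = [(a + b) / math.sqrt(2) for a, b in pairs]
        details = detail + details  # coarser levels are placed in front
    return approx + details

def score(tokens, query_terms, bins=8):
    """Assumed scoring rule: per wavelet component, sum the coefficient
    magnitudes (driven by term counts) and weight the sum by how well the
    terms' coefficient signs agree (driven by term positions)."""
    spectra = [haar_dwt(term_signal(tokens, t, bins)) for t in query_terms]
    total = 0.0
    for coeffs in zip(*spectra):
        magnitude = sum(abs(c) for c in coeffs)
        signs = [(c > 0) - (c < 0) for c in coeffs]
        # 1.0 when every term has a nonzero coefficient of the same sign,
        # lower when signs cancel or a term contributes nothing here.
        agreement = abs(sum(signs)) / len(coeffs)
        total += agreement * magnitude
    return total

# Two toy documents with identical term counts but different term positions.
query = ["wavelet", "retrieval"]
near = ["wavelet", "retrieval"] + ["x"] * 14
far = ["wavelet"] + ["x"] * 14 + ["retrieval"]
print(score(near, query) > score(far, query))
```

In this sketch each term is transformed once per document and the spectra are combined component by component, so no pairwise comparison of term positions is needed. Both toy documents contain each query term exactly once, so any difference in score comes from position alone.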