
Study and Implementation of the Vocabulary Entry Matching System Based on Feature Weight
Abstract This paper describes the design approach, overall architecture, and implementation techniques of a vocabulary entry matching system. Building on existing theory, the Pangu word segmenter (盘古分词) is used to segment and denoise the text from which features are drawn, and a feature weight is computed for each keyword so that the best feature entries are selected. A feature matching algorithm is then applied in the detailed design and development of the system. Analysis of the experimental data indicates that the system has some practical value for assisting in the verification of book classification.
Authors Zhou Jian (周建), Gao Xiaodong (高晓东)
Source Journal of Nantong University (Natural Science Edition), CAS, 2017, No. 3, pp. 15-19 (5 pages)
Funding Jiangsu Province Modern Educational Technology Research Project (2015-R-42479)
Keywords vocabulary entry; vocabulary entry matching; feature weight; text feature
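
The abstract outlines a pipeline of segmentation, denoising, feature weighting, and matching, but does not give the formulas. The following is a minimal sketch of that general idea, not the authors' method. It assumes a TF-IDF style weight with smoothed IDF, a tiny hypothetical stop-word list for denoising, and input that has already been segmented into tokens (standing in for the Pangu segmenter). All function names are illustrative.

```python
import math
from collections import Counter

# Hypothetical stop-word list standing in for the paper's denoising step.
STOP_WORDS = {"the", "of", "and", "a", "to", "in"}


def denoise(tokens):
    """Lowercase tokens, then drop stop words and single-character tokens."""
    lowered = (t.lower() for t in tokens)
    return [t for t in lowered if len(t) > 1 and t not in STOP_WORDS]


def feature_weights(category_tokens):
    """Compute an assumed TF-IDF weight per term for each category.

    category_tokens maps a category name to its pre-segmented token list.
    """
    cleaned = {c: denoise(toks) for c, toks in category_tokens.items()}
    n = len(cleaned)
    doc_freq = Counter()
    for toks in cleaned.values():
        doc_freq.update(set(toks))
    weights = {}
    for c, toks in cleaned.items():
        counts = Counter(toks)
        total = len(toks) or 1
        weights[c] = {
            # Smoothed IDF keeps every weight positive.
            t: (k / total) * (math.log((1 + n) / (1 + doc_freq[t])) + 1)
            for t, k in counts.items()
        }
    return weights


def select_features(weights, k):
    """Keep the k highest-weighted terms per category (ties broken by term)."""
    return {
        c: dict(sorted(w.items(), key=lambda item: (-item[1], item[0]))[:k])
        for c, w in weights.items()
    }


def match(tokens, features):
    """Score each category by the summed weights of its feature terms in the text."""
    present = set(denoise(tokens))
    scores = {
        c: sum(w for t, w in feats.items() if t in present)
        for c, feats in features.items()
    }
    return max(scores, key=scores.get), scores


corpus = {
    "computing": ["python", "programming", "algorithm", "python", "code"],
    "cooking": ["recipe", "kitchen", "flavor", "recipe", "cooking"],
}
features = select_features(feature_weights(corpus), k=3)
best, scores = match(["an", "introduction", "to", "python", "algorithm", "design"], features)
print(best, scores)
```

For classification checking, the best-scoring category could be compared with the class already assigned to a book, and a mismatch flagged for manual review. That workflow is likewise an assumption drawn from the abstract's stated purpose.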
