Abstract
To improve the topical focus and accuracy of search engines, a vertical search engine with a Chinese word segmentation plug-in is implemented on the Nutch platform. The system functions and architecture of the improved engine are presented, and the functions are analyzed from a use-case perspective. A port logistics information vertical search engine built on this architecture is described, and its operation is compared with that of a general-purpose engine. Experimental results show that these improvements raise the accuracy and efficiency of topic discrimination, make information location and retrieval more precise, reduce interference from irrelevant information, and improve the system's ability to handle the complex Internet environment.
Source
《计算机工程与设计》
CSCD
Peking University Core Journal (北大核心)
2011, No. 2, pp. 539-542, 548 (5 pages in total)
Computer Engineering and Design
Funding
Major Project of the National Key Technology R&D Program of China (2007BAH10B01)