摘要
站内搜索引擎是针对某个网站内部的全文检索服务,具备信息检索的核心技术。提出在文件系统上构建的解决方案,使用全文检索开发工具包——Lucene,实现站内搜索引擎系统。不仅针对关系数据库的数据,还对服务器文件系统上的各种非结构化文档数据进行加工、信息抽取,并创建索引文件进行搜索,最终实现对站内被检索数据的导航浏览,关键字高亮提示,筛选排序等。经过测试,检索效率较高,效果良好。
Search engine within the Website is the full-text search service in a Website,with the core technology of information retrieval.Proposes a solution built on the file system and uses the full-text search development kit-lucene,to realize a search engine.Not only the data in the relational database but also various unstructured document on the server file system are processed and extracts the information of them.And creates the index file to search,and finally the navigation view,Keyword highlighting tips,and filter order of the data to be retrieved within the Website.Test result shows that the retrieval efficiency is high.
出处
《现代计算机》
2011年第8期64-67,84,共5页
Modern Computer
基金
国家自然科学基金(No.61070154)
广东省自然科学基金(No.20092A008)
广州市科技攻关项目(No.200922-D081)