Abstract
This paper surveys the state of the art in query log analysis. First, the information commonly recorded in web query logs and the publicly available datasets are introduced. Then, research that draws on query logs in web search, information extraction, and closely related areas is described and analyzed in detail. Finally, the problems and challenges facing query-log-based research are discussed. The emphasis is on summarizing, comparing, and analyzing the mainstream methods and the latest progress, in the hope of being useful to future research.
Source
《电子学报》
EI
CAS
CSCD
PKU Core (Peking University Core Journals)
2013, No. 9, pp. 1800-1808 (9 pages)
Acta Electronica Sinica
Funding
National Natural Science Foundation of China (No. 60736044, No. 61073126)
Keywords
query log analysis
query log mining
web search
information extraction