摘要
随着Deep Web数量和规模的快速增长,通过对其发起查询请求以得到存储在后台数据库中的相关信息,日渐成为用户获取信息的主要方式。为了方便用户有效地利用Deep Web中的信息,越来越多的研究者致力于这一领域的研究,重点之一是Deep Web后台数据库的数据集成。由于Deep Web后台数据库存储的主要是文本信息,使得从文本处理角度出发,针对Deep Web中存储的内容进行查询与检索的研究具有十分广阔的应用前景。本文对Deep Web的研究现状进行了较为详细的分析,同时对研究的发展方向进行了展望。
With the rapid increase in numbers and scales of deep web sites on the Internet,search for data or information from deep web sites by submiting queries to and obtaining results from the backend databases has become a major means in information retrieval from the Web.This area has attracted many researchers to devote their efforts on development of technologies to make better use of information in th deep web.One challenge is searching for and integration of data from various databases in deep web.Since deep web is dominated by text data,research and development of technologies for text information retrieval from deep web have a broad application potential.In this paper,we review the state-of-the-art of deep web research in details and propose some future research directions.
出处
《集成技术》
2012年第3期47-54,共8页
Journal of Integration Technology