Abstract
To make effective use of Deep Web resources, Deep Web data integration has become one of the hot topics of current research. The ability to discover Deep Web sites efficiently is the foundation of, and the key to, Deep Web data integration. This paper presents a Deep Web interface discovery method that consists of two parts: determining suitable query submission terms based on domain knowledge, and discovering in-domain Deep Web interfaces with heuristic rules. Experimental results show that the method achieves high accuracy and recall, and is both feasible and practical.
Source
《河北大学学报(自然科学版)》
CAS
PKU Core (Peking University Core Journals)
2010, Issue 1, pp. 107-112 (6 pages)
Journal of Hebei University (Natural Science Edition)
Funding
Key Scientific Research Project of the Hebei Provincial Department of Education (ZH200804)