Abstract
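Heterogeneous data source integration systems must integrate many kinds of data sources, including the WWW. Some of these sources have neither a regular schema structure nor powerful query facilities, which makes query planning difficult. Based on an analysis of the requirements for generating query plans in heterogeneous integration systems, this paper introduces the concept of data source capability description and then proposes a data source capability description framework. The framework rests on the semantic mappings between each data source's local schema and the mediated schema, together with a description of each source's query capabilities. It meets the needs of query planning well and provides a basis for query optimization. On this basis, a query planning system framework based on data source capability description is designed, and a complete example illustrates how the capability description framework is applied in query planning.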
Heterogeneous data integration systems face a rapid increase in the number of structured and unstructured data sources available online. These may be full-fledged databases, simple files, HTML pages, or specialized data sources that possess diverse query processing capabilities, which makes query planning difficult. This paper investigates the issues in describing data sources' capabilities in terms of their schema correspondences and query capabilities, and proposes a framework-based mechanism for describing those capabilities in fine detail. Finally, an example demonstrates that the approach supports query planning in heterogeneous data integration systems.
Source
《小型微型计算机系统》
CSCD
Peking University Core Journal (北大核心)
2006, No. 8, pp. 1509-1513 (5 pages)
Journal of Chinese Computer Systems
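Funding
Supported by the National Natural Science Foundation of China (60172012)
Supported by the Key Program of the Natural Science Foundation of Hunan Province (03JJY3110)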
Keywords
heterogeneous data source integration
query planning
capability description
query pattern