摘要
In order to solve the problem of semantic heterogeneity in information integration, an ontology based semantic information integration (OSII) model and its logical framework are proposed. The OSII adopts the hybrid ontology approach and uses OWL (web ontology language) as the ontology language. It obtains unified views from multiple sources by building mappings between local ontologies and the global ontology. A tree- based multi-strategy ontology mapping algorithm is proposed. The algorithm is achieved by the following four steps: pre-processing, name mapping, subtree mapping and remedy mapping. The advantages of this algorithm are: mapping in the compatible datatype categories and using heuristic rules can improve mapping efficiency; both linguistic and structural similarity are used to improve the accuracy of the similarity calculation; an iterative remedy is adopted to obtain correct and complete mappings. A challenging example is used to illustrate the validity of the algorithm. The OSII is realized to effectively solve the problem of semantic heterogeneity in information integration and to implement interoperability of multiple information sources.
针对信息集成中的语义异构问题,提出了一个基于本体的语义信息集成模型OSII,并给出了逻辑框架.OSII采用混和本体方式建模,以OWL描述本体,通过局部本体与全局本体之间的映射获得多源统一视图.提出了一种基于树结构的多策略本体映射算法,该算法包含4个步骤,即预处理,名称映射,子树映射和映射矫正.其特点在于:按照数据类型分类进行映射,并采用启发式规则,提高映射效率;同时考虑概念的语言相似性和结构相似性,提高相似度计算的准确性;采用迭代矫正,最终得到正确而完整的映射对.通过一个挑战性的实例说明了算法的有效性.OSII能很好地解决信息集成中的语义异构难点,实现多信息源之间的互操作.