摘要
核心本体对最基本的领域知识建模,并在上位本体和领域本体之间建立联系。上位本体是领域无关的而核心本体是领域相关的,因此在自动创建中文核心本体过程中,映射中文核心术语到上位本体概念有很多的错误。本文提出的改进方法首先找到共享后缀术语集内被共享的术语条数更多、与各术语的意义更接近的上位概念;然后用其来改进词集中的核心术语和概念之间的映射。实验证明,该方法有效的提高了核心本体自动创建的精确度。
A core ontology models fundamental domain knowledge and bridges the gap between an upper ontology and a domain ontology. Since the upper ontology is domain independent, many errors are introduced when mapping core terms to the upper ontology concepts in automatic Chinese core ontology construction. This paper proposes an extraction method making use of terms sharing the same suffixes to find the hypernyms; the term that is more frequently shared by other terms and are closer in meanings to those terms. These hypernyms are then used to improve the mapping of these terms to the correct concepts. Experiments show that a significant improvement is achieved in terms of accuracy for core ontology construction.
出处
《中文信息学报》
CSCD
北大核心
2010年第1期48-53,共6页
Journal of Chinese Information Processing
基金
香港理工大学教育资助委员会(UGC)
研究资助局(RGC)角逐研究用途补助金资助项目(CERG)(PolyU 5225/05E
PolyU 5190/04E
PolyU 5246/08E)
关键词
计算机应用
中文信息处理
本体构建
领域核心本体
上位本体
领域本体
上位关系
computer application
Chinese information processing
ontology construction
core ontology
upper ontology
domain ontology
hypernymy