Abstract
Taking the corpus of reference [2] as its main data, this paper discusses how the logical structure and discourse structure of sentences constrain the type of information template in information extraction (IE), and how they constrain the recovery of information items that are missing from the current sentence or expressed by pronouns and similar forms. It first explains what knowledge of logical structure and discourse structure based on argument structure is. It then analyzes how negation operators and tense-aspect markers change the event type of a sentence and its matching relation with the relevant event template. Next, it discusses how syntactic operations such as embedding and nominalization of a verb's argument structure change the positions of the affected arguments and their corresponding information items. Finally, it discusses how to use knowledge of discourse structure to recover information items that are missing from the current sentence or expressed by pronouns or deictic expressions.
Source
Journal of Chinese Information Processing (《中文信息学报》)
CSCD
Peking University Core Journals (北大核心)
2005, No. 4, pp. 39-45 (7 pages)
Funding
Supported by the Ministry of Education's Tenth Five-Year Plan for Research in the Humanities and Social Sciences (01JB740006)
Keywords
computer application
Chinese information processing
information extraction
argument structure
logic structure
discourse structure
pronoun
deixis