Abstract
This paper defines the complexity of Chinese maximal noun phrases (MNPs) from a structural perspective and proposes two indicators for identifying complex structures: the distributional tendency of the internal structure and the markedness of the structure. On this basis, MNPs are classified into Simple MNPs and Complex MNPs. Complex MNPs may be marked or unmarked; the marked ones take various forms and account for the majority. In terms of position, the attributive slot before "的" (de) in a Complex MNP accommodates complex structures most readily, although the head position also contains a small number of them. When complex structures are realized in the linear surface string, they give rise to difficult cases for recognition, such as verbs or prepositions embedded inside MNPs, consecutive verbs or prepositions at MNP boundaries, and ambiguous structures. Targeted study of these problems should benefit MNP recognition.
Authors
钱小飞
侯敏
QIAN Xiaofei; HOU Min
Source
《语料库语言学》
2017, Issue 1, pp. 20-30, 100 (12 pages in total)
Corpus Linguistics
Funding
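Shanghai Young Teachers Training Support Program, "Research on Recognition Methods for Chinese Maximal Noun Phrases" (shu11053)
Supported by the research project of the National Language Resources Monitoring and Research Center, "Research on Maximal NPs for Shallow Parsing" (YZYS08-04)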