摘要
蒙古语文信息处理已初步完成字、词处理阶段的基本任务,正在步入句处理阶段,并且在国家自然科学基金的资助下构建了蒙古语依存树库MDTB。该文以MDTB为训练和评测数据,设计实现了一种基于词汇依存概率的蒙古语依存句法分析模型。目前,该模型的无标记准确率、有标记准确率和核心词准确率分别达到了71.24%、61.42%和93.05%。
Mongolian language information processing has completed the basic task of word processing stage,and now is entering the stage of sentence processing.Under the support of National Natural Science Foundation,we have constructed the Mongolian Dependency Treebank(MDTB).In this paper,we use MDTB as training and evaluation data,designing and implementing a Mongolian dependency parsing model based on lexical dependent probability.Currently,the model achieves accuracies of 71.24%,61.42% and 93.05% in the unlabelled annotation score,the labeled annotation score and the head word annotation score,respectively.
出处
《中文信息学报》
CSCD
北大核心
2012年第3期27-32,共6页
Journal of Chinese Information Processing
基金
国家自然科学基金项目(60763003)
国家社科基金项目(10CYY022)
教育部人文社会科学研究项目(09yjc740045)
关键词
蒙古文
依存语法
句法分析
概率模型
Mongolian
dependency grammar
parsing
probability model