Abstract
This paper describes the role of unknown-word recognition in natural language understanding. It analyzes in detail the recognition of one typical class of unknown words, namely specialized terms. Based on the characteristics of specialized terms, it proposes a recognition algorithm built on the Bi-directions Estimation Model of Corpus. Experiments show that the algorithm achieves good accuracy and recall.
Source
Computer and Modernization (《计算机与现代化》)
2005, Issue 9, pp. 13-15, 18 (4 pages)
Keywords
unknown word
single noun
compound noun
bi-directions estimation model of corpus