Abstract
Words are the smallest grammatical units that can be used independently, and the vocabulary outline is the foundation of language teaching. Developing a scientific word list that reflects the reality of language use and the laws of human cognition is therefore of great significance for improving the effectiveness of Chinese teaching. Based on a diachronic corpus, this paper measures the stability of words from two aspects, word frequency and word meaning, to provide a reference for the construction of Chinese word lists. A correlation analysis is carried out on two statistical indicators of word frequency stability, word embeddings are introduced into the measurement of word sense stability, and the distribution of word stability is examined. An analysis of the HSK (Chinese Proficiency Test) vocabulary level outline (2012 revision) shows that, overall, the proposed stability measures reflect the level distribution of the outline well: the lower the outline level, the higher the stability of its words. The measures can also provide a basis for updating and adjusting the outline.
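The abstract does not say which statistics the paper actually uses, so the following is only an illustrative sketch of the two kinds of measure it describes. The function names, the use of the coefficient of variation for frequency stability, and the use of mean cosine similarity between consecutive periods for sense stability are all assumptions made for this example, not the paper's definitions. The sense measure further assumes that the per-period embeddings have already been aligned into a common vector space.

```python
import math
from statistics import mean, pstdev


def frequency_stability(rel_freqs):
    """Hypothetical frequency stability: 1 / (1 + coefficient of variation).

    rel_freqs holds one relative frequency per time slice of the corpus.
    Returns 1.0 for a perfectly constant frequency and approaches 0.0 as
    the frequency fluctuates more; a word that never occurs scores 0.0.
    """
    mu = mean(rel_freqs)
    if mu == 0:
        return 0.0
    cv = pstdev(rel_freqs) / mu  # population std. dev. relative to the mean
    return 1.0 / (1.0 + cv)


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def sense_stability(vectors_by_period):
    """Hypothetical sense stability: mean cosine similarity between the
    embeddings of the same word in consecutive time periods.

    Assumes the embeddings of different periods share one aligned space.
    """
    pairs = zip(vectors_by_period, vectors_by_period[1:])
    sims = [cosine(a, b) for a, b in pairs]
    return mean(sims) if sims else 1.0


if __name__ == "__main__":
    # Made-up numbers, purely to show the call pattern.
    print(frequency_stability([0.10, 0.11, 0.09, 0.10]))
    print(sense_stability([[1.0, 0.0], [0.9, 0.1], [0.8, 0.3]]))
```

Under these assumptions both scores are higher for more stable words, so the trend reported in the abstract would appear as higher average scores for lower outline levels.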
Author
Zhang Weihua (张卫华)
School of Electrical Engineering, Zhengzhou University, Zhengzhou, Henan 450001
Source
Henan Science and Technology (《河南科技》)
2017, No. 7, pp. 56-59 (4 pages)
Keywords
diachronic corpus (历时语料)
word stability (词语稳定性)
word frequency stability (词频稳定性)
word sense stability (词义稳定性)
HSK vocabulary outline (HSK词汇大纲)