摘要
藏文基本属性的研究是藏文信息处理技术的基础 ,现代藏字的研究是藏文信息处理的重点。藏字全集是有限集 ,为了更好地研究现代藏字 ,本文以现代藏字为研究对象 ,按照现代藏文文法的规律 ,对全部现代藏字用计算机辅助统计了藏字全集的个数、藏字的字长、藏字的结构方式、位置特征、字符频度以及所有现代藏字中的整基字丁 ,并且简要地分析了这些数据。这些数据可以较全面地反映现代藏字的本质特征 ,可为藏文研究和藏字信息处理提供基础数据。
A study of the basic qualities of the Tibetan language forms the basis for the Tibetan information processing. Study of modern Tibetan character is an important aspect in developing Tibetan information processing. All modern Tibetan characters set is finite, and useful for better researching modern Tibetan character, This thesis is concerned with the modern Tibetan character and how to, according to Tibetan grammar rules and using computer, do the following: calculate the total number of character, length of character, structural mode, quality of position, letter frequency, and entire character. Moreover, this thesis will also examine in a summary manner the above figures. This thesis will use modern Tibetan language analysis to better understand the nature of the language, thus offering a basic understanding for the study of the Tibetan language and Tibetan information processing.
出处
《中文信息学报》
CSCD
北大核心
2005年第1期71-75,共5页
Journal of Chinese Information Processing
关键词
计算机应用
中文信息处理
藏字全集
藏字结构
藏字频度
computer application
Chinese information processing
Tibetan character set
structural mode
letter frequency