摘要
军用词典库的设计,关键是对装备有词类进行"类分"和"组分"。其数据结构由常用的位置代码改为父级代码,同时记录本层次的代码。词处理模块选用中科院的多层隐马模型分词系统ICTCLAS及哈工大的统计分词系统HIT_IRLab,并设计判决器及其判决规则。理论值和实际统计所得数据仿真比较表明,该词典库有利于提高装备信息管理的自动化水平。
The most important step in classifying the word-class of equipments is classification and grouping when designing the militarily dictionary base. Its data organization structure uses the usual position code to replace the parent field code and meanwhile records the same level code. The judgment and judgment-rule of the word process module, which is built on ICTCLLAS and HIT_IRLab, are built. It is improved that the auto-processing level of information management system can be prompted by the dictionary across the emulation based on the theoretical value and statistical value.
出处
《兵工自动化》
2007年第8期50-51,65,共3页
Ordnance Industry Automation
基金
军队重点实验室建设项目(2110201)
关键词
军用词典库
词类
数据结构
分词技术
判决规则
Militarily dictionary base
Word-class
Data organization structure
Lexical technology
Judgment rule