基于模糊模型相似测量的字符无监督分类法被引量：3

A Unsupervised Character Classification Based on Similarity Measure in Fuzzy Model

下载PDF

导出

摘要该文提出一种基于模糊模型相似测量的文本分析系统的字符预分类方法 ,用于对字符的无监督分类 ,以提高整个字符识别系统的速度、正确性和鲁棒性 .作者在字符印刷结构归类的基础上 ,采用模板匹配方法将各类字符分别转换成基于一非线性加权相似函数的模糊样板集合 .模糊字符的无监督分类是字符匹配的一种自然范例并发展了加权模糊相似测量的研究 .该文讨论了该模糊模型的特性、模糊样板匹配的规则 ,并用于加快字符分类处理 ,经过字符分类。 This paper presents a character preclassification method based on similarity measure in Fuzzy model to perform unsupervised character classification for improvement in robustness, correctness, and speed of a character recognition system. On the basis of character typographical structure categorization, a pattern matching is used to classify the characters in each category into a set of fuzzy prototypes based on a nonlinear weighted similarity function. The emphasis of inequality measure for small characters guarantees no misclassification, but a little redundancy is encountered on the fuzzy prototype set. This redundancy can be removed by self grouping of the final prototype set. The fuzzy unsupervised character classification, which is natural in the representation of prototypes for character matching, is developed and a weighted fuzzy similarity measure is explored. A fuzzy model of prototypes is defined and several propositions of the features of the fuzzy model are given. The characteristics of the fuzzy model and rule based matching of fuzzy prototypes are discussed and used in speeding up the classification process. The fuzzy model of prototype has been verified to reduce the effect of noise. Based on prototypes that are free of noise, the recognition problem will be simplified and the speed as well as recognition rate will be increased. For ambiguous characters, probably as merged, the accuracy of postprocessing also will be improved. After classification, the character recognition, which is simply applied on a smaller set of the fuzzy prototypes, becomes much easier and less time consuming.

作者卢达钱忆平谢铭培浦炜

机构地区常熟高等专科学校复旦大学计算机科学系

出处《计算机学报》 EI CSCD 北大核心 2002年第4期423-429,共7页 Chinese Journal of Computers

基金国家自然科学基金(7870 0 12 ) 江苏省教委留学回国人员科研基金(199715 5 1) 江苏省教委自然科学研究基金 (99KGB14 0 0 0 9)资助

关键词模糊模型加权模糊相似测量字符无监督分类匹配算法分级归类字符识别字符匹配 fuzzy model, weighted fuzzy similarity measure, unsupervised character classification, matching algorithm, classification hierarchy

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献3

1卢达,谢铭培,钱忆平,浦炜,常熟.一种基于骨架法形态分析的粘连字符图象分切方法[J].中文信息学报,1999,13(2):40-45. 被引量：8
2卢达,谢铭培,浦炜.基于印刷字符模糊结构分析的字符预分类方法[J].软件学报,2000,11(10):1397-1404. 被引量：4
3卢达,浦炜,谢铭培.文本行字符基线的精确测定算法[J].小型微型计算机系统,2000,21(7):726-728. 被引量：2

二级参考文献9

1Wang L，IEEE Trans Pattern Anal Machine Intell，1993年，15卷，10期，1053页
2De Luca P G，Pattern Recognition，1991年，24卷，7期，609页
3K.Y.Wong,R.G.Casey,and F.M.Wahl."Document analysis system"[].IBM JResDevelopment.1982
4Rosenfeld R,Kak A. C.Digital Pecture Processing[]..1982
5S.C.Hinds,,J.L.Fisher,,D.P.D‘Amato.A Document Skew Detection Method Using Run-length Smearing and the Hough Transform[].Proc intl Confon Pattern Recognition.1990
6Y Nakano,Y Shima,H Fujisacw.An algorithm for the skew nor-malization of document image.Proc.IEEEE 10th nit.Conf.PattemRecognition,Atlantic City,NJ,U[].SA.1990
7Hartigan,J. A. Clustering algorithms . 1975
8P G DeLuca,A Gisotti.Printed character Preclassification basedonword structure[].Pattern Recognition.1991
9霍宇翔,丁宇,陈耘,金龙,周兆英.细化畸变节点形态分析及修正策略研究[J].计算机辅助设计与图形学学报,1997,9(6):500-505. 被引量：11

共引文献11

1吴微,侯利昌.基于LL(1)文法的印刷体数学公式结构分析方法[J].大连理工大学学报,2006,46(3):454-459. 被引量：4
2孔月萍,郭世雄,梁韶军.一种新的粘连字符图像分割方法[J].电子技术应用,2009,35(7):136-138. 被引量：7
3李先岭,龚晖.远程作业技术中公式编辑问题的解决方案[J].计算机应用与软件,2009,26(12):146-147. 被引量：2
4常丹华,何耘娴,苗丹.中英混排文档图像粘连字符分割方法的研究[J].激光与红外,2010,40(12):1369-1373. 被引量：2
5殷庆立,苏慕珍.水泥基材料强度影响因素:分析与综述[J].硅酸盐通报,1999,18(4):60-62. 被引量：9
6张文杰,王大通,李洁.基于模板匹配的多目标精确粘连分割算法[J].计算机与现代化,2011(6):22-24. 被引量：2
7卢达,浦炜,钱忆平,谢铭培.基于模糊模型相似测量的小类别数汉字及数字识别[J].计算机工程与应用,2000,36(11):78-80. 被引量：3
8张杰武,张会林,李伦清.一种去噪去污的人民币序列号分割方法[J].计算机工程与应用,2015,51(7):179-183.
9巨志勇,何晓蕾,王超男.基于文本行基线的倾斜角检测算法[J].电子科技,2016,29(10):39-42.
10缪永伟,胡争光,孙瑜亮,张旭东,刘震.结合分类卷积神经网络和形状上下文的线画图检索[J].计算机辅助设计与图形学学报,2019,31(4):513-521. 被引量：1

同被引文献17

1FUKUSHIMA K , WAKE N . Handwritten alphanumeric character recognition by the Neocognitron [ J]. IEEE transactions on neural network, 1991:355 -365.
2CARPENTER G-A, GRESSBERG S. The ART of adaptive pattern recognition by a self-organizing neural network[ A]. IEEE computer,1988,21:77-88.
3LEE H-M, SHEN C-C. A handwritten Chinese characters recognition method based on primitive and fuzzy features via SEART neural net model[C]. IEEE Int. Conf. Syst. Man Cybern. 1995. 1939-1944.
4Zhang J Y, Ding X Q, Chen Y S, et al. Multi-scale feature extraction and nested-subset classifier design for high accuracy handwritten character recognition [A]. In: The 15th ICPR'2000[C]. 2000. 581-584.
5Ching R H, Lee C W, Chen Z, et al. Preclassification of handwritten Chinese character based on basic Stroke substructures [J]. Patt Recogn Lett, 1995,16: 1023- 1032.
6Carpenter G A, Gressberg S. The ART of adaptive pattern recognition by a self-organizing neural network [J]. IEEE Computer,1988, 21: 77-88.
7Fukushima K, Wake N. Handwritten alphanumeric character recognition by the Neocognitron [J]. IEEE Trans on Neural Network,1991. 355 - 365.
8吴佑寿.丁晓青.汉字识别,理论,方法与实现[M].北京:高等教育出版社,1992..
9Y Zhang,X Q Ding et al.Multi-scale feature extraction and nestedsubset classifier design for high accuracy handwritten character recognition[C].In: 15^th ICPR'2000,2000:581-584.
10F H Cheng,W H Hsu.Research on Chinese OCR in Taiwan[C]. In : IPRAI'91,1991 ; 139-164.

引证文献3

1卢达,陈琦玮,谢铭培.基于模糊规则和相似测量的手写汉字预分类法[J].计算机工程与应用,2005,41(25):75-77.
2卢达,浦炜,陈琦玮,谢铭培.基于神经网络和模糊匹配算法的手写汉字预分类研究[J].计算机应用,2005,25(10):2418-2421. 被引量：2
3卢达,浦炜,谢铭培.基于SEART网和模糊相似测量的手写汉字预分类法[J].东南大学学报（自然科学版）,2005,35(A02):79-83.

二级引证文献2

1孟建军,吴庆立,祁文哲,方永锋,高明.基于FPS200的指纹门锁控制系统设计与实现[J].核电子学与探测技术,2008,28(3):671-676. 被引量：4
2方兴林.基于字符笔画斜率特征的车牌识别算法研究[J].重庆工商大学学报（自然科学版）,2014,31(9):72-76. 被引量：3

1卢达,钱忆平,谢铭培,浦炜.An Approach to Unsupervised Character Classification Based on Similarity Measure in Fuzzy Model[J].Journal of Southeast University(English Edition),2002,18(4):370-376.
2李少远,田永青.一种基于相似测量的模糊推理方法[J].天津纺织工学院学报,1999,18(1):5-9.
3卢达,浦炜,谢铭培.一种用于提高字符识别速度的字符预分类法研究　[J].计算机工程与应用,2000,36(4):78-81.
4卢达,浦炜,钱忆平,谢铭培.基于模糊模型相似测量的小类别数汉字及数字识别[J].计算机工程与应用,2000,36(11):78-80. 被引量：3
5李天铎.高质量产品和集约化销售[J].管理观察,1999,0(2):32-32.
6路小波,凌小静,刘斌.基于组合特征的车牌字符识别[J].仪器仪表学报,2006,27(7):698-701. 被引量：11
7杨秋芬,桂卫华,胡豁生.基于改进非线性加权的图像融合算法[J].计算机工程与应用,2014,50(14):22-25. 被引量：6
8崔艳华,卢朝阳.自动分割视频运动目标的一种实现方法[J].计算机工程与应用,2003,39(24):103-105. 被引量：2
9赵忱,贾克文,王一柏.一种提高样板匹配精度的实现算法[J].计算机工程,2000,26(2):34-36.
10戴峻峰,付丽辉,曹洁.小波变换在医学图像边缘增强中的应用[J].计算机应用与软件,2008,25(12):135-137. 被引量：3

计算机学报

2002年第4期

浏览历史

内容加载中请稍等...

基于模糊模型相似测量的字符无监督分类法被引量：3

参考文献3

二级参考文献9

共引文献11

同被引文献17

引证文献3

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于模糊模型相似测量的字符无监督分类法 被引量：3

参考文献3

二级参考文献9

共引文献11

同被引文献17

引证文献3

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于模糊模型相似测量的字符无监督分类法被引量：3