期刊文献+

字符模糊的中文纸质发票文字识别方法

Chinese Paper Invoice Text Recognition Method with Character Blurring
下载PDF
导出
摘要 基于纸质发票字符模糊导致OCR识别性能低下的问题,本文提出一种自适应迭代视觉语义模型来解决此问题。该模型包含2个模块:识别模块利用ResNet作为编码器,Transformer为解码器对模糊文本进行初步预测;修正模块将识别模块的预测结果传入双向语义模型,依据上下文语义信息修正字符,进行初步的文本修正,再将结果与标签输入判别器,若判别成功则直接输出结果,若判别失败则会将结果迭代语义模型,进一步修正,提高识别率。实验结果表明,本文所提模型相比目前的中文识别模型ch_PP-OCRv3的识别正确率高出3.39个百分点,与其他模型相比识别率平均提高6.81个百分点,并且在IC15、IIIT5K和IC03-Word等公开数据集中均表现出色,验证了模型的泛化性能。 This paper addresses the problem of low OCR recognition performance caused by character blurring in paper invoices.A novel adaptive iterative visual semantic model is proposed to tackle this issue.The model consists of two modules:the recogni⁃tion module utilizes ResNet as the encoder and Transformer as the decoder to make initial predictions on the blurred text.The cor⁃rection module takes the recognition module’s predictions and feeds them into a bidirectional language model,which leverages contextual semantic information to refine characters,thereby performing initial text correction.The results are then input to a dis⁃criminator,which outputs them directly if successful or iterates the language model for further refinement if failed,effectively im⁃proving the recognition accuracy.Experimental results demonstrate that the proposed model outperforms the current state-of-the art Chinese recognition model ch_PP-OCRv3 by 3.39 percentage points in recognition accuracy and achieves an average 6.81 percentage points improvement compared to other models.Moreover,the model exhibits excellent generalization performance on public datasets such as IC15,IIIT5K,and IC03-Word,validating its effectiveness.
作者 来坤 LAI Kun(School of Communication and Information Engineering,Xi’an University of Science and Technology,Xi’an 710600,China)
出处 《计算机与现代化》 2024年第8期114-119,共6页 Computer and Modernization
基金 国家重点研发计划项目(2018YFC0808300) 陕西省科技计划重点产业创新链(群)项目(2020ZDLGY15-07) 西安市科技计划科技创新引导项目(201805036YD14CG20(4))。
关键词 文字识别 模糊文本 纸质发票 神经网络 ResNet text recognition blurry text paper invoice neural network ResNet
  • 相关文献

参考文献7

二级参考文献39

  • 1赵永涛,李志敏,王洪建,陈志云,王林.印章识别中的图像预处理研究[J].仪器仪表学报,2004,25(z3):401-403. 被引量:8
  • 2庞韶宁,李介谷.票据识别系统数据获取过程研究[J].计算机工程,1997,23(S1):287-289. 被引量:1
  • 3天捷 沙飞 张新生.实用图像分析与处理技术[M].北京:电子工业出版社,1995..
  • 4Djeziri S, Nouboud F. Plamondon R. Extraction of Items from Checks.In 4th International Conference on Document Analysis and Recognition, t997: 749-752.
  • 5Ling L. Lizaraga M. Gomes N. et al. A Prototype for Brazilian Bankcheck Recognition. International Journal of Pattern Recognition and Artificial Intelligence, World Scientifie, 1997:549-569.
  • 6Yu B.Lin X,Wu Y,et al.Isothetic Polygon Vepresentation for Contours.CVGO:Image Understading.1982,56:264-268
  • 7Such C Y , Xu Qizhi, Lam L. Automatic Recognition of Handwritten Data on Cheques ± Fact or Ection. Pattern Recognition Letters, 1999,20:1287-1295.
  • 8Al-Ohatia Y. MohamedCheriet B, Suena C. Databases for Recognition of Handwritten Arabic Cheques. Pattern Recognition 2003.36: 111-121
  • 9Dimauro G. Impedovo S, Pirlo D et al.Automatic Bankcheck Processing: A New Engineered System.International Journal of Pattern Recognition and Arti-ficial Intelligence.World Scientific.1997:467-503
  • 10YANG M C K, LEE JS, LIEN C C, et al. Hough transfoim modi-fied by line connectivity Intelligence, 1997,19(8) :905 - 910.

共引文献182

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部