期刊文献+

语音特征匹配的图像配准方法

An approach to speech feature matching using image registration algorithm
下载PDF
导出
摘要 为了解决传统DTW算法准确度和效率不高的问题,提出了一种基于图像配准方法的语音特征匹配算法.该方法将MFCC参数映射为二值图像,并通过引入图像配准的方法进行模板匹配,达到了语音特征匹配的目的.实验结果表明,与传统的DTW算法相比,该方法的准确率、召回率和算法执行效率有了明显的提高. To overcome the low accuracy and efficiency of the traditional DTW algorithm used in speech feature matching, an approach employing the image registration algorithm is proposed. The speech feature matching process was implemented by mapping MFCC coefficients to the binary image, and introducing the image registration algorithm to the template matching. The experimental result shows that, comparing with the traditional DTW algorithm, the proposed algorithm achieves a better performance in precision rate, recall rate and the cost of computation.
出处 《哈尔滨工业大学学报》 EI CAS CSCD 北大核心 2008年第7期1152-1155,共4页 Journal of Harbin Institute of Technology
基金 吉林省科技发展计划国际合作项目(20050703-1)
关键词 DTW 模板匹配 图像配准 语音识别 DTW template matching image registration speech recognition
  • 相关文献

参考文献11

  • 1姚天任.数字语音处理[M].武汉:华中科技大学出版社.2003.
  • 2张晨燕,孙成立.非特定人孤立词语音识别系统的片上实现[J].计算机工程与应用,2007,43(13):194-196. 被引量:10
  • 3韩继庆,张磊,郑铁然.语音信号处理[M].北京:清华大学出版社,2004.
  • 4赵文,杨澄宇,杨鉴.孤立字词识别[J].计算机应用,2001,21(6):12-14. 被引量:7
  • 5刘敬伟,徐美芝,郑忠国,程乾生.基于DTW的语音识别和说话人识别的特征选择[J].模式识别与人工智能,2005,18(1):50-54. 被引量:13
  • 6BOU-GHAZALE S E, HANSEN J H L. A comparative study of traditional and newly proposed features for recognition of speech under stress [ J ]. IEEE Transition on Speech and Audio Processing ,2000, 8 (4) :429 -442.
  • 7SKOWRONSKI M D, HARRIS J G. Increased MFCC filter bandwidth for noise - robust phoneme recognition [ C]//IEEE International Conference on Acoustics, Speech, and Signal Processing. Florida, USA : [ s. n. ] , 2002:801 - 804.
  • 8GHAFFARY B K, SAWCHUK A A. A survey of new tech- niques for image registration and mapping [ C ]//Proceedings of the SPIE : Applications of Digital Image Processing. Bellingham, WA, USA: [ s. n. ], 1983:222 - 239.
  • 9HUTTENLOCHER D P, KLANDERMAN G A, RUCKLIDGE W J. Comparing images using the Hausdorff distance[ J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1993, 15 (9) :850 - 863.
  • 10VENTURA A D, RAMPINI A, SCHETTINI R. Image registration by recognition of corresponding structures [J ]. IEEE Transactions on Geoscience and Remote Sensing, 1990, 28 (3) : 305 - 314.

二级参考文献27

  • 1杜利民,谢凌云,刘斌.HMM非特定人连续语音识别的嵌入式实现[J].电子与信息学报,2005,27(1):60-63. 被引量:6
  • 2拉宾纳 R.W.谢弗.语音信号数字处理[M].北京:科学出版社,1984..
  • 3周浩.Windows实时语音命令识别系统.中国科学院自动化研究所硕士论文[M].,1994..
  • 4Lawrence Rabiner,Fundamentals of speech recognition,1996年
  • 5杨行峻,语音信号数字处理,1995年
  • 6周浩,学位论文,1994年
  • 7拉宾纳,语音信号数字处理,1984年
  • 8Campbell J P. Speaker Recognition: A Tutorial. Proc of the IEEE, 1997, 85(9): 1437-1462.
  • 9Furui S. Recent Advances in the Speaker Recognition. Pattern Recognition Letters, 1997, 18(9): 859-872.
  • 10Pandit M, Kittler J. Feature Selection for a DTW-Based Speaker Verification System. In: Proc of the IEEE International Conference on Acoustics, Speech, and Signal Processing. Seattle,USA, 1998, Ⅱ: 769-772.

共引文献41

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部