期刊文献+

说话人识别系统中MFCC参数的改进算法 被引量:1

Improved MFCC Algorithm in Speaker Recognition System
下载PDF
导出
摘要 在说话人识别系统中,如何在语音信号中提取出能够表征说话人个性的特征参数是系统的关键问题之一。目前使用最多的M FCC参数主要描述了表征声道特性的谱包络特征,一般忽略了基音频率对M FCC的影响。由于基音频率能够影响M FCC参数对声道特性的准确描述,进而影响说话人识别系统的性能,因此本文提出了一种基于平滑短时幅度谱包络的S MFCC参数(smoothing MFCC)。实验表明,改进后的M FCC参数能够很好地减少基因频率对M FCC的影响,尤其对于基音频率较高的女性说话者,效果更为显著。 In speaker recognition system,one of the key problems is how to extract the feature parameters which are characterizing the speaker.The currently most widely used MFCC parameter primarily describes the spectrum envelope of the sound tract characteristics and ignores the impacts of fundamental frequency theoretically.Given that the fundamental frequency is able to influence the description accuracy of MFCC parameters about the sound track characteristics,thus influence the performance of the speaker recognition system,this paper puts forward the smoothing MFCC(SMFCC),which is based on smoothing short-term spectral amplitude envelope.Experimental results show that the improved MFCC parameters can degrade the bad influences of fundamental frequency effectively and upgrade the performances of speaker recognition system,especially for female speakers,who have higher fundamental frequency.
出处 《洛阳理工学院学报(自然科学版)》 2013年第4期51-55,63,共6页 Journal of Luoyang Institute of Science and Technology:Natural Science Edition
关键词 说话人识别 MFCC 谱包络 五点三次算法 speaker recognition MFCC spectral envelop five-dot-cubic method
  • 相关文献

参考文献7

二级参考文献27

  • 1汪峥,连翰,王建军.说话人识别中特征参数提取的一种新方法[J].复旦学报(自然科学版),2005,44(1):197-200. 被引量:16
  • 2章熙春,曹燕,张军,韦岗.语音MFCC特征计算的改进算法[J].数据采集与处理,2005,20(2):161-165. 被引量:6
  • 3林玮,杨莉莉,徐柏龄.基于修正MFCC参数汉语耳语音的话者识别[J].南京大学学报(自然科学版),2006,42(1):54-62. 被引量:23
  • 4郭武,王仁华,戴礼荣.基于基音周期与清浊音信息的梅尔倒谱参数[J].数据采集与处理,2007,22(2):229-233. 被引量:1
  • 5杨行峻 迟惠生.数字语音信号处理[M].北京:电子工业出版社,1995..
  • 6张雄伟,陈亮,杨吉斌.现代语音技术及应用[M].北京:机械工业出版社.2003.
  • 7Fakhr W,Salam A A,Hamdy N.Enhancement of mismatched conditions in speaker recognition for multimedia applications [J].IEEE International Conference on Acoustics,Speech,and Signal Processing, 2004.
  • 8Sambur M R.Selection of Acoustic Features for Speaker Identification[C].IEEE Trans On ASSP, 1975: 176-182.
  • 9Shajith Ikbal, H.Hermansky, H.Bourlard.Nonlinear Spectral Transformations for Robust Speech Recognition[A].in Proc. of IEEE ASRU 2003 workshop, Nov-Dec, 2003 : 393-398.
  • 10Reynodls D,Rose R.Robust text-independent speaker identification using Gaussian mixture speaker models[J].IEEE Trans on Speech and Audio processing, 1995,3(1 ): 72-83.

共引文献132

同被引文献13

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部