期刊文献+

基于计算听觉场景分析的语音增强改进算法 被引量:2

Improved Speech Enhancement Based on Computational Auditory Scene Analysis
下载PDF
导出
摘要 针对单通道语音增强问题,基于计算听觉场景分析(CASA)的原理,提出了一种基于CASA计算模型的语音增强改进算法。该算法在特征提取中选择了目标语音有效能量、信道互相关等特征,对语谱能量和互相关特征的阈值选取进行了改进。在5种低信噪比噪声干扰条件下的仿真实验结果证明,该算法输出增强语音的信噪比平均提高了9.32dB,有效地抑制了噪声。 Based on computational auditory scene analysis (CASA), this paper proposes an improved algorithm for monaural speech enhancement. In the proposed algorithm, both effective energy of target speech and cross-channel correlation are chosen as extracted feature. Moreover, this algorithm improves the threshold selection on energy spectrum and cross-channel correlation feature. Under the condition of low SNR with 5 different noises, the experimental results show that the proposed algorithm can raise the output SNR by 9.32 dB averagely, and attenuates noise effectively.
出处 《华东理工大学学报(自然科学版)》 CAS CSCD 北大核心 2012年第5期617-621,共5页 Journal of East China University of Science and Technology
关键词 语音增强 计算听觉场景分析 语音有效能量 信道互相关 二值掩码 speech enhancement computational auditory scene analysis effective energy of targetspeech cross channel correlation binary mask
  • 相关文献

参考文献12

二级参考文献15

  • 1程俊,张璞,戴善荣,易克初.小波变换用于信号突变的检测[J].通信学报,1995,16(3):96-104. 被引量:36
  • 2JWAndr.A comparison of auditory and blind separation techniques for speech separation [J].IEEE Trans on Speech and Audio Processing,2001,9(3):189-195.
  • 3Gulzow T,Engelsberg A,Heute U.Comparison of a Discrete Wavelet Transformation and a Nonuniform Polyphase Filterbank Applied to Spectral-subtraction Speech Enhancement[J].Signal Processing,1998,64(1):5-19.
  • 4Smith J O,Abel J S.Bark and ERB Bilinear Transforms[J].IEEE Trans.on Speech and Audio Processing,1999,7(6):697-708.
  • 5Evangelism G,Cavaliere S.Frequency-warped Filter Banks and Wavelet Transforms:A Discrete-time Approach via Laguerre Expansion[J].IEEE Trans.on Signal Processing,1998,46(10):2638-2650.
  • 6Vary P.Digital Filter Banks with Unequal Resolution[C] //Proc.of EUSIPCO Conf.on Short Communication Digest.Lausanne,Switzerland:[s.n.] ,1980:41-42.
  • 7Crochiere R,Rabiner L.Multirate Digital Signal Processing[M].New Jersey,USA:Prentice-Hall,Inc.,1983.
  • 8Kadambe S, et al. Application of the wavelet transform for pitch detection of speech signals[J]. IEEE Trans. on IT, 1992, 38(2): 917-924.
  • 9Jackson P, Shadle CH. Pitch-Scaled Estimation of Simultaneous Voiced and Turbulence Components in Speech[J]. IEEE Trans. on Speech and Audio Processing, 2001,9(7): 713-726.
  • 10Brown G J, Cooke M. Computational auditory scene analysis[J]. Computer Speech and Language, 1994, 8: 297-336.

共引文献9

同被引文献23

  • 1赵鹤鸣,葛良,陈雪勤,俞一彪.基于声音定位和听觉掩蔽效应的语音分离研究[J].电子学报,2005,33(1):158-160. 被引量:16
  • 2王珊,许刚.基于计算听觉场景分析的语音混叠信号分离[J].计算机工程,2007,33(18):211-213. 被引量:1
  • 3Wang Deliang, Brown G J. Computational Auditory Scene Analysis: Principles, Algorithms, and Applications [M]. USA.- IEEE Press, 2006.
  • 4Hu Guoning, Wang Deliang. Segregation of unvoiced speech from nonspeech interferences[J]. Journal of the Acoustical Society of America, 2008,124(2): 1306-1379.
  • 5Hu Ke, Wang Deliang. Unvoiced speech segregation from nonspeech interference via CASA and spectral substraction [J]. IEEE Transactions on Audio, Speech and Language Pro cessing, 2011,19(6) : 1600-1609.
  • 6Hu Guoning, Wang Deliang. Monaural speech segregation based on pitch tracking and amplitude modulation [J]. IEEE Transactions on Neural Networks, 2004, 15(5):1135-1149.
  • 7Hu Guoning, Wang Deliang. Auditory segmentation based on onset and offset analysis[J]. IEEE Transactions on Speech and Audio Processing, 2007,15(2): 396 -405.
  • 8Wang Yu, Lin Jiajun, Chen Ning, et al. Improved monaural speech segregation based on computational auditory scene analysis [J]. EURASIP Journal on Audio, Speech, and Music Processing,2013(2) : 1-15.
  • 9Kuldip Paliwal, Kamil Wojcicki, Belinda Schwerin. Single- channel speech enhancement using spectral subtraction in the short-time modulation domain[ J]. Speech Communication, 2010, 52(5) :450-475.
  • 10Hu Ke, Wang Deliang. An unsupervised approach to cochan- nel speech separation [J]. IEEE Transactions on Audio, Speech, and Language Processing, 2013,21(1): 120-129.

引证文献2

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部