期刊文献+

基于声源方位信息和非线性时频掩蔽的语音盲提取算法 被引量:10

Speech blind extraction algorithm based on sound source azimuth information and nonlinear time-frequency masking
原文传递
导出
摘要 针对欠定卷积混合的语音信号模型,提出一种基于声源方位信息和非线性时频掩蔽的语音盲提取算法。首先对低频段混合语音信号进行时频分析估计瞬时相对时延(ITD)并采用势函数聚类分析方法估计出声源个数及其ITD,接着锁定目标提取准确的目标语音方位信息,最后利用独立语音在时频域上的近似W一分离正交性,采用非线性时频掩蔽的方法提取目标语音。仿真实验表明,该方法能锁定任意感兴趣目标方位,能有效提取目标语音,文中实验条件下信噪比增益平均达9.5 dB。 For the underdetermined convolution mixture model, a new speech blind extraction algorithm based on sound source azimuth information and nonlinear time-frequency masking was proposed. At first, instantaneous ITDs were calculated through time-frequency analysis in lower frequency domain, and the number of sources and their ITDs were estimated using the potential function. Then the object source was locked and accurate azimuth information of object was estimated. At last, the object speech was extracted via nonlinear time-frequency masking which was based on the azimuth information of object. Simulation results showed that our proposed speech extraction algorithm can lock interested object speech from random direction and extract object speech effectively, the signal-noise-ratio gain (SNRG) was obtained 9.5 dB averagely in our experiment condition.
出处 《声学学报》 EI CSCD 北大核心 2013年第2期224-230,共7页 Acta Acustica
基金 国家自然科学基金(61071159)资助项目
关键词 盲提取算法 语音信号 时频分析 方位信息 非线性 掩蔽 声源 聚类分析方法 Acoustic generators Blind source separation
  • 相关文献

参考文献4

二级参考文献34

  • 1章晋龙,谢胜利,何昭水.盲分离问题的可分性理论(英文)[J].自动化学报,2004,30(3):337-344. 被引量:6
  • 2饶丹,谢菠荪,谢志文.双通路立体声条件下的双耳掩蔽[J].电声技术,2005,29(2):53-56. 被引量:8
  • 3谢志文,尹俊勋,饶丹.空间掩蔽效应的实验研究[J].声学学报,2006,31(4):363-369. 被引量:10
  • 4Freymaaa et al. The role of perceived spatial separation in the unmasking of speech. J. Acoust. Soc. Am., 1999; 106:3578-3588
  • 5Good et al. The relation between detection in noise and localization in noise in the free field. Binaural and Spatial Heaving in Real and Virtual Environments, Edited by R.Gilkey and T. Anderson Erlbaum, New York, 1997: 349-376
  • 6Doll T J, Hanna T E. Spatial and spectral release from masking in three-dimensional auditory displays. Hum.Factors, 1995; 37:341-355
  • 7Gatehouse R W. Further research on free-field masking. J.Acoust. Soc. Am. 1987; 82(Suppl.1): S108
  • 8Moore B C J. An introduction to the psychology of hearing. Second Edition, Academic Press, Orlando, F1, USA,1982, Chapter 5
  • 9Johnston J D, Ferreira A J. Sum-difference stereo transfer coding. In: Proc. IEEE ICASSP, 1992:569-571
  • 10Douglas S et al. The effects of spatial separation in distance on the informational and energetic masking of a nearby speech signal. J. Acoust. Soc. Am., 2002; 112(2): 664-676

共引文献60

同被引文献105

引证文献10

二级引证文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部