
Smart Microphone Arrays for Speech Sources Separation and Speaker Tracking (智能麦克风阵列语音分离和说话人跟踪技术研究) Cited by: 9
Abstract: This paper introduces a new technique for speech source separation and speaker tracking based on microphone arrays. Exploiting the spatial properties of the speech signals received by the array, the method uses beamforming to estimate the direction of arrival (DOA) of the speaker of interest and to enhance that speaker's signal, and it suppresses other speakers' voices and noise by placing nulls in their directions. Because the speaker may move freely and background voices may be present, an adaptive algorithm automatically tracks the speaker's movements and the resulting changes in source direction. Computer simulations validate the effectiveness of the technique. Unlike conventional adaptive algorithms, the scheme needs no training sequence, which gives it a significant practical advantage.
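The abstract gives no formulas, so the following is only a minimal sketch of the general idea of "a beam toward one speaker and a null toward another", not the authors' algorithm. It assumes a narrowband, far-field model and a uniform linear array with half-wavelength spacing; the function names (`steering_vector`, `null_steering_weights`, `response`) and all parameter values are hypothetical choices made for illustration. The weights are the minimum-norm solution that keeps unit gain in the target direction and zero gain in the interferer direction.

```python
import cmath
import math


def steering_vector(theta_deg, num_mics=8, spacing_wavelengths=0.5):
    """Narrowband far-field steering vector of a uniform linear array (assumed model)."""
    phase = 2.0 * math.pi * spacing_wavelengths * math.sin(math.radians(theta_deg))
    return [cmath.exp(-1j * phase * m) for m in range(num_mics)]


def inner(x, y):
    """Hermitian inner product x^H y."""
    return sum(xi.conjugate() * yi for xi, yi in zip(x, y))


def null_steering_weights(target_deg, interferer_deg, num_mics=8):
    """Minimum-norm weights with unit gain toward the target and a null toward the interferer."""
    a_s = steering_vector(target_deg, num_mics)
    a_i = steering_vector(interferer_deg, num_mics)
    m = inner(a_s, a_s).real          # equals num_mics
    g = inner(a_s, a_i)               # coupling between the two directions
    det = m * m - abs(g) ** 2         # vanishes when the two directions are indistinguishable
    if det < 1e-9:
        raise ValueError("target and interferer directions cannot be separated")
    # Closed-form solution of the two linear constraints a_s^H w = 1 and a_i^H w = 0.
    return [(m * s - g.conjugate() * i) / det for s, i in zip(a_s, a_i)]


def response(weights, theta_deg):
    """Magnitude of the array response in direction theta_deg."""
    return abs(inner(weights, steering_vector(theta_deg, len(weights))))


if __name__ == "__main__":
    w = null_steering_weights(target_deg=20.0, interferer_deg=-40.0)
    print("gain toward target:    ", response(w, 20.0))
    print("gain toward interferer:", response(w, -40.0))
```

In this simplified setting, following a moving speaker would amount to recomputing the weights whenever the direction estimate changes; how the paper's adaptive algorithm obtains and updates that estimate without a training sequence is not described in the abstract.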
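Authors: Du Jiang (杜江), Zhu Ke (朱柯)
Source: Acta Electronica Sinica (《电子学报》), 2005, No. 2, pp. 382-384 (3 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Keywords: microphone array; speech separation; speaker tracking; beamforming; acoustic wave propagation; adaptive algorithms; computer simulation; microphones; neural networks; signal processing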

References (7)

  • [1] G. Erten, F. M. Salam. Voice extraction by on-line signal separation and recovery [J]. IEEE Transactions on Circuits and Systems-II: Analog and Digital Signal Processing, July 1999, CAS-46(7): 912-922.
  • [2] C. Jutten, J. Herault. Blind separation of sources, Part I: an adaptive algorithm based on neuromimetic architecture [J]. Signal Processing, July 1991, 24(1): 1-10.
  • [3] E. Weinstein, M. Feder, A. V. Oppenheim. Multi-channel signal separation by decorrelation [J]. IEEE Trans. Speech Audio Processing, Oct. 1993, 1: 405-413.
  • [4] C. Jutten, J. Herault. Blind separation of sources, Part II: problems statement [J]. Signal Processing, July 1991, 24(1): 11-20.
  • [5] R. Zelinski. A microphone array with adaptive post-filtering for noise reduction in reverberant rooms [A]. In Proceedings of ICASSP-88 [C]. IEEE, New York, April 1988. 2578-2580.
  • [6] Iain A. McCowan, Darren C. Moore, S. Sridharan. Near-field adaptive beamformer for robust speech recognition [J]. Digital Signal Processing, 2002, 12: 87-106.
  • [7] Matthew R. Bielefeld, Lynn M. Supplee. Developing a test program for the DoD 2400 bps vocoder selection process [A]. In Proceedings of ICASSP-96 [C]. 1996. 1141-1144.

Co-cited documents: 46

Citing documents: 9

Second-level citing documents: 39
