
Short Utterance Speaker Verification Based on Improved I-vector Extraction (cited by: 3)
Abstract: Existing i-vector speaker verification systems degrade when the test speech is a short utterance. To address this, the paper analyzes the source of uncertainty in i-vector estimation for short utterances and improves the computation of the Baum-Welch statistics used in i-vector extraction. The method adds weighted information from historical test speech encountered by the system, together with the pre-estimated parameters of the universal background model, to increase the speaker-specific information available when computing Baum-Welch statistics for a short utterance. The improved statistics are then used to extract the i-vector of the current test speech. Experiments on test speech of different durations show that the improved system outperforms the existing i-vector system: the equal error rate (EER) and the minimum detection cost function (min DCF) decrease by 13% to 19% and 8% to 23%, respectively.
Authors: WANG Zheng (王铮); FU Shan (傅山) (School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China)
Source: Journal of Chinese Computer Systems (《小型微型计算机系统》), indexed in CSCD and the Peking University Core Journals list, 2019, No. 11, pp. 2264-2268 (5 pages)
Funding: Supported by the Science and Technology Project of the East China Branch of State Grid Corporation of China (SA0301503)
Keywords: speaker verification; short utterance; Gaussian mixture model; i-vector; model adaptation
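The abstract describes enriching the Baum-Welch statistics of a short utterance with weighted historical test information and background-model parameters, but it does not give the update equations. The sketch below is therefore only illustrative. The function `baum_welch_stats` computes the standard zeroth- and first-order statistics under a diagonal-covariance GMM. The function `blend_stats`, together with its parameters `alpha` (history weight) and `tau` (background-model weight), is a hypothetical blending rule assumed here for illustration and is not the authors' formula.

```python
import numpy as np

def posteriors(X, weights, means, variances):
    """Per-frame component posteriors under a diagonal-covariance GMM.

    X: (T, D) frames; weights: (C,); means, variances: (C, D).
    """
    diff = X[:, None, :] - means[None, :, :]                  # (T, C, D)
    log_gauss = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                        + np.sum(np.log(2.0 * np.pi * variances), axis=1))
    log_joint = np.log(weights) + log_gauss                   # (T, C)
    log_joint -= log_joint.max(axis=1, keepdims=True)         # numerical stability
    post = np.exp(log_joint)
    return post / post.sum(axis=1, keepdims=True)

def baum_welch_stats(X, weights, means, variances):
    """Standard zeroth-order (N) and first-order (F) Baum-Welch statistics."""
    gamma = posteriors(X, weights, means, variances)
    N = gamma.sum(axis=0)          # (C,)   soft frame counts per component
    F = gamma.T @ X                # (C, D) posterior-weighted frame sums
    return N, F

def blend_stats(N, F, N_hist, F_hist, weights, means, alpha=0.5, tau=1.0):
    """HYPOTHETICAL blending rule (an assumption, not taken from the paper).

    Adds history statistics scaled by alpha and a background-model
    pseudo-count of tau frames distributed according to the GMM weights.
    """
    N_new = N + alpha * N_hist + tau * weights
    F_new = F + alpha * F_hist + tau * weights[:, None] * means
    return N_new, F_new

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy 2-component, 2-dimensional background model (made-up values).
    w = np.array([0.5, 0.5])
    mu = np.array([[0.0, 0.0], [3.0, 3.0]])
    var = np.ones((2, 2))
    short_utt = rng.normal(size=(20, 2))     # few frames: a "short utterance"
    history = rng.normal(size=(200, 2))      # earlier test speech
    N, F = baum_welch_stats(short_utt, w, mu, var)
    N_h, F_h = baum_welch_stats(history, w, mu, var)
    N_b, F_b = blend_stats(N, F, N_h, F_h, w, mu, alpha=0.1, tau=2.0)
    print("raw counts:", N, "blended counts:", N_b)
```

With few frames, the raw counts `N` are small and the first-order statistics are noisy, so any extra mass from history or the background model has a large influence. That is why the choice of weights matters most for the shortest utterances.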
