基于多项式拟合的中性-情感模型转换算法被引量：1

Neutral-emotion model transformation algorithm based on polynomial function fitting.

下载PDF

导出

摘要情绪变化问题是说话人识别技术面临的一个难题。为了解决该问题,提出了基于多项式方程拟合的中性-情感模型转换算法。该算法建立了中性模型和情感模型之间的函数关系,只需要说话人的中性语音就能训练其各种情感类型的说话人模型。在普通话情感语音库上的实验表明,采用该方法后识别算法的等错误率由16.06%降低到10.31%,提高了系统性能。 One of the largest challenges in speaker recognition is dealing with speaker-emotion variability problem.A neutral-emotion model transformation algorithm is presented to overcome this limitation,which builds a relationship between emotion and neutral models.In this method,only neutral speech is needed in training emotion models.The experiments on MASC show that the EER is reduced to 10.31% from the 16.06%,and the recognition performance can be improved by this algorithm.

作者单振宇杨莹春

机构地区浙江大学计算机科学与技术学院

出处《计算机工程与应用》 CSCD 北大核心 2008年第21期206-208,221,共4页 Computer Engineering and Applications

基金国家高技术研究发展计划( 863)( the National High-Tech Research and Development Plan of China under Grant No.2006AA01Z136) 浙江省自然科学基金(the Natural Science Foundation of Zhejiang Province of China under Grant No.Y106705)

关键词说话人识别高斯混合模型情感语音 speaker recognition gaussian mixture model emotion speech

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1Scherer K,Johnstone T,Klasmeyer G.Can automatic speaker verification be improved by training the algorithms on emotional speech[C]// Proceedings of ICSLP2000,Beijing, China,2000.IEEE,2000,2:807-810.
2Scherer K.A cross-cultural investigation of 40 enorm emotion inferences from voice and speech:implication for speech technology[C]// Proceedings of ICSLP2000,Beijing,China,2000IEEE,2000,2:757-760.
3Wu Z,Li D,Yang Y.Rules based feature modification for affective speaker recognition[C]//Proceedings of ICASSP06,May 2006.IEEE, 2006,1 : 661-664.
4Wu W,Zheng F,Xu M,et al.Study on speaker verification on emotional speech[C]//Proceedings of ICSLP06,September 2006.IEEE, 2006,1:2102-2105.
5Shan Z,Yang Y,Wu Z.Natural-emotion GMM transformation algorithm for emotional speaker recognition[C]//Proceedings of Interspeech07,Antwerp Belgium,2007,1:782-785.
6Douglas A,Richard C.Robust text-independent speaker identification using Gaussian mixture speaker models[J].IEEE Transactions on Speech and Audio Processing, 1995,3 ( 1 ) :72-83.
7Charles L,Richard J.Solving least squares problems[M].England: Prentice-Hall, 1995 : 349-356.
8Wu T,Yang Y,Wu Z,et al.MASC:a speech corpus in mandarin for emotion analysis and affective speaker recognition[C]//Proceedings of ODYSSEY06,San Juan,PR,June 2006.IEEE,2006,1 : 1-5.
9Rivaral V,Douglas O,Vishwa G.Compensated Mel frequency cepstrum coefficients[C]//Proceedings of ICASSP96,1996.IEEE, 1996, 1 : 323-326.
10Bimbot F,Bonastre J,Fredouille C,et al.A tutorial on text-independent speaker verification[J].EURASIP Journal on Applied Signal Processing, 2004(4) :430-451.

同被引文献14

1李爱军,邵鹏飞,党建武.情感表达的跨文化多模态感知研究[J].清华大学学报（自然科学版）,2009(S1):1393-1401. 被引量：6
2GHIURCAU M V, RUSU C, ASTOLA J. A study of the effect of emotional state upon text-independent speaker identification [C]// International Conference on Acoustics, Speech and Signal Processing. Prague: IEEE, 2011:4944 - 4947.
3BAO H, XU M, ZHENG T F. Emotion attribute projec- tion {or speaker recognition on emotional speech [C]// 8th Annual Conference of the International Speech Communica- tion Association. Antwerp: IEEE, 2007: 758-761.
4HUANG T, YANG Y. Applying pitch-dependent difference detection and modification to emotional speaker recognition [C] // 9th Annual Conference of the International Speech Communication Association. Brisbane: IEEE, 2008:2751-2754.
5HUANG T, YANG Y. Learning virtual HD model for bi-model emotional speaker recognition [C]// International Conference on Pattern Recognition. Istanbul: IEEE, 2010: 1614-1617.
6SHAN Z, YANG Y. Natural-emotion GMM transformation algorithm for emotional speaker recognition [C] // 8th Annual Conference of the International Speech Communication Association. Antwerp: IEEE, 2007:782 - 785.
7SHAN Z, YANG Y. Learning polynomial function based neutral-emotion GMM transformation for emotional speaker recognition [C]// International Conference on Pattern Reeognition. Tampa: IEEE, 2008: 8-11.
8REYNOLDS D A, ROSE R C. Robust text-independent speaker identification using Gaussian mixture speaker models [J]. IEEE Transactions on Speech and Audio Processing, 1995, 3(1): 72- 83.
9REYNOLDS D A, QUATIERI T F, DUNN Q B. Speaker verification using adapted Gaussian mixture models [J]. Digital Signal Processing, 2000, 10(1/2/3) : 19 -41.
10HERSHEY J R, OLSEN P A. Approximating the Kullback Leibler divergence between Gaussian mixture models [C] //International Conference on Acoustics, Speech, and Signal Processing. Honolulu: IEEE, 2007:317 - 320.

引证文献1

1陈力,杨莹春.基于邻居相似现象的情感说话人识别[J].浙江大学学报（工学版）,2012,46(10):1790-1795. 被引量：1

二级引证文献1

1吴树兴,刘新红.语音短时分析与合成的滤波器实现[J].信息与电脑,2019,0(15):50-52.

1沈沧海.决策系统中统计模型建模方法的研究[J].计算机系统应用,1997,6(5):31-32.
2谭萍,邢玉娟,高翔.说话人模型聚类算法研究与分析[J].中国建材科技,2015,24(5):87-88.
3潘美玲,胡昌海,张明明,朱斌.音乐情感自动分类器研究[J].浙江树人大学学报（自然科学版）,2011,11(4):6-10.
4胡洋,蒲南江,吴黎慧,高磊.基于HMM和ANN的语音情感识别研究[J].电子测试,2011,22(8):33-35. 被引量：4
5王正创.基于MFCC与共振峰的声纹识别算法研究[J].电脑知识与技术,2016,0(2):188-190.
6陈晨,韩纪庆.说话人识别方法综述[J].智能计算机与应用,2015,5(5):92-94. 被引量：3
7张春晖.特定二维人脸的三维真实感重建[J].山西师范大学学报（自然科学版）,2005,19(4):41-44. 被引量：2
8叶吉祥,王聪慧.改进的F-score算法在语音情感识别中的应用[J].计算机工程与应用,2013,49(16):137-141. 被引量：8
9闫乐林,冯希叶.一种基于内容的视频情感类型识别算法[J].计算机系统应用,2011,20(3):102-105. 被引量：2
10韩文静,李海峰,阮华斌,马琳.语音情感识别研究进展综述[J].软件学报,2014,25(1):37-50. 被引量：169

计算机工程与应用

2008年第21期

浏览历史

内容加载中请稍等...

基于多项式拟合的中性-情感模型转换算法被引量：1

参考文献10

同被引文献14

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于多项式拟合的中性-情感模型转换算法 被引量：1

参考文献10

同被引文献14

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于多项式拟合的中性-情感模型转换算法被引量：1