Speech wideband extension based on Gaussian mixture model 被引量：4

Speech wideband extension based on Gaussian mixture model

导出

摘要 To decrease the spectral distortion of highband envelope, the function of spectral distortion and mutual information between feature vector and highband envelope was studied, and an extended Gaussian Mixture Model （GMM） bandwidth extension algorithm was proposed based on the research. The feature parameters which have larger mutual information with highband envelope were selected to constitute the feature vector, and the GMM was adopted to compute the joint probability density of the feature vector and highband envelope. Then the highband envelope was estimated via the posterior probabilities computed from the model parameters estimated by Expectation-Maximization （EM） algorithm. The experimental results show that the spectral distortion is lower than the algorithm, such as the traditional algorithm based on GMM, by 0.3 dB and the number of frames with spectral distortion over 10 dB sharply reduced over 50%. To decrease the spectral distortion of highband envelope, the function of spectral distortion and mutual information between feature vector and highband envelope was studied, and an extended Gaussian Mixture Model （GMM） bandwidth extension algorithm was proposed based on the research. The feature parameters which have larger mutual information with highband envelope were selected to constitute the feature vector, and the GMM was adopted to compute the joint probability density of the feature vector and highband envelope. Then the highband envelope was estimated via the posterior probabilities computed from the model parameters estimated by Expectation-Maximization （EM） algorithm. The experimental results show that the spectral distortion is lower than the algorithm, such as the traditional algorithm based on GMM, by 0.3 dB and the number of frames with spectral distortion over 10 dB sharply reduced over 50%.

作者 ZHANG Yong HU Ruimin

机构地区 National Engineering Research Center for Multimedia software

出处《Chinese Journal of Acoustics》 2009年第4期362-377,共16页 声学学报（英文版）

分类号 TN912.34 [电子电信—通信与信息系统] TP13 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献2

1李淑红,桑恩方.基于小波变换和矢量量化的语音压缩编码方案[J].声学学报,2000,25(1):50-55. 被引量：8
2郎玥,赵胜辉,匡镜明.基于矢量量化的语音信号频带扩展[J].北京理工大学学报,2005,25(3):260-264. 被引量：4

二级参考文献15

1崔锦泰程正兴（译）.小波分析导论[M].西安交通大学出版社,1994..
2程正兴（译），小波分析导论，1995年
3Sang Enfang，Proc. of HICEC’92，1992年，715页
4Yang X，IEEE Trans Information Theory，1992年，38卷，2期，824页
5陈永彬，语言信号处理，1990年
6Cheng Y M, O'Shaugnessy D, Mermelstein P. Statistical recovery of wideband speech from narrowband speech[J]. IEEE Transaction Speech Audio Process, 1994(2): 544-548.
7Park K Y, Kim H S. Narrowband to wideband conversion of speech using GMM based transformation[Z]. International Conference on Acoustic Speech Signal Process, Istanbul, 2000.
8Jax P, Vary P. Wideband extension of telephone speech using a hidden markov model[A]. IEEE Workshop on Speech Coding[C], Delavan: IEEE,2000.
9Yoshida Y, Abe M. An algorithm to reconstruct wideband speech from narrow band speech based on codebook mapping[Z]. IEEE International Conference on Spoken Language Processing, Yokohama, 1994.
10Enbom N, Kleijn W B. Bandwidth expansion of speech based on vector quantization of the mel frequency cepstral coefficients[A]. IEEE Workshop on Speech Coding[C], Porvoo, Finland: IEEE,1999.

共引文献9

1康永国,双志伟,陶建华,张维.基于混合映射模型的语音转换算法研究[J].声学学报,2006,31(6):555-562. 被引量：13
2张勇,胡瑞敏.基于高斯混合模型的语音带宽扩展算法的研究[J].声学学报,2009,34(5):471-480. 被引量：7
3赵丹,马胜前,郑杰.基于SPIHT编码的语音信号压缩算法[J].计算机工程与应用,2011,47(9):142-145. 被引量：1
4叶蕾,杨震,孙林慧.基于压缩感知的低速率语音编码新方案[J].仪器仪表学报,2011,32(12):2688-2692. 被引量：2
5孙广武,戴永,喻世东,李璇.音素关联的多文种语音融合编码方法[J].计算机工程与应用,2013,49(19):217-221. 被引量：6
6ZHANG Yong,LIU Yi.Narrowband speech wideband extension algorithm research[J].Chinese Journal of Acoustics,2014,33(2):178-191.
7张涌,徐宏炳.基于PSOLA算法的单片机TTS系统的研究及实现[J].电子工程师,2002,28(2):1-3.
8张勇,刘轶.窄带语音带宽扩展算法研究[J].声学学报,2014,39(6):764-773. 被引量：5
9张涌.车载嵌入式汉语语音合成系统的研究及实现[J].轻型汽车技术,2003(6):4-7.

同被引文献14

1郎玥,赵胜辉,匡镜明.基于矢量量化的语音信号频带扩展[J].北京理工大学学报,2005,25(3):260-264. 被引量：4
2赵俊渭,丁玮,阎宜生,赵日昌,王荣庆,王之程.收发分置水下目标声散射特性的实验研究[J].声学学报,1997,22(2):123-131. 被引量：11
3艾玲梅,王珏.基于双谱分析和支持向量机的手震颤加速度信号识别[J].电子学报,2008,36(11):2165-2170. 被引量：6
4马元锋,陈克安,王娜,郑文.听觉模型输出谱特征在声目标识别中的应用[J].声学学报,2009,34(2):142-150. 被引量：20
5张勇,胡瑞敏.基于高斯混合模型的语音带宽扩展算法的研究[J].声学学报,2009,34(5):471-480. 被引量：7
6刘传武,张智军,豆仁福.基于时域双谱特征的雷达目标识别[J].数据采集与处理,2009,24(6):709-713. 被引量：3
7张琳,黄敏.基于EMD与切片双谱的轴承故障诊断方法[J].北京航空航天大学学报,2010,36(3):287-290. 被引量：18
8李晓良,胡程,曾涛.多极化前向散射RCS分析及其对目标分类识别的影响[J].电子与信息学报,2010,32(9):2191-2196. 被引量：8
9陈新亮,胡程,曾涛.一种基于前向散射雷达的车辆目标自动识别方法[J].中国科学：信息科学,2012,42(11):1471-1480. 被引量：8
10梁岩,鲍长春,夏丙寅,何玉文,周璇,李娜.基于高斯混合模型的压缩域语音增强方法[J].电子学报,2012,40(10):2031-2038. 被引量：9

引证文献4

1ZHANG Yong,LIU Yi.Narrowband speech wideband extension algorithm research[J].Chinese Journal of Acoustics,2014,33(2):178-191.
2温涛,许枫,王梦宾,杨娟,闫路.预测特征误差映射及其在多基地水下目标识别中的应用[J].声学学报,2019,44(1):57-67. 被引量：2
3白海钏,鲍长春,刘鑫.基于局部最小二乘支持向量机的音频频带扩展方法[J].电子学报,2016,44(9):2203-2210. 被引量：3
4陈楠,鲍长春.基于双耳线索编码原理的语音增强方法[J].电子学报,2019,47(1):227-233. 被引量：3

二级引证文献8

1王威.高流量负荷下基于支持向量机的空间数据聚类方法[J].微电子学与计算机,2017,34(8):137-140.
2范珍艳,王莲子,庄晓东.基于FrFT的自适应阈值语音滤波降噪研究[J].青岛大学学报（工程技术版）,2019,34(4):18-23. 被引量：1
3方佳艳,刘峤.具有同步化特征选择的迭代紧凑非平行支持向量聚类算法[J].电子学报,2020,48(1):44-58. 被引量：7
4李思源,姜林.基于MDCT的线性带宽扩展方法[J].智能计算机与应用,2020,10(3):69-71.
5袁文浩,胡少东,时云龙,李钊,梁春燕.一种用于语音增强的卷积门控循环网络[J].电子学报,2020,48(7):1276-1283. 被引量：12
6肖鑫鑫.复杂噪声环境下的普通话测试系统设计[J].信息技术,2020,44(11):78-82. 被引量：1
7盛德奎.基于深度学习的水下移动目标快速识别方法研究[J].自动化与仪器仪表,2021(12):8-11. 被引量：1
8王佳维,许枫,杨娟.基于核空间联合稀疏表示和指数平滑的多基地水下小目标识别[J].电子学报,2024,52(1):217-231. 被引量：1

1马春波,王龙超,赵兰兰,李睿.基于DCT域特征的JPEG图像隐写分析[J].计算机与数字工程,2015,43(12):2266-2270.
2李卓,陈健,蒋晓宁,曾宪庭,潘雪增.基于多域特征的JPEG图像盲检测算法[J].浙江大学学报（工学版）,2011,45(9):1528-1538. 被引量：9
3关俊波,马春波,敖珺.基于校准DCT特征的JPEG图像隐写分析[J].计算机应用与软件,2014,31(1):229-231.
4洪浩,肖立民,张焱,王京.共存式频谱共享认知中继网络的中断性能研究[J].电波科学学报,2013,28(5):805-809. 被引量：2
5文志强,胡永祥,朱文球.流形上的k最近邻分类方法[J].计算机应用,2012,32(12):3311-3314. 被引量：3
6吴炜,沈占锋,李均力,杨海平,骆剑承.联合概率密度脊提取的影像镶嵌色彩一致性处理方法[J].测绘学报,2013,42(2):247-252. 被引量：10
7顾国松.基于类跟踪门的机动目标跟踪(英文)[J].光电工程,2013,40(2):23-31.
8周治平,朱丹.基于PPFFT和DCT系数的邻近联合概率密度的重采样检测[J].小型微型计算机系统,2015,36(4):868-871.
9谭丞,李晓敏,徐立军,张琦.一种基于联合概率密度判别器的新煤种在线辨识方法[J].仪器仪表学报,2010,31(6):1229-1234. 被引量：1
10袁泉,杨杰,杜春华,吴证.基于直方图统计学习的人脸检测方法[J].计算机工程,2008,34(19):182-184. 被引量：2

Chinese Journal of Acoustics

2009年第4期

浏览历史

内容加载中请稍等...