基于数据驱动字典和稀疏表示的语音增强被引量：14

Speech Enhancement Based on Data-Driven Dictionary and Sparse Representation

下载PDF

导出

摘要本文提出了一种基于数据驱动字典和过完备稀疏表示的自适应语音增强方法。首先在训练阶段采用干净语音基于K奇异值分解(K—singular value decomposition,K-SVD)算法训练过完备字典,然后在测试阶段根据含噪语音的噪声方差自适应选择最优的阈值,采用正交匹配追踪算法对含噪语音信号在过完备字典上进行稀疏分解,最后利用系数稀疏表示重构语音信号,从而达到语音增强的目。该方法不像传统语音增强方法那样减少或消去噪声,而是从字典中选取适当的原子表示纯净信号,从而把纯净信号从含噪信号中分离出来。对白噪声和有色噪声环境下重构语音进行了主客观评价。仿真结果显示:该方法能有效去除加性噪声,并且改善了语音质量。 An adaptive speech enhancement method based on Data-Driven Dictionary and overcompletely sparse representation theory is proposed.Firstly,using the K-singular value decomposition（K-SVD） algorithm,a dictionary that describes the clean speech content effectively is trained.Secondly,the prime threshold is adaptively selected according to noise variance of original noisy speech signal and the speech signal＇s sparsest coefficient vector is obtained through Orthogonal Matching Pursuit algorithm.And then the speech signal is recovered and speech enhancement is achieved.Different from the conventional techniques which improve the speech signal quality by suppression of noise and reduction of distortion,we select the appropriate atoms to represent speech signal.Thus, clean signal is separated from the noisy speech signal.In white or colored noise interference,the reconstructed speech signal via the proposed algorithm is evaluated by the objective and subjective evaluation.The experimental results show that the proposed algorithm can get ride of the addictive noise and improve speech quality.

作者孙林慧杨震

机构地区南京邮电大学通信与信息工程学院

出处《信号处理》 CSCD 北大核心 2011年第12期1793-1800,共8页 Journal of Signal Processing

基金国家重大基础研究973课题(2011CB302903) 国家自然科学基金项目(60971129) 江苏省普通高校研究生科研创新计划项目(CX10B_191Z,CX10B_189Z)

关键词语音增强稀疏表示过完备字典正交匹配追踪奇异值分解算法 speech enhancement sparse representation overcomplete dictionary Orthogonal Matching Pursuit singular value decomposition algorithm

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献15

1Benesty J,Makino S,Chen J.Speech enhancement[M].Berlin,Germany:Springer,2005.
2Hao J C,Attias H,Nagarajan S,Lee T W,Sejnowski T J.Speech enhancement,gain,and noise spectrum adaptation using approximate bayesian estimation[J].IEEE Transactions on Audio,Speech,and Language Processing,2009,17(1):24-37.
3Yoshioka T,Nakatani T,Okuno H G.Noisy speech enhancement based on prior knowledge about spectral envelope and harmonic structure[A].2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)[C],2010:4270-4273.
4Tantibundhit C,Pernkopf F,Kubin G.Joint time-frequency segmentation algorithm for transient speech decomposition and speech enhancement[J].IEEE Transactions on Audio,Speech,and Language Processing,2010,18(6):1417-1428.
5Mallat S,Zhang Z.Matching pursuits with time-frequency dictionaries[J].IEEE Transactions on Signal Processing,1993,41:3397-3415.
6Gowreesunker B V,Tewfik A H.Learning sparse representation using iterative subspace identification[J].IEEE Transactions on Signal Processing,2010,58 (6):3055-3065.
7Aharon M,Elad M,Bruckstein A.K-SVD:an algorithm for designing overcomplete dictionaries for sparse representation[J].IEEE Transactions on Signal Processing,2006,54(11):4311-4322.
8Donoho D,Johnstone I M.Ideal spatial adaptation by wavelet shrinkage[J].Biomet rika,1994,81(3):425-455.
9Chen S S,Donoho D L,and Saunders M A.Atomic decomposition by basis pursuit[J].SIAM Review,2001,43(1):129-159.
10Griffin A,Tsakalides P.Compressed sensing of audio signals using multiple sensors[A].in Proc.16th European Signal Processing Conference (EUSIPCO'08)[C],Lausanne,Switzerland,2008.

同被引文献109

1张春梅,尹忠科,肖明霞.基于冗余字典的信号超完备表示与稀疏分解[J].科学通报,2006,51(6):628-633. 被引量：71
2邹霞,陈亮,张雄伟.基于Gamma语音模型的语音增强算法[J].通信学报,2006,27(10):118-123. 被引量：11
3Donoho D. Compressed sensing [ J ]. IEEE Transactions on Information Theory, 2006, 52(4) : 1289-1306.
4Tsaig Y, Donoho D. Extensions of compressed sensing [ J ]. Signal Processing, 2006, 86 (3) : 533-548.
5Chen S, Donoho D, Saunders M. Atomic decomposition by basis pursuit[J]. SIAM REVIEW, 2001, 43(1) :129-159.
6Candes E. Compressive sampling[ A]. Proceedings of the International Congress of Mathematicians [ C ] . Madrid, Spain,European Mathematioal Society Publishing House, 2006 : 1433-1452.
7Andrecut M, Este R A, Kauffman S A. Competitive opti- mization of compressed sensing [ J ]. Journal of Physics A : Mathematical and Theoretical, 2007, (40) : 299-305.
8Donoho D. For most underdetermined systems of linear e- quations, the minimal l2 norm near-solution approximates the sparsest near-solution [ EB/OL ]. http://www-stat. stanford, edu/donoho/Reports, 2007.
9Candes E, Tao T. Decoding by linear programming [ J]. IEEE, Trans Inf Theory, 2005(51 ) : 4203-4215.
10Giacobello D, Christensen M G, Murthi M N, Jensen S H, Moonen M. Retrieving sparse patterns using a com- pressed sensing framework : Applications to speech coding based on sparse linear prediction [ J ]. Signal Processing Letters, IEEE,2010, 17(1) : 103-106.

引证文献14

1叶蕾,杨震,孙林慧,郭海燕.行阶梯观测矩阵下语音压缩感知观测序列的Volterra+Wiener模型研究[J].信号处理,2013,29(7):816-822. 被引量：3
2李轶南,张雄伟,曾理,黄建军.改进的稀疏字典学习单通道语音增强算法[J].信号处理,2014,30(1):44-50. 被引量：12
3胡永刚,张雄伟,邹霞,张立伟,郑云飞.贝叶斯非负矩阵分解语音增强的优化算法[J].解放军理工大学学报（自然科学版）,2015,16(1):1-6. 被引量：2
4杨爱萍,田玉针,何宇清,董翠翠.基于改进K-SVD和非局部正则化的图像去噪[J].计算机工程,2015,41(5):249-253. 被引量：10
5崔晓.自训练过完备字典和稀疏表示的语音增强[J].现代电子技术,2015,38(13):56-58. 被引量：3
6靳立燕,陈莉,樊泰亭,高晶.基于奇异谱分析和维纳滤波的语音去噪算法[J].计算机应用,2015,35(8):2336-2340. 被引量：12
7周伟栋,杨震,于云.改进的正交匹配追踪语音增强算法[J].信号处理,2016,32(3):287-295. 被引量：8
8赵红玉,李小勇,何军政.压缩感知应用于透地无线通信初探[J].内蒙古科技与经济,2016(11):110-111.
9郭欣,贾海蓉,王栋.利用子空间改进的K-SVD语音增强算法[J].西安电子科技大学学报,2016,43(6):109-115. 被引量：4
10周伟力,贺前华,王亚楼,庞文丰.基于自适应逼近残差的稀疏表示语音降噪方法[J].电子与信息学报,2017,39(2):309-315. 被引量：4

二级引证文献59

1邹丽,唐文娟.一种改进的变步长前后向追踪重建算法[J].南通大学学报（自然科学版）,2014,13(2):7-12.
2崔晓.自训练过完备字典和稀疏表示的语音增强[J].现代电子技术,2015,38(13):56-58. 被引量：3
3胡永刚,张雄伟,邹霞,闵刚,郑云飞,李莉,石佳佳.改进的非负矩阵分解语音增强算法[J].信号处理,2015,31(9):1117-1123. 被引量：7
4黄智,付兴武,刘万军.混合相似性权重的非局部均值去噪算法[J].计算机应用,2016,36(2):556-562. 被引量：8
5陆真,裴东兴.基于连续小波阈值函数的语音增强技术[J].山西电子技术,2016(1):40-42. 被引量：1
6周岩,王雪瑞.基于差分演化-MP的快速信号稀疏分解[J].洛阳理工学院学报（自然科学版）,2016,26(1):64-69.
7于云,周伟栋.基于压缩感知的鲁棒性说话人识别参数研究[J].计算机技术与发展,2016,26(3):18-22. 被引量：1
8冯丽慧.矿区塌陷区遥感影像改进自适应维纳滤波算法[J].金属矿山,2016,45(7):151-154. 被引量：5
9王强,张培林,王怀光,吴定海,张云强.基于稀疏分解的振动信号数据压缩算法[J].仪器仪表学报,2016,37(11):2497-2505. 被引量：9
10王雪瑞,周岩.基于差分演化-MP的快速信号稀疏分解[J].商丘师范学院学报,2016,32(12):45-49.

1郭欣,贾海蓉,王栋.利用子空间改进的K-SVD语音增强算法[J].西安电子科技大学学报,2016,43(6):109-115. 被引量：4
2张惠媛,林志耀,滕建辅.开关电容∑△调制器设计中电容值的计算[J].电路与系统学报,1999,4(3):63-68.
3周若飞,王钢.一种基于压缩感知的斑点噪声去除算法[J].无线电通信技术,2017,43(2):25-28.
4原菲,司占军,赵永光.移动智能终端视频质量评价的应用研究[J].包装世界,2014(3):38-39.
5陈建强.胆石之争：对主客观评价的反思[J].无线电与电视,1996(6):5-6.
6冯博,杜兰,张学峰,刘宏伟.基于字典学习的雷达高分辨距离像目标识别[J].电波科学学报,2012,27(5):897-905. 被引量：8
7季正燕,陈辉,张佳佳,陆晓飞.两种基于SVD的稀疏重构解相干改进算法[J].空军预警学院学报,2017,31(1):5-10. 被引量：5
8秦晓伟,郭建中.K-SVD算法的超声图像加性噪声去噪研究[J].陕西师范大学学报（自然科学版）,2012,40(6):42-46. 被引量：2
9罗武骏,陶文凤,左加阔,赵力.自适应语音压缩感知方法[J].东南大学学报（自然科学版）,2012,42(6):1027-1030. 被引量：3
10谢高觉.传真信号频谱的不对称抑制传输[J].电信科学,1958(10):11-16.

信号处理

2011年第12期

浏览历史

内容加载中请稍等...

基于数据驱动字典和稀疏表示的语音增强被引量：14

参考文献15

同被引文献109

引证文献14

二级引证文献59

相关作者

相关机构

相关主题

浏览历史

基于数据驱动字典和稀疏表示的语音增强 被引量：14

参考文献15

同被引文献109

引证文献14

二级引证文献59

相关作者

相关机构

相关主题

浏览历史

基于数据驱动字典和稀疏表示的语音增强被引量：14