Single-Channel Speech Enhancement Based on Improved Frame-Iterative Spectral Subtraction in the Modulation Domain (cited by: 1)

Abstract: To address the musical noise introduced by classical spectral subtraction, a short-time modulation (STM) domain spectral subtraction method has been successfully applied to single-channel speech enhancement. However, owing to inaccurate voice activity detection (VAD), the residual musical noise and the enhancement performance still need improvement, especially in low signal-to-noise ratio (SNR) scenarios. To address this issue, an improved frame-iterative spectral subtraction in the STM domain (IMModSSub) is proposed. More specifically, exploiting inter-frame correlation, noise subtraction is applied directly to each frame of the noisy signal in the STM domain. The noisy signal is then classified into speech or silence frames based on a predefined segmental SNR threshold. From these classification results, a corresponding mask function is developed for the noisy speech after noise subtraction. Finally, exploiting the increased sparsity of the speech signal in the modulation domain, the orthogonal matching pursuit (OMP) technique is applied to the speech frames to improve speech quality and intelligibility. The effectiveness of the proposed method is evaluated with three types of noise: white noise, pink noise, and HF channel noise. The results show that the proposed method outperforms some established baselines at lower SNRs (-5 to +5 dB).
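The abstract's final step applies orthogonal matching pursuit (OMP) to the speech frames. As a rough illustration of that technique only, the following is a minimal NumPy sketch of generic OMP. It is not the authors' implementation: the abstract does not state the dictionary, the sparsity level, or the stopping rule, so the function name `omp`, the orthonormal dictionary `Q`, the sparsity `n_nonzero=3`, and the tolerance are all assumptions chosen to keep the demo small.

```python
import numpy as np

def omp(D, y, n_nonzero, tol=1e-10):
    """Generic OMP sketch: approximate y using at most n_nonzero columns of D.

    D is a hypothetical dictionary (columns = atoms); the paper's actual
    dictionary and parameters are not given in the abstract.
    """
    coef = np.zeros(D.shape[1])
    residual = np.asarray(y, dtype=float).copy()
    support = []
    for _ in range(n_nonzero):
        # Select the atom most correlated with the current residual.
        correlations = np.abs(D.T @ residual)
        if support:
            correlations[support] = 0.0  # never pick the same atom twice
        support.append(int(np.argmax(correlations)))
        # Refit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
        coef[:] = 0.0
        coef[support] = sol
        if np.linalg.norm(residual) <= tol:
            break
    return coef, support

# Toy demo (assumed setup): an orthonormal dictionary and a 3-sparse vector.
rng = np.random.default_rng(0)
Q, _ = np.linalg.qr(rng.standard_normal((16, 16)))
x_true = np.zeros(16)
x_true[[2, 5, 11]] = [1.5, -2.0, 0.7]
coef, support = omp(Q, Q @ x_true, n_nonzero=3)
```

With an orthonormal dictionary, each greedy selection picks the largest remaining coefficient, so the toy signal is recovered in three iterations. A practical speech dictionary would generally be overcomplete, where recovery depends on the dictionary's coherence.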

Authors: Chao Li, Ting Jiang, Sheng Wu

Affiliation: School of Information and Communication Engineering

Source: China Communications (中国通信, English edition), indexed in SCIE and CSCD, 2021, No. 9, pp. 100-115 (16 pages)

Funding: National Natural Science Foundation of China (NSFC) (No. 61671075); Major Program of the National Natural Science Foundation of China (No. 61631003).

Keywords: short-time modulation domain; single-channel speech enhancement; modulation; improved frame iterative spectral subtraction; low SNRs

Classification code: TN912.35 [Electronics and Telecommunications: Communication and Information Systems]
