摘要
为了对TELPC算法所提取的频谱包络进行感知增强,提出一种利用人耳感知特性的频谱包络估计新算法.该算法首先采用真实包络估计器实时算法提取频谱包络,然后通过美尔卷曲对频谱包络进行感知增强,最后对卷曲的频谱包络进行线性预测分析.为了进一步提高线性预测分析的性能,该算法还对卷曲的频谱包络进行幅度压缩.美尔卷曲采用傅里叶变换对和频谱包络线性内插两种方法实现.客观测试表明,新算法所提取频谱包络的对数谱失真和谱平坦度在低频段均小于原TELPC算法所提取的频谱包络,且幅度压缩使算法性能更佳.
In order to perceptually enhance the spectral envelope extracted via TELPC algorithm,a new estimation algorithm of spectral envelope is proposed using the property of auditory perception.In this algorithm,the spectral envelope is extracted using a real-time true envelope(TE) estimator and is perceptually enhanced via the Mel-warping.Then,a linear predictive analysis is performed for the warped envelope.Moreover,the warped envelope is compressed to further improve the performance of linear prediction coding(LPC).The Mel-warping is implemented in two ways: one is by the Fourier transform pair and the other is by the linear interpolation of spectral envelope.Test results indicate that,as compared with the existing TELPC algorithm,the proposed algorithm is more effective because it results in smaller log-spectral distortion and spectral flatness in the low-frequency band,as well as higher performance due to the envelope compression.
出处
《华南理工大学学报(自然科学版)》
EI
CAS
CSCD
北大核心
2011年第2期26-31,共6页
Journal of South China University of Technology(Natural Science Edition)
基金
国家杰出青年科学基金资助项目(60725105)
教育部长江学者和创新团队发展计划项目(IRT0852)
国家自然科学基金资助项目(61072068)
关键词
语音分析
包络检测器
倒谱分析
线性预测编码
美尔卷曲
speech analysis
envelope detector
cepstrum analysis
linear predictive coding
Mel-warping