基于决策树的语音基元语境特征权重训练算法

CARD based context specified weights training algorism for unit selection in speech synthesis

下载PDF

导出

摘要提出了一种基于决策树的语音合成基元的语境特征权重训练算法.对语音数据库中的每个带调音节,利用语境相关的问题集和候选基元的频谱距离建立决策树.对每个要合成的音节,根据其语境特征,获得语音合成系统选择的基元的语境特征F*和该语境特征下决策树叶子结点中基元的语境特征F′.统计F′中每一个语境特征相对于F*的变化,根据语境特征变化的概率对权重进行调整.实验结果表明,这种方法能够训练出合理的语境特征权重,使得合成语音的自然度有一定提高.同时,利用这种方法还可以对语音合成系统进行实时优化. The paper introduces a context specified weights training algorism for contextual features of speech unit in speech synthesis based on Classification and Regression Tree（CART）. A CART is created for each tonal syllable with the spectral distance of each candidate unit and the context dependent question set. The increments of contextual features are counted by comparing the units selected by TTS and given by leaf node of CART. The weights of contextual features are then adjusted in accordance with their probability of increment. The experiments demonstrate that a set of reasonable weights can be trained by the algorism so the naturalness of synthetic speech can also be improved. The algorism can also be used to optimize the speech synthesis system online.

作者杨鸿武郭威彤蔡莲红吴志勇

机构地区西北师范大学物理与电子工程学院清华大学计算机科学与技术系普适计算教育部重点实验室

出处《西北师范大学学报（自然科学版）》 CAS 2007年第4期50-54,共5页 Journal of Northwest Normal University(Natural Science)

基金西北师范大学科研骨干培育项目(NWNU-KJCXGC-03-42)

关键词语音合成文语转换基元选取权重训练 speech synthesis text-to-speech unit selection weight training

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1HUNT A,BLACK A.Unit selection in a concatenative speech synthesis system using a large speech database[C]//ICASSP96,Atlanta:IEEE Press,1996:373-376.
2DONOVAN R E.Trainable Speech Synthesis[D].Cambridge:Cambridge University,1996.
3MERON Y,HIROSE K.Efficient weight training for selection based synthesis[C]//Euro Speech 99.Budapest:ISCA Press,1999:2319-2322.
4FRANCESC A,XAVIER L.Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis[C]//Euro Speech 2003.Geneva:ISCA Press,2003:1333-1336.
5PARK Seung-Seop,KIM Chong-Kyu,KIM Nam-Soo.Discriminative weight training for unit-selection based speech synthesis[C]//Euro Speech 2003.Geneva:ISCA Press,2003:281-284.
6吴志勇,蔡莲红,蔡锐.语音合成中基于听辨指导的权重训练算法[J].清华大学学报（自然科学版）,2005,45(1):52-56. 被引量：1
7BREIMAN L.Classification and Regression Trees[M].Pacific Grove,CA:Wadsworth,1984.
8BLACK A,TAYLOR P.Automatically clustering similar units for unit selection in speech synthesis[C]//Euro Speech 97.Rhodes:ISCA Press,1997,2:601-604.
9YANG Hong-wu,HUANG De-zhi,CAI Lian-hong.Perceptually weighted mel-cepstral analysis of speech based on psychoacoustic model[J].IEICE Transactions on Information and Systems,2006,E89-D(12):2998-3001.
10ZHANG Xiao-nan,XU Jun,CAI Lian-hong.Prosodic boundary prediction based on maximum entropy model with error-driven modification[C]//Garbonell J G,Siekmann J.Lecture Notes in Artificial Intelligence.Berlin:Springer-Verlag,2006:149-160.

二级参考文献6

1Hunt A J, Black A W. Unit selection in a concatenative speech synthesis system using a large speech database [A].ICASSP96 [C]. Atlanta: IEEE Press, 1996. 373-376.
2Meron Y, Hirose K. Efficient weight training for selection based synthesis [A]. EuroSpeech99 [C]. Budapest: ISCA Press, 1999. 2319-2322.
3Alias F, Llora X. Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis [A].EuroSpeech2003 [C]. Geneva: ISCA Press, 2003. 1333 -1336.
4Park S S, KimC K, KimN S. Discriminative weight training for unit-selection based speech synthesis [A].EuroSpeech2003 [C]. Geneva: ISCA Press, 2003.281-284.
5PENG Hu, ZHAO Yong, CHU Min. Perpetually optimizing the cost function for unit selection in a TTS system with one single run of MOS evaluation [A].ICSLP2002 [C].Denver : IEEE Press, 2002. 2613 - 2616.
6吴志勇,蔡莲红.语音合成中的韵律关联模型[J].中文信息学报,2004,18(2):44-50. 被引量：8

1杨飚,尚秀伟.加权随机森林算法研究[J].微型机与应用,2016,35(3):28-30. 被引量：9
2黄祥喜.“语境相关”自动分词方法[J].情报学报,1989,8(4):266-273. 被引量：3
3黄祥喜.书面汉语自动分词的“语境相关”方法[J].计算机应用与软件,1991,8(6):38-44.
4黄浩,朱杰.汉语语音识别中基于区分性权重训练的声调集成方法[J].声学学报,2008,33(1):1-8. 被引量：2
5韩先锋,李俊山,孙满囤,焦康.穿越式景象匹配策略研究[J].微电子学与计算机,2005,22(5):73-77. 被引量：2
6韩先锋,季景春,李俊山,焦康.穿越式景象匹配策略研究[J].湖北航天科技,2005(4):29-36.
7邵立南,刘志斌.遗传神经网络模型在地下水质量评价中的应用[J].露天采矿技术,2005,20(4):20-22. 被引量：2
8刘树杰,李志灏,李沐,周明.一种面向统计机器翻译的协同权重训练方法[J].软件学报,2012,23(12):3101-3114. 被引量：3
9盛积德,张延炘,常胜江,陈戍.前馈型神经网络中隐藏层神经元的研究[J].光电子．激光,2001,12(6):620-622. 被引量：3
10吕俊,张兴华,张湜.基于自适应递阶遗传算法的神经网络优化策略[J].计算机工程与设计,2005,26(2):305-307. 被引量：12

西北师范大学学报（自然科学版）

2007年第4期

浏览历史

内容加载中请稍等...

基于决策树的语音基元语境特征权重训练算法

参考文献10

二级参考文献6

相关作者

相关机构

相关主题

浏览历史