期刊文献+

基于决策树的语音基元语境特征权重训练算法

CARD based context specified weights training algorism for unit selection in speech synthesis
下载PDF
导出
摘要 提出了一种基于决策树的语音合成基元的语境特征权重训练算法.对语音数据库中的每个带调音节,利用语境相关的问题集和候选基元的频谱距离建立决策树.对每个要合成的音节,根据其语境特征,获得语音合成系统选择的基元的语境特征F*和该语境特征下决策树叶子结点中基元的语境特征F′.统计F′中每一个语境特征相对于F*的变化,根据语境特征变化的概率对权重进行调整.实验结果表明,这种方法能够训练出合理的语境特征权重,使得合成语音的自然度有一定提高.同时,利用这种方法还可以对语音合成系统进行实时优化. The paper introduces a context specified weights training algorism for contextual features of speech unit in speech synthesis based on Classification and Regression Tree(CART). A CART is created for each tonal syllable with the spectral distance of each candidate unit and the context dependent question set. The increments of contextual features are counted by comparing the units selected by TTS and given by leaf node of CART. The weights of contextual features are then adjusted in accordance with their probability of increment. The experiments demonstrate that a set of reasonable weights can be trained by the algorism so the naturalness of synthetic speech can also be improved. The algorism can also be used to optimize the speech synthesis system online.
出处 《西北师范大学学报(自然科学版)》 CAS 2007年第4期50-54,共5页 Journal of Northwest Normal University(Natural Science)
基金 西北师范大学科研骨干培育项目(NWNU-KJCXGC-03-42)
关键词 语音合成 文语转换 基元选取 权重训练 speech synthesis text-to-speech unit selection weight training
  • 相关文献

参考文献10

  • 1HUNT A,BLACK A.Unit selection in a concatenative speech synthesis system using a large speech database[C]//ICASSP96,Atlanta:IEEE Press,1996:373-376.
  • 2DONOVAN R E.Trainable Speech Synthesis[D].Cambridge:Cambridge University,1996.
  • 3MERON Y,HIROSE K.Efficient weight training for selection based synthesis[C]//Euro Speech 99.Budapest:ISCA Press,1999:2319-2322.
  • 4FRANCESC A,XAVIER L.Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis[C]//Euro Speech 2003.Geneva:ISCA Press,2003:1333-1336.
  • 5PARK Seung-Seop,KIM Chong-Kyu,KIM Nam-Soo.Discriminative weight training for unit-selection based speech synthesis[C]//Euro Speech 2003.Geneva:ISCA Press,2003:281-284.
  • 6吴志勇,蔡莲红,蔡锐.语音合成中基于听辨指导的权重训练算法[J].清华大学学报(自然科学版),2005,45(1):52-56. 被引量:1
  • 7BREIMAN L.Classification and Regression Trees[M].Pacific Grove,CA:Wadsworth,1984.
  • 8BLACK A,TAYLOR P.Automatically clustering similar units for unit selection in speech synthesis[C]//Euro Speech 97.Rhodes:ISCA Press,1997,2:601-604.
  • 9YANG Hong-wu,HUANG De-zhi,CAI Lian-hong.Perceptually weighted mel-cepstral analysis of speech based on psychoacoustic model[J].IEICE Transactions on Information and Systems,2006,E89-D(12):2998-3001.
  • 10ZHANG Xiao-nan,XU Jun,CAI Lian-hong.Prosodic boundary prediction based on maximum entropy model with error-driven modification[C]//Garbonell J G,Siekmann J.Lecture Notes in Artificial Intelligence.Berlin:Springer-Verlag,2006:149-160.

二级参考文献6

  • 1Hunt A J, Black A W. Unit selection in a concatenative speech synthesis system using a large speech database [A].ICASSP96 [C]. Atlanta: IEEE Press, 1996. 373-376.
  • 2Meron Y, Hirose K. Efficient weight training for selection based synthesis [A]. EuroSpeech99 [C]. Budapest: ISCA Press, 1999. 2319-2322.
  • 3Alias F, Llora X. Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis [A].EuroSpeech2003 [C]. Geneva: ISCA Press, 2003. 1333 -1336.
  • 4Park S S, KimC K, KimN S. Discriminative weight training for unit-selection based speech synthesis [A].EuroSpeech2003 [C]. Geneva: ISCA Press, 2003.281-284.
  • 5PENG Hu, ZHAO Yong, CHU Min. Perpetually optimizing the cost function for unit selection in a TTS system with one single run of MOS evaluation [A].ICSLP2002 [C].Denver : IEEE Press, 2002. 2613 - 2616.
  • 6吴志勇,蔡莲红.语音合成中的韵律关联模型[J].中文信息学报,2004,18(2):44-50. 被引量:8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部