期刊文献+

基于GLR距离和BIC的混合音频分割算法 被引量:3

Hybrid approach for audio segmentation based on GLR distance and BIC
下载PDF
导出
摘要 针对传统单一音频分割算法中存在的冗余分割点过多问题,研究了一种基于一般似然比(GLR)和贝叶斯信息准则(BIC)相结合的广播音频顺序分割算法,提出了候选跳变点潜在区域的判断准则,并给出跳变点在潜在区域的检测方法,最后对检测到的跳变点进行校验。实验结果表明,与传统的音频分割算法相比,该算法的综合性能大大提高,达到较好的分割效果。 Due to traditional single audio segmentation algorithm suffers from a large amount of redundancy change points, a hybrid approach for audio sequential segmentation in broadcasting based on generalized likelihood ratio (GLR) and Bayesian Information Criterion (BIC) is proposed. The criterion of potential region of candidate change point and the detection method of change point is presented, and the validation of true change points is given. Compared with the algorithms of traditional audio segmentation, the results show that this approach is effective and feasible.
作者 郑继明 俞佳
出处 《计算机工程与设计》 CSCD 北大核心 2009年第13期3120-3123,共4页 Computer Engineering and Design
基金 重庆市教育委员会科学技术研究基金项目(KJ080524)
关键词 广播音频分割 一般似然比 贝叶斯信息准则 声学特征跳变点 校验 broadcasting segmentation generalized likelihood ratio (GLR) Bayesian information criterion (BIC) acoustic change points validation
  • 相关文献

参考文献8

  • 1Zhou Bowen,Hansen John H L.Efficient audio stream segmentation via the combined T2 statistic and Bayesian information criterion[J].IEEE Transactions on Speech and Audio Processing,2005,13(4):467-474.
  • 2Nishida Masafumi,Kawahara Tatsuya.Speaker model selection based on the Bayesian information criterion applied to unsupervised speaker indexing[J].IEEE Transactions on Speech and Audio Processing,2005,13(4):583-592.
  • 3Zhou Bowen,Hansen John H L.Unsuporvised audio stream segmentation and clustering via Bayesian information criterion[C].Proceedings of the International Conference of Spoken Language Processing,2000:714-717.
  • 4Zhang Shilei,Zhnng Shuwu,XU Bo.A two-level method for unsupervised speaker-based audio segmentation[C].Proceedings of the 18th International Conference on Pattern Recognition,2006:298-301.
  • 5Gangadharaiah Rashmi,Narayanaswamy B,Balakrishnan N.A novel method for two-speaker segmentation[C].Proceedings of the 8th International Conference on Spoken Language,2004:2337-2340.
  • 6卢坚,毛兵,孙正兴,张福炎.一种改进的基于说话者的语音分割算法[J].软件学报,2002,13(2):274-279. 被引量:17
  • 7Cheng Shi-sian,Wang Hsin-min.METRIC-SEQDAC:a hybrid approach for audio segmentation[C].Proc of the International Conference of Spoken Language Processing,2004:1617-1620.
  • 8Cheng Shi-sian,Wang Hsin-min.A sequential metric-based audio segmentation method via the Bayesian information criterion[C].Proceedings of Euro Speech,2003:945-948.

二级参考文献11

  • 1Delacourt, P., Wellekens, C.J. DISTBIC: a speaker-based segmentation for audio data indexing. Speech Communication, 2000,32(1~2):111~126.
  • 2Guo, Xue-feng, Zhu, Wei-bin, Shi, Qiu. The IBM LVCSR system used for 1998 Mandarin broadcast news transcription evaluation. In: Proceedings of the 1999 DARPA Broadcast News Workshop. 1999. http://www.nist.gov/.
  • 3Bakis, R., Chen, S., Gopalakrishnan, P.S., et al. Transcription of broadcast news shows with the IBM large vocabulary speech recognition system. In: Proceedings of the DARPA Speech Recognition Workshop. Chantilly, 1997. 67~72.
  • 4Wegmann, S., Zhan, P., Gillick, L. Progress in broadcast news transcription at Dragon systems. In: Proceedings of the ICASSP'99, Vol. 1. Phoenix, Arizona: IEEE. 1999. 33~36.
  • 5Siegler, M.A., Jain U., Raj, B., et al. Automatic segmentation, classification, and clustering of broadcast news audio. In: Proceedings of the DARPA Speech Recognition Workshop. Chantilly, 1997. 97~99.
  • 6Cover, T.M., Tomas, J.A. Elements of Information Theory. New York: John Wiley & Sons, 1991. 1197-1208.
  • 7Gish, H., Schmidt, N. Text-Independent speaker identification. IEEE Signal Processing Magazine, 1994,11(4):18~32.
  • 8Chen, S.S., Gopalakrishnan, P.S. Clustering via the bayesian information criterion with applications in speech recognition. In: Proceedings of the ICASSP'98, Vol. 2, Seattle, Washington: IEEE, 1998. 645~648.
  • 9Schwarz, G. Estimating the dimension of a model. The Annuals of Statistics, 1978,6:461~464.
  • 10Delacourt, P., Wellejkens, C.J. Audio data indexing: use of second-order statistics for speaker-based segmentation. In: Proceedings of the IEEE International Conference on Multimedia Computing and Systems (ICMCS'1999), Vol.2. Florence, Italy: IEEE, 1999. 959~963.

共引文献16

同被引文献16

引证文献3

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部