Audio Segmentation via the Similarity Measure of Audio Feature Vectors

Audio Segmentation via the Similarity Measure of Audio Feature Vectors

下载PDF

导出

摘要 A formula to compute the similarity between two audio feature vectors is proposed, which can map arbitrary pair of vectors with equivalent dimension to [0,1）. To fulfill the task of audio segmentation, a self-similarity matrix is computed to reveal the inner structure of an audio clip to be segmented. As the final result must be consistent with the subjective evaluation and be adaptive to some special applications, a set of weights is adopted, which can be modified through relevance feedback techniques. Experiments show that satisfactory result can be achieved via the algorithm proposed in this paper. A formula to compute the similarity between two audio feature vectors is proposed, which can map arbitrary pair of vectors with equivalent dimension to [0,1）. To fulfill the task of audio segmentation, a self-similarity matrix is computed to reveal the inner structure of an audio clip to be segmented. As the final result must be consistent with the subjective evaluation and be adaptive to some special applications, a set of weights is adopted, which can be modified through relevance feedback techniques. Experiments show that satisfactory result can be achieved via the algorithm proposed in this paper.

作者 CHEN Gang TAN Hui CHEN Xin-meng

机构地区 School of Computer

出处《Wuhan University Journal of Natural Sciences》 EI CAS 2005年第5期833-837,共5页 武汉大学学报（自然科学英文版）

基金 SupportedbytheNationalNaturalScienceFoundationofChina(10371033)

关键词 audio segmentation abrupt change detection overall error similarity measure self-similarity matrix relevance feedback audio segmentation abrupt change detection overall error similarity measure self-similarity matrix relevance feedback

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

1ZHU Qiu ping, FANG Zhi hao, LUO Juan,FU Liang College of Electronic Information,Wuhan University,Wuhan 430072,China.Study of Self-Similarity in Variable Bit Rate Video Sources[J].Wuhan University Journal of Natural Sciences,1999,4(2):60-63.
2糜增元.基于内容的数字音频快速检索技术综述[J].中国新通信,2016,18(4):120-120.
3阿尔卡特朗讯荣获“2012年亚太区年度最佳光网络设备商”大奖[J].电信网技术,2012(6):87-87.
4孙卫国,夏秀渝,乔立能,叶于林.面向音频检索的音频分割和标注研究[J].微型机与应用,2017,36(5):38-41. 被引量：5
5宣丽萍.音频跳变点的分割熵检测算法[J].黑龙江科技学院学报,2008,18(3):199-201.
6李稀敏,洪青阳,黄晓丹.基于说话人的音频分割与聚类[J].心智与计算,2010,0(2):139-147. 被引量：5
7杨东沿,赵伟,孔明明.基于端点检测的广播音频分割与分类[J].现代计算机（中旬刊）,2016(4):46-49. 被引量：3
8赵剑,董远,赵贤宇,杨浩,陆亮,王海拉.Advances in SVM-Based System Using GMM Super Vectors for Text-Independent Speaker Verification[J].Tsinghua Science and Technology,2008,13(4):522-527.
9YongchengWang Jiqing Han Haifeng Li Tieran Zheng.A Novel Audio Segmentation Method Based on Changing Trend of Distance between Audio Scenes[J].通讯和计算机（中英文版）,2006,3(7):22-30.
10Fenghua Wang,Hui Xie,Zhitao Huang.Blind reconstruction of convolutional code based on segmented Walsh-Hadamard transform[J].Journal of Systems Engineering and Electronics,2014,25(5):748-754. 被引量：10

Wuhan University Journal of Natural Sciences

2005年第5期

浏览历史

内容加载中请稍等...

Audio Segmentation via the Similarity Measure of Audio Feature Vectors

相关作者

相关机构

相关主题

浏览历史