摘要
A formula to compute the similarity between two audio feature vectors is proposed, which can map arbitrary pair of vectors with equivalent dimension to [0,1). To fulfill the task of audio segmentation, a self-similarity matrix is computed to reveal the inner structure of an audio clip to be segmented. As the final result must be consistent with the subjective evaluation and be adaptive to some special applications, a set of weights is adopted, which can be modified through relevance feedback techniques. Experiments show that satisfactory result can be achieved via the algorithm proposed in this paper.
A formula to compute the similarity between two audio feature vectors is proposed, which can map arbitrary pair of vectors with equivalent dimension to [0,1). To fulfill the task of audio segmentation, a self-similarity matrix is computed to reveal the inner structure of an audio clip to be segmented. As the final result must be consistent with the subjective evaluation and be adaptive to some special applications, a set of weights is adopted, which can be modified through relevance feedback techniques. Experiments show that satisfactory result can be achieved via the algorithm proposed in this paper.
基金
SupportedbytheNationalNaturalScienceFoundationofChina(10371033)