最近特征线在音频分类中的应用

Application of Nearest Feature Line in Audio Classification

下载PDF

导出

摘要通过提取基音频率、明亮度、带宽、过零率、响度、均方根、相邻点之间距离的均值和方差及Mel倒谱系数这8个特征构造特征集,在此基础上提出一种基于最近特征线的音频分类算法,对其进行枪声、鞭炮声、喇叭声及说话声的分类实验中,结果表明,该算法的分类效果较好,错误率可低至11.76%。 This paper constructs the feature set by extracting eight features including perceptual features like pitch frequency, brightness, bandwidth, zero-crossing rate, loudness, Root Mean Square（RMS）, the distance between the adjacent point of the mean value and Mel Frequency Cepstral Coefficients（MFCC）, and proposes an audio classification algorithm based on Nearest Feature Line（NFL）. It is applied to classification experiment with four audio including guns, banger, horn and talks, and the result shows that the algorithm is effective in classification and its error rate can reduce to 11.76%.

作者练芝飞徐荣聪

机构地区福州大学数学与计算机科学学院

出处《计算机工程》 CAS CSCD 北大核心 2011年第2期151-153,共3页 Computer Engineering

关键词音频分类最近特征线音频特征选取 MEL倒谱系数 audio classification Nearest Feature Line（NFL） audio feature extraction Mel Frequency Cepstral Coefficients（MFCC）

分类号 TN912 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献8

1Feiten B, Frank R, Ungvary T. Organization of Sounds with Neural Nets[C]//Proc. of 1991 International Computer Music Conference. San Francisco, USA:[s. n.], 1991: 441-444.
2Feiten B, Gunzel S. Automatic Indexing of a Sound Database Using Self-organizing Neural Nets[J]. Computer Music Journal, 1994, 18(3): 53-65.
3Li Stan. Content-based Classification and Retriewtl of Audio Using the Nearest Feature Line Method[J ].IEEE Trans. on Speech and Audio .Processing, 2000, 8(5): 619-625.
4陈荆勇,谢湘,刘家康.基于最近特征线法的语音/音乐分类[C]//第八届全国人机语音通讯学术会议论文集.兰州:[出版者不详],2005.
5杨翠丽,郭昭辉,武港山.基于改进投票机制的音乐流派分类方法研究[J].计算机工程,2008,34(9):213-215. 被引量：5
6俞玉莲,郭世杰.音频分类中的特征分析[J].信息技术,2009,33(6):31-33. 被引量：1
7卢坚,陈毅松,孙正兴,张福炎.基于隐马尔可夫模型的音频自动分类[J].软件学报,2002,13(8):1593-1597. 被引量：47
8蔡群,陆松年,杨树堂.基于音视特征的视频内容检测方法[J].计算机工程,2007,33(22):240-242. 被引量：4

二级参考文献34

1雷浩,李生红.基于改进Kohonen网和BP网的色情图像识别技术[J].计算机工程,2005,31(10):164-167. 被引量：7
2John Saunders. Real-time discrimination of broadcast speech/music [C]. Int' 1 Conf Acoustic, Speech, and Signal Processing, Atlanta, 1996.
3Scheirer E, Slaney M. Construction and evaluation of a robust multifeature music/speech discriminator[C]. Int'1 Conf Acoustic, speech, and Signal Processing, Munich: IEEE Press, 1997:1331 - 1334.
4[1]Feiten, B., Frank, R., Ungvary, T. Organization of sounds with neural nets. In: Proceedings of the 1991 International Computer Music Conference, International Computer Music Association. San Francisco, 1991. 441～444.
5[2]Feiten, B., Günzel, S. Automatic indexing of a sound database using self-organizing neural nets. Computer Music Journal, 1994,18(3):53～65.
6[3]Wold, E., Blum, T., Keislar, D., et al. Content-Based classification, search and retrieval of audio. IEEE Multimedia Magazine, 1996,3(3):27～36.
7[4]Foote, J.T. Content-Based retrieval of music and audio. Multimedia Storage and Archiving Systems II, 1997,32(29):138～147.
8[5]Li, S.Z. Content-Based classification and retrieval of audio using the nearest feature line method. IEEE Transactions on Speech and Audio Processing, 2000,8(5):619～625.
9[6]Li, S.Z., Guo, Guo-dong. Content-Based audio classification and retrieval using SVM learning. In: Proceedings of the 1st IEEE Pacific-Rim Conference on Multimedia. 2000.
10[7]Jiang, Hao, Lin, Tony, Zhang, Hong-jiang. Video segmentation with the support of audio segmentation and classification. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2000), Vol 3. NY: IEEE, 2000. 1507～1510.

共引文献53

1齐俊英,孙劲光,高爱东.基于内容的音频自动分类方法[J].辽宁工程技术大学学报（自然科学版）,2005,24(z1):170-172. 被引量：5
2郑继明,李瑞仙,蒲兴成.基于单状态HMM的音频分类方法研究[J].计算机应用,2009,29(2):392-394.
3彭昱忠,元昌安,王艳,覃晓.基于内容理解的不良信息过滤技术研究[J].计算机应用研究,2009,26(2):433-438. 被引量：19
4陈姗姗.未来广播中的音频检索技术[J].视听界（广播电视技术）,2010(3):62-64.
5柳群英.基于内容的音频信息检索技术[J].现代情报,2005,25(6):91-93. 被引量：7
6郑贵滨,韩纪庆,李海峰,郑铁然.基于分段的实时声频检索方法[J].声学学报,2006,31(2):101-108. 被引量：5
7郭兴吉,范秉琪.基于特征的音频比对技术[J].河南师范大学学报（自然科学版）,2006,34(2):35-38. 被引量：15
8郑贵滨,韩纪庆.基于直方图的树与链表相结合的音频索引方法[J].哈尔滨工业大学学报,2006,38(11):1915-1918. 被引量：1
9郭兴吉.隐马尔科夫模型在音频波形识别中的应用研究[J].福建电脑,2007,23(3):13-14.
10黄光球,汪晓海.基于BP-HMM的网络入侵检测方法研究[J].计算机工程,2007,33(10):131-133. 被引量：2

1程剑,应自炉,张有为.基于模糊积分多分类器融合的人脸表情识别[J].信号处理,2005,21(z1):358-361. 被引量：2
2韦克.守岗[J].上海安全生产,2007(3):61-61.
3陈怡君,管桦,王国正,张群,罗迎.稀疏孔径条件下微动目标特征提取与成像算法[J].现代防御技术,2014,42(4):136-142. 被引量：1
4林嘉宇,黄芝平,王跃科,沈振康.语音信号相空间重构中嵌入维数的选择[J].电子科学学刊,1999,21(6):735-742. 被引量：4
5简单爱移动互联节后专辑[J].计算机应用文摘,2012(4):45-47.
6林嘉宇,王跃科,黄芝平,沈振康.一种新的基于混沌的语音、噪声判别方法[J].通信学报,2001,22(2):123-128. 被引量：6
7何望雄.康佳高清ST机芯彩电维修三例[J].家电维修,2010(10):12-12.
8四川汶川一废弃鞭炮引线作坊着火6名儿童被烧伤[J].化工安全与环境,2010(21):20-20.
9刘湘明.SAP发动疲劳战[J].IT经理世界,2005,8(6):40-40.
10phinex.给不能团聚的人们[J].数字家庭,2010(1):140-141.

计算机工程

2011年第2期

浏览历史

内容加载中请稍等...

最近特征线在音频分类中的应用

参考文献8

二级参考文献34

共引文献53

相关作者

相关机构

相关主题

浏览历史