Classification of Normal and Pathological Voice Using SVM and RBFNN 被引量：3

Classification of Normal and Pathological Voice Using SVM and RBFNN

下载PDF

导出

摘要 The identification and classification of pathological voice are still a challenging area of research in speech processing. Acoustic features of speech are used mainly to discriminate normal voices from pathological voices. This paper explores and compares various classification models to find the ability of acoustic parameters in differentiating normal voices from pathological voices. An attempt is made to analyze and to discriminate pathological voice from normal voice in children using different classification methods. The classification of pathological voice from normal voice is implemented using Support Vector Machine (SVM) and Radial Basis Functional Neural Network (RBFNN). The normal and pathological voices of children are used to train and test the classifiers. A dataset is constructed by recording speech utterances of a set of Tamil phrases. The speech signal is then analyzed in order to extract the acoustic parameters such as the Signal Energy, pitch, formant frequencies, Mean Square Residual signal, Reflection coefficients, Jitter and Shimmer. In this study various acoustic features are combined to form a feature set, so as to detect voice disorders in children based on which further treatments can be prescribed by a pathologist. Hence, a successful pathological voice classification will enable an automatic non-invasive device to diagnose and analyze the voice of the patient. The identification and classification of pathological voice are still a challenging area of research in speech processing. Acoustic features of speech are used mainly to discriminate normal voices from pathological voices. This paper explores and compares various classification models to find the ability of acoustic parameters in differentiating normal voices from pathological voices. An attempt is made to analyze and to discriminate pathological voice from normal voice in children using different classification methods. The classification of pathological voice from normal voice is implemented using Support Vector Machine (SVM) and Radial Basis Functional Neural Network (RBFNN). The normal and pathological voices of children are used to train and test the classifiers. A dataset is constructed by recording speech utterances of a set of Tamil phrases. The speech signal is then analyzed in order to extract the acoustic parameters such as the Signal Energy, pitch, formant frequencies, Mean Square Residual signal, Reflection coefficients, Jitter and Shimmer. In this study various acoustic features are combined to form a feature set, so as to detect voice disorders in children based on which further treatments can be prescribed by a pathologist. Hence, a successful pathological voice classification will enable an automatic non-invasive device to diagnose and analyze the voice of the patient.

作者 V. Sellam J. Jagadeesan

机构地区 Department of Computer Science and Engineering

出处《Journal of Signal and Information Processing》 2014年第1期1-7,共7页 信号与信息处理（英文）

关键词 Terms—Pitch Formants JITTER SHIMMER Reflection COEFFICIENTS SVM RBFNN Terms—Pitch Formants Jitter Shimmer Reflection Coefficients SVM RBFNN

分类号 R73 [医药卫生—肿瘤]

引文网络
相关文献

同被引文献6

1陈大跃,丁洪,周新宇,林良明,卢钢,孙志宏.人体行走的步态测试与分析系统[J].中国生物医学工程学报,1997,16(2):133-141. 被引量：17
2纪爱兵,邱红洁,谷银山.基于模糊训练数据的支持向量机与模糊线性回归[J].河北大学学报（自然科学版）,2008,28(3):240-243. 被引量：3
3刘雨华.基于梯形模糊数的指标权重确定方法的应用研究[J].南京信息工程大学学报（自然科学版）,2009,1(4):369-372. 被引量：14
4孙娜,马生昀,马占新,李东平.基于LR模糊数的广义模糊DEA模型及其求解[J].数学的实践与认识,2015,45(5):201-208. 被引量：2
5田丰,戴国忠,陈由迪,程成.三维交互任务的描述和结构设计[J].软件学报,2002,13(11):2099-2105. 被引量：12
6吴涛,贺汉根,贺明科.基于插值的核函数构造[J].计算机学报,2003,26(8):990-996. 被引量：38

引证文献3

1胡站伟,焦立国,徐胜金,黄勇.基于多尺度重采样思想的类指数核函数构造[J].电子与信息学报,2016,38(7):1689-1695. 被引量：4
2李洋,黄进,田丰,韩冬奇,范俊君,陈辉,彭晓兰,戴国忠,王宏安.云端融合的神经系统疾病多通道辅助诊断研究[J].中国科学：信息科学,2017,47(9):1164-1182. 被引量：5
3张新亚,沈菊红,刘楷.一种输入数据为模糊数的模糊支持向量机[J].计算机工程与应用,2017,53(20):122-127. 被引量：2

二级引证文献11

1范向民,范俊君,田丰,戴国忠.人机交互与人工智能:从交替浮沉到协同共进[J].中国科学：信息科学,2019,49(3):361-368. 被引量：31
2李群生,赵剡,寇磊,王进达.一种基于多尺度核学习的仿射投影滤波算法[J].电子与信息学报,2020,42(4):924-931. 被引量：5
3刘晓宇,武鲁,许少华.一种深层过程神经网络及其在信号分类中的应用[J].软件导刊,2020,19(3):60-64. 被引量：1
4梁礼明,郭凯,盛校棋.基于加权特征子空间的支持向量机核函数研究[J].科学技术与工程,2020,20(15):6101-6106.
5梁礼明,郭凯,盛校棋.基于分形插值核函数的支持向量机算法[J].计算机应用与软件,2021,38(8):298-302. 被引量：8
6李洋,汪柳萍,黄进,范向民,田丰.人机交互技术在神经系统疾病辅助诊断中的应用:现状与前景[J].协和医学杂志,2021,12(5):608-613. 被引量：1
7王伟涛,杜洋,宋雪娇,樊建春,王雅杰.基于模糊支持向量机的深远海应急物资优先级研究[J].安全与环境学报,2022,22(6):3321-3325. 被引量：3
8张萍,孙旭芳,张悦蛟.居家场景下帕金森步态障碍早期干预产品设计研究[J].设计,2023,36(10):100-103.
9李永飞,贺桂英,张金.基于SVM算法的生鲜农产品质量安全预警研究[J].智能计算机与应用,2023,13(7):150-154. 被引量：2
10陶思源,王海楠.基于J2EE和Android的动物疾病辅助诊断平台设计与测试[J].科技创新与应用,2024,14(13):120-123.

1LEE Wai-Sum,ZEE Eric.VOWEL DISPERSION AND VARIABILITY—DATA FROM THREE CHINESE DIALECTS[J].中国语音学报,2013(1):143-152.
2肖跃华,刘伟,张美伦,王永宝,李冬雷,张旭宇,张静.低温等离子射频术治疗喉癌的效果及术后患者喉狭窄发生情况分析[J].实用癌症杂志,2019,34(10):1646-1648. 被引量：9
3LEE Wai-Sum.ARTICULATORY-ACOUSTIC RELATIONS IN PALATAL VOWELS[J].中国语音学报,2013(1):153-162.
4Markandu Thirukumar.Hitches in Antenatal Screening in Vaharai, Batticaloa, Sri Lanka[J].Open Journal of Obstetrics and Gynecology,2018,8(4):416-424.
5Dionysios Tafiadis,Meropi E. Helidoni,Spyridon K. Chronopoulos,Evangelia I. Kosma,Vasiliki Liagkou,Louiza Voniati,Nafsika Ziavra,George A. Velegrakis.Preliminary Receiver Operating Characteristic Analysis on Voice Handicap Index of Laryngeal Inflammation in Greek Patients[J].International Journal of Otolaryngology and Head & Neck Surgery,2018,7(3):115-131.
6Gislaine Ferro Cordeiro,Arlindo Neto Montagnoli,Maysa Tibério Ubrig,Marcia Helena Moreira Menezes,Domingos Hiroshi Tsuji.Comparison of Tongue and Lip Trills with Phonation of the Sustained Vowel /<i>ε</i>/ Regarding the Periodicity of the Electroglottographic Waveform and the Amplitude of the Electroglottographic Signal[J].Open Journal of Acoustics,2015,5(4):226-238.
7于雪杰.渐进式发声训练在声带息肉术后患者嗓音恢复中的应用[J].安徽卫生职业技术学院学报,2019,18(5):144-145. 被引量：4
8Mohammad Shahbakhi,Danial Taheri Far,Ehsan Tahami.Speech Analysis for Diagnosis of Parkinson’s Disease Using Genetic Algorithm and Support Vector Machine[J].Journal of Biomedical Science and Engineering,2014,7(4):147-156. 被引量：1
9C. Nagarjuna Reddy,T. Harinarayana.Solar Thermal Energy Generation Potential in Gujarat and Tamil Nadu States, India[J].Energy and Power Engineering,2015,7(13):591-603.
10Dionysios Tafiadis,Evangelia I. Kosma,Spyridon K. Chronopoulos,Louiza Voniati,Nafsika Ziavra.A Preliminary Receiver Operating Characteristic Analysis on Voice Handicap Index Results of the Greek Voice-Disordered Patients[J].International Journal of Otolaryngology and Head & Neck Surgery,2018,7(3):98-114. 被引量：2

Journal of Signal and Information Processing

2014年第1期

浏览历史

内容加载中请稍等...

Classification of Normal and Pathological Voice Using SVM and RBFNN 被引量：3

同被引文献6

引证文献3

二级引证文献11

相关作者

相关机构

相关主题

浏览历史