语音识别开发工具包SRDK的研究与开发被引量：1

Research and Development of Speech Recognition Development Kit

下载PDF

导出

摘要详细介绍了一个语音识别开发工具包SRDK(SpeechRecognitionDevelopmentkits)。该工具包可以方便地完成语音识别的各种任务,并且可以用来对语音识别技术进行研究。SRDK的特点是:ANSIC编写,便于向嵌入式系统进行移植;模块化良好,可以任意拆分组合;内置状态捆绑、训练中的剪枝、段长后处理、SSE(StreamingSingle-InstructionMultiple-DataExtensions)指令集的使用等多种先进技术等。已经使用SRDK开发出实用的语音识别系统。 Today,using the general-purpose platform to build speech recognizers becomes more and more popular.In this paper,a compact speech recognition development kit(SRDK)featured with effective merits is presented.SRDK is a set of software modules based on modified Hidden Markov Model(HMM).With them,the task for building various prac-tical speech recognizers as well as relative research work becomes more easily.SRDK is written in ANSI C,therefore it can run well not only on Windows operation system but also in UNIX environment.Further more it can be transplanted to other embed systems too.Contributed to the modularization design,any part of SRDK can be employed independently.Besides,in SRDK,four particular build-in approaches should be emphasized here.Firstly,parameter tying can be imple-mented in both semi-syllable level and state level.Secondly,SRDK adopts pruning in training stage in order to enhance the training speed.Thirdly,considering the effect of different speech rate,duration model is imported into the post pro-cessing for improving recognition performance.Finally,source codes of SRDK are optimized by using Streaming SIMD Extensions(SSE)instructions published by Intel Company and supported by AMD Company.Plus oriented graphs frame-works are utilized instead of multi-sub-tree structure in searching network,the recognition performance is improved comprehensively.The writers have already achieved a private automatic branch exchange system based on SRDK intro-duced in this paper.

作者陈一宁朱璇单翼翔刘加

机构地区清华大学电子工程系

出处《计算机工程与应用》 CSCD 北大核心 2003年第1期5-8,共4页 Computer Engineering and Applications

基金国家自然科学基金项目(编号:69975007) 国家"863"高技术研究发展计划项目(编号:863-306ZD13-04-6)

关键词语音识别开发工具包 SRDK 专用软件语音识别段长模型 SSE指令集隐含马尔可夫模型 Speech Recognition,Development Kits,Duration Model,SSE

分类号 TP319 [自动化与计算机技术—计算机软件与理论] TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献2

1张云等.PC平台新技术SSE应用编程实例[M].东南大学出版社,2000..
2贾宾,朱小燕,罗予频,胡东成.基于状态驻留时间的汉语语音分段概率模型[J].清华大学学报（自然科学版）,2000,40(1):87-90. 被引量：4

二级参考文献5

1战普明,王作英,陆大.语音识别隐马尔可夫模型的改进[J].电子学报,1994,22(1):9-15. 被引量：9
2Shen J L，ICASSP- 96，1996年，125页
3黄昌宁，语言信息处理专论，1996年
4谢锦辉，隐 Markov模型（HMM）及其在语音处理中的应用，1995年
5杨行峻，语音信号数字处理，1995年

共引文献3

1黄光球,汪晓海.基于BP-HMM的网络入侵检测方法研究[J].计算机工程,2007,33(10):131-133. 被引量：2
2刘晖,马飞.基于RS的BP-HMM在网络入侵检测中的应用[J].微计算机信息,2008,24(27):34-36.
3陈小丽,文志诚.一种基于RBF-HMM的网络入侵检测方法研究[J].网络安全技术与应用,2011(1):9-11.

同被引文献13

1韩纪庆,王欢良,李海峰,郑铁然.基于语音识别的发音学习技术[J].电声技术,2004,28(9):47-51. 被引量：9
2易定.用Microsoft Speech SDK5.1实现中文语音交互的方法[J].电脑开发与应用,2005,18(4):62-63. 被引量：4
3高敬惠,姜子敬,胡金铭.基于Speech SDK的语音应用程序实现[J].广西科学院学报,2005,21(3):169-172. 被引量：11
4熊凯.用C#开发基于Microsoft Speech SDK的语音应用程序[J].计算机时代,2007(2):40-42. 被引量：7
5Gutiittez A J,Mullins M E,Novelline R A. Impact of PACS and voice-recognition reporting on the education of radiology residents [ J ]. Journal of Digital Imaging, 2005,18 (2) : 100-108.
6Pienquin O. Machine learning for spoken dialogue management: An experiment with speech-based database querying [ C]//Lecture Notes in Computer Science, A~ficial Intelligence. 2006 : 172-180.
7Boland G. Voice recognition technology for radiology reporting: transforming the radiologist' s value proposition [ J ]. Journal of the American College of Radiology, 2007, 4 (12) :865-867.
8Smith K L. Voice Recognition System for Use in Health Care Management System: US, US20080970317 [ Z].
9Ken H. Voice Recognition system and Method:US, US2006 0492982 [Z].
10朱松豪,刘允才.Automatic Artist Recognition of Songs for Advanced Retrieval[J].Journal of Shanghai Jiaotong university(Science),2008,13(5):513-520. 被引量：1

引证文献1

1孔祥勇,宋健.语音检索在中医处方信息系统中的应用[J].计算机与现代化,2009(10):175-178. 被引量：1

二级引证文献1

1于琦,王映辉,李宗友,田野,王一萌,李菲,刘欣源,李敬华.基于语音识别技术的中医医案采集与应用研究[J].中国数字医学,2021,16(9):80-83. 被引量：2

1arer.一起来认识PentiumⅢ指令集[J].现代计算机（中旬刊）,2000(88):87-91.
2杜利民,阎兆立.基于语音增强的语音识别方法[J].科技开发动态,2004(9):47-47.
3陈静,杨文飞,谢方方,杨素敏,成城.基于SSE指令集的运动目标模板匹配算法设计[J].价值工程,2012,31(29):177-179. 被引量：1
4解军,范毅,陈鹤鸣.偏振模色散的波片级联仿真模型的比较[J].光子技术,2004(4):221-224. 被引量：2
5吴永建,袁德成,郭金玉.基于隐含马尔可夫模型的过程监视方法在TE过程中的应用[J].沈阳化工学院学报,2004,18(2):144-146. 被引量：1
6周乾南.265bps混合激励音子声码器[J].电信资料,2004(6):20-28.
7赵冬晖,潘日华,池凤彬.利用SIMD指令加速VLSI设计规则检查[J].微电子学与计算机,2008,25(7):68-71. 被引量：1
8WANG Chengyou,TANG Shuqi,LIANG Diannong,CHEN Huihuang and TANG Zhaojing(National University of Defence Technology Changsha 410073)Received.The methods for combining the information of various kinds of features in speech recognition[J].Chinese Journal of Acoustics,1997,16(2):115-120.
9助笔记本腾飞[J].中国经济和信息化,1999,0(47):31-31.
10唐国.语音识别技术探讨[J].菏泽学院学报,2001,25(4):17-19.

计算机工程与应用

2003年第1期

浏览历史

内容加载中请稍等...

语音识别开发工具包SRDK的研究与开发被引量：1

参考文献2

二级参考文献5

共引文献3

同被引文献13

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

语音识别开发工具包SRDK的研究与开发 被引量：1

参考文献2

二级参考文献5

共引文献3

同被引文献13

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

语音识别开发工具包SRDK的研究与开发被引量：1