生物序列模式分析中神经网络的并行训练策略

Parallel Training Strategies of Neural Networks for Bio-Sequence Analysis

下载PDF

导出

摘要神经网络作为模式识别、数据挖掘等方面的有效工具,已被广泛应用到生物序列的模式分析中,而生物序列的超大规模、超长同时也给神经网络提出了挑战,即必须解决训练时间过长、效率低下的问题。本文提出了若干适合生物应用的神经网络并行训练策略,并按其神经网络粒度进行分类,同时分析和比较了各种策略的代价。 As an analysis method , neural network has been successfully used in the field of bioinformatics, such as recognition of gene and promoter in DNA sequence, classification of DNA and protein sequences. In the field the bio-sequences are very long and in very large scale, some even up to 6Gbps , and with the rapid development of sequencing technology of the genes, a huge amount of public bio-sequence data are available. The extra huge scale and extra long size of bio-sequences provide some challenges on the neural network. Thus it is very important to reduce the network training time to meet the requirements of bioinformatics and parallel neural network seems one of promising approaches to the problem. In this paper, we have summarized some parallel training strategies of neural network suitable for bioinformatic applications. We also classified them in terms of the granularity of the neural network, analyzed and compared the cost of each strategy.

作者王镝吴青泉王国仁于戈

机构地区东北大学信息学院

出处《计算机科学》 CSCD 北大核心 2004年第3期130-133,178,共5页 Computer Science

基金国家自然科学基金(60273079)

关键词神经网络并行训练策略生物信息学生物序列模式分析 Bioinformatics, Bio-sequence analysis. Neural network, Parallel training

分类号 Q811.4 [生物学—生物工程] TP183 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献34

1HaganMT DemuthHB BealeMH 戴葵(译).神经网络设计[M].北京：机械工业出版社,2002.119-166.
2The Human Genome Project (HGP). http://www.nhgri.nih.gov/HGP/
3Collins F,Patrinos A,Jordan E,Chakravarti A, et al. New goals for the us human genome project: 1998-2003. Science,1998, 282(5389): 682-689
4European Bioinformatics Institue. EMPL at EBI. http://www. ebi. ac. uk
5European Bioinformatics Institute. SWISS-PROT Database.http://www.ebi.ac.uk/swissprot
6The Genome Database. http://www. gdb. org
7FlyBase: the Database of the Drosophila Genome. http://www. flybase. org
8Gusfield D, Stoye J. Linear time algorithms for finding and representing all tandem repeats in a string: [Tech Report CSE-98-4]. Dept of Computer Science, University of California, 1998
9National Center for Biotechnoloy Information. GenBank Overview. http: // www. ncbi. nlmnih. gov/Genbank/ Genbank Overview. html
102E. Hunt. PJama stores and suffix tree indexing for bioinformatics applications. In:Proc. of ECOOP'00, 2000

共引文献18

1韩浪,单建强.人工神经网络在圆管临界热流密度数据处理中的研究[J].原子能科学技术,2005,39(1):69-72. 被引量：1
2马戎,周王民,陈明.声表面波压力传感器的温度补偿[J].传感器技术,2005,24(2):37-38. 被引量：5
3金涛斌,王安国,丁荣林,吴咏诗.周期性缺陷接地结构的BP神经网络模型[J].固体电子学研究与进展,2005,25(1):114-118. 被引量：4
4邱刚,王养利.基于边缘特征和神经网络的汽车牌照定位算法[J].微机发展,2005,15(4):30-32. 被引量：4
5周德新,樊智勇.管道泄漏监测与控制技术的研究[J].计算机测量与控制,2005,13(3):237-238. 被引量：19
6冯天瑾,刘洪波,丁香乾.多层感知器分类行为的模糊线性分析[J].模式识别与人工智能,2005,18(3):334-339.
7姚敏,赵敏,邢力.基于小波神经网络的压力传感器温度补偿方法[J].传感器技术,2005,24(7):13-15. 被引量：9
8陈燕龙,肖南峰.车牌与人脸识别系统的设计与实现[J].交通与计算机,2005,23(4):21-24.
9刘玉洁,张旭.反馈神经网络在入侵检测系统中的应用[J].计算机工程,2005,31(B07):174-175. 被引量：4
10金涛斌,王安国,丁荣林,吴咏诗,李增路.一种新型缺陷接地结构的BP神经网络模型[J].微波学报,2005,21(5):46-50. 被引量：3

1雷咏梅,王雄,郭恒明,金亨科.一种基于支持向量机的并行训练策略[J].上海大学学报（自然科学版）,2007,13(5):545-549.
2刘欣阳,王国仁,乔百友,韩东红.决策树的并行训练策略[J].计算机科学,2004,31(8):129-130. 被引量：1
3Cout.,VE,徐汉臣.是什么引超大规模灭绝？：火山喷发[J].科学（中文版）,1991(2):33-41. 被引量：1
4赵赫南,刘永志,王凤昭,韩俊艳.紫茉莉化学成分与生物应用研究进展[J].畜牧与饲料科学,2016,37(9):25-27. 被引量：8
5赵尚泉.物种大灭绝的幕后真凶[J].大科技（科学之谜）（A）,2012(1):19-21.
6吴栋淦.开发基于视图的Cocoa Touch应用[J].福建信息技术教育,2012,0(2):26-29. 被引量：2
7韩雪,陈启凡,刘丹.表面等离子共振技术的研究[J].中国化工贸易,2013,5(11):249-249. 被引量：1
8中国遗传学会模式生物与人类健康研讨会第一轮通知[J].遗传,2010,32(1):58-58.
9肖国荣,骆志刚,朱伟林,郭华源,刘志芳.一个生物计算网格的设计与实现[J].计算机应用研究,2006,23(11):226-229.
10王斌,徐绮嫔,郭慧琛,曹随忠,孙世琪,姚学萍.犬细小病毒病毒样颗粒研究进展[J].动物医学进展,2016,37(3):81-85. 被引量：4

计算机科学

2004年第3期

浏览历史

内容加载中请稍等...

生物序列模式分析中神经网络的并行训练策略

参考文献34

共引文献18

相关作者

相关机构

相关主题

浏览历史