Parallel Training Strategies of Neural Networks for Bio-Sequence Analysis
Abstract: Neural networks are effective tools for pattern recognition and data mining, and they have been widely applied in bioinformatics, for example to recognize genes and promoters in DNA sequences and to classify DNA and protein sequences. Bio-sequences, however, are very long and very numerous: some reach 6 Gbp, and rapid advances in gene sequencing technology have made a huge amount of public bio-sequence data available. This scale challenges neural networks, whose training becomes too slow and inefficient. Reducing network training time is therefore essential for meeting the requirements of bioinformatics, and parallel neural networks are one promising approach. This paper summarizes several parallel training strategies for neural networks that suit bioinformatics applications, classifies them by the granularity of the neural network, and analyzes and compares the cost of each strategy.
Source: Computer Science (《计算机科学》), CSCD, Peking University Core Journal (北大核心), 2004, No. 3, pp. 130-133, 178 (5 pages)
Funding: National Natural Science Foundation of China (60273079)
Keywords: neural network; parallel training strategy; bioinformatics; bio-sequence pattern analysis