期刊文献+

基于并行的同源RNA序列快速搜索算法 被引量:1

A Parallel Algorithm for Rapid RNA Homology Search
下载PDF
导出
摘要 cmsearch程序是目前最流行的同源RNA搜索工具之一,其最大的缺陷在于搜索速度过慢,严重影响了其应用范围。文章基于序列划分策略采用两级工作池方式实现同源RNA的并行搜索,并对其中所涉及的通信和负载平衡问题进行了优化。测试结果表明,此文方法具有良好的可扩展性,在120个处理器时,并行效率可达86.3%,能够用于全基因组范围内的大规模同源RNA序列搜索。 Cmsearch program is one of the most popular tools for searching homologous RNA. However, its speed is too slow, which seriously limits its application. In this paper, we present a method for parallel searching which employs sequence partition strategy and two-level work pool approach. Then, we optimized the communication and load balance involved in our method. As experiments shown, our parallel method has good scalability, which achieves 86.3% efficiency at 120 CPUs, and can be used in solving large-scale homologous RNA search problem in genomic region.
出处 《微电子学与计算机》 CSCD 北大核心 2006年第9期1-3,9,共4页 Microelectronics & Computer
关键词 同源RNA序列 两级工作池 并行化 Homologous RNA sequence, Two-level work pool, Parallelization
  • 相关文献

参考文献5

  • 1R Durbin,S R Eddy,A Krogh,G J Mitchison.Biological sequence analysis:Probabilistic models of proteins and nucleic acids[M].Cambridge UK,Cambridge university press 1998
  • 2S Griffiths-Jones,A Bateman,M Marshall,A Khanna SR Eddy.RFAM:an RNA family database[J].Nucleic Acids Research,2003,31:439~441
  • 3Z Weinberg and W L Ruzzo.Faster genome annotation of non-coding RNA families without loss of accuracy[A].Proceedings of the Eighth Annual International Conference on Computational Molecular Biology,2004:243~251
  • 4S R Eddy.A Memory-Efficient Dynamic Programming Algorithm for Optimal Alignment of a Sequence to an RNA Secondary Structure[J].BMC Bioinformatics 2002,3 (18)
  • 5陈军,赵文辉,莫则尧,李晓梅.基因序列分析软件Hmmpfam的可扩展并行性能优化[J].软件学报,2004,15(2):170-178. 被引量:4

二级参考文献12

  • 1黄铠 徐志伟.可扩展并行计算--技术、结构与编程[M].北京:机械工业出版社,2000.334-368.
  • 2Teresa KA,David JP.Luo JC,et al.,Trans.Introduction to Bioinformatics.Beijing:Beijing University Press,2002.11-197(in Chinese).
  • 3http://www.genetics.wustl.edu/eddy/software/
  • 4Eddy SR.Profile hidden Markov models.Bioinformatics,1998,14(9):755-763.
  • 5Richard D,Eddy SR,Anders K.Biological Sequence Analysis.Beijing:Tsinghua University Press,2002.46-79(in Chinese).
  • 6http://www.sgi.com/industries/sciences/chembio/htc.html
  • 7http://www.apple.com/server/clustering-resource.html
  • 8http://www.platform.com/PDFs/whitepapers/AC_Bioinformatics_WP_v3.pdf
  • 9Hwang K,Xu ZW.Lu XD,et al.,Trans.Scalable Parallel Computing Technology,Architecture,Programming.Beijing:China Machine Press,2000.416-458(in Chinese).
  • 10Mo ZY,Yuan GX.Message-Passing Parallel Programming Environment MPI.Beijing:Science Press,2001.1-11(in Chinese).

共引文献3

同被引文献3

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部