
The statistical power of k-mer based aggregative statistics for alignment-free detection of horizontal gene transfer

Abstract: Alignment-based database search and sequence comparison are commonly used to detect horizontal gene transfer (HGT). However, with the rapid increase in sequencing depth, hundreds of thousands of contigs are routinely assembled in metagenomics studies, which challenges alignment-based HGT analysis by overwhelming the known reference sequences. Detecting HGT by k-mer statistics thus becomes an attractive alternative. These alignment-free statistics have demonstrated high performance and efficiency in whole-genome and transcriptome comparisons. To adapt k-mer statistics for HGT detection, we developed two aggregative statistics, $T^{S}_{\mathrm{sum}}$ and $T^{*}_{\mathrm{sum}}$, which subsample metagenome contigs by their representative regions and summarize the regional $D^{S}_{2}$ and $D^{*}_{2}$ metrics by their upper bounds. We systematically studied the power of the aggregative statistics at different k-mer sizes using simulations. Our analysis showed that, in general, the power of $T^{S}_{\mathrm{sum}}$ and $T^{*}_{\mathrm{sum}}$ increases with sequencing coverage and reaches a maximum of more than 80% at k = 6, with a 5% Type I error rate and a coverage ratio above 0.2x. The statistical power of $T^{S}_{\mathrm{sum}}$ and $T^{*}_{\mathrm{sum}}$ was evaluated with realistic simulations of HGT mechanism, sequencing depth, read length, and base error. We expect these statistics to be useful distance metrics for identifying HGT in metagenomic studies.
Source: Synthetic and Systems Biotechnology (SCIE), 2019, Issue 3, pp. 150-156 (7 pages)
Funding: L.C.X. was supported by the Innovation in Cancer Informatics Fund.
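The abstract refers to regional $D^{S}_{2}$ and $D^{*}_{2}$ metrics without defining them. As a rough illustration only, the sketch below computes both for a pair of sequences using one common form from the alignment-free literature, with centered k-mer counts under a pooled i.i.d. base-frequency background. The background model, the function names (`kmer_counts`, `d2_statistics`), and the handling of ambiguous bases are assumptions made here, not details taken from the paper, and the aggregation into $T^{S}_{\mathrm{sum}}$ and $T^{*}_{\mathrm{sum}}$ is deliberately not reproduced because the abstract does not specify it.

```python
from collections import Counter
from itertools import product
from math import sqrt

ALPHABET = "ACGT"


def kmer_counts(seq, k):
    """Count overlapping k-mers, skipping any word with a non-ACGT character."""
    counts = Counter()
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if all(c in ALPHABET for c in word):
            counts[word] += 1
    return counts


def d2_statistics(x, y, k):
    """Return (D2S, D2star) for sequences x and y.

    Assumption: word probabilities come from pooled single-base
    frequencies (an i.i.d. background), which is only one possible choice.
    """
    cx, cy = kmer_counts(x, k), kmer_counts(y, k)
    nx, ny = sum(cx.values()), sum(cy.values())
    if nx == 0 or ny == 0:
        raise ValueError("each sequence must contain at least one valid k-mer")

    # Pooled base frequencies over both sequences.
    base = Counter(c for s in (x, y) for c in s if c in ALPHABET)
    total = sum(base.values())
    freq = {b: base[b] / total for b in ALPHABET}

    d2s = 0.0
    d2star = 0.0
    for letters in product(ALPHABET, repeat=k):
        word = "".join(letters)
        p = 1.0
        for c in word:
            p *= freq[c]
        if p == 0.0:
            continue  # word cannot occur under this background
        # Centered counts: observed minus expected under the background.
        xt = cx[word] - nx * p
        yt = cy[word] - ny * p
        norm = sqrt(xt * xt + yt * yt)
        if norm > 0.0:
            d2s += xt * yt / norm
        d2star += xt * yt / (sqrt(nx * ny) * p)
    return d2s, d2star


if __name__ == "__main__":
    a = "ACGTACGTTGCAACGTAGCTAGCTAGGATCCA"
    b = "TTGACCGATAGCTAGCTAACGGATCGATCGAT"
    print(d2_statistics(a, b, 3))
```

In a setting like the one the abstract describes, a function of this kind would be applied per region of a contig with k around 6, where the reported power peaks; how the regional values are then combined is left to the paper itself.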