Abstract
With the rapid development of next-generation sequencing, transcriptome sequencing has been completed for a growing number of animals and plants, yielding large volumes of transcriptome sequence data. Extracting biologically meaningful information from these data has become central to many studies, and predicting and annotating the functions of unknown genes is one of the key problems. Functional annotation of transcriptome sequences is an important part of functional genomics, and Gene Ontology (GO) annotation is currently one of the most important functional annotation methods. This paper introduces the analysis of transcriptome sequencing data with bioinformatics software, including data quality control and filtering, de novo assembly, homology comparison, and large-scale GO annotation, to provide a reference for researchers working on transcriptome sequencing data analysis, especially for non-model plants.
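As a rough illustration of two of the steps named in the abstract (read quality filtering and GO annotation by homology), the following is a minimal sketch rather than the authors' workflow. It assumes Phred+33 quality strings and a simplified hit table of (transcript, protein, e-value) tuples; the function names, thresholds, read records, and protein identifiers (`P1`, `P2`, `P3`) are hypothetical, and de novo assembly and the homology search itself are left to dedicated tools.

```python
def mean_phred(quality: str, offset: int = 33) -> float:
    """Mean Phred score of a FASTQ quality string (Phred+33 encoding assumed)."""
    return sum(ord(c) - offset for c in quality) / len(quality)


def filter_reads(records, min_mean_q=20.0, min_len=4):
    """Keep (name, seq, qual) reads that are long enough, contain no N,
    and reach the minimum mean quality. Thresholds are illustrative."""
    return [
        (name, seq, qual)
        for name, seq, qual in records
        if len(seq) >= min_len
        and "N" not in seq
        and mean_phred(qual) >= min_mean_q
    ]


def annotate_go(hits, protein_to_go, max_evalue=1e-5):
    """Transfer GO terms to each transcript from its best homolog,
    defined here as the hit with the lowest e-value under the cutoff."""
    best = {}
    for transcript, protein, evalue in hits:
        if evalue > max_evalue:
            continue  # hit too weak to support annotation transfer
        if transcript not in best or evalue < best[transcript][1]:
            best[transcript] = (protein, evalue)
    return {t: sorted(protein_to_go.get(p, ())) for t, (p, _) in best.items()}


# Toy input: three reads and a hypothetical homology hit table.
reads = [
    ("r1", "ACGTACGT", "IIIIIIII"),  # high quality, kept
    ("r2", "ACGNACGT", "IIIIIIII"),  # contains N, dropped
    ("r3", "ACGTACGT", "########"),  # low quality, dropped
]
hits = [("t1", "P1", 1e-30), ("t1", "P2", 1e-10), ("t2", "P3", 1e-2)]
protein_to_go = {
    "P1": {"GO:0003677", "GO:0006355"},
    "P2": {"GO:0005524"},
}

clean_reads = filter_reads(reads)
annotations = annotate_go(hits, protein_to_go)
```

In this toy run only `r1` passes filtering, and only `t1` receives GO terms, because the single hit for `t2` exceeds the e-value cutoff. Taking the single best hit is one simple design choice; pooling terms from several top hits is a common alternative.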
Authors
LIU Fen-xiang (刘粉香) 1,2
YANG Wen-guo (杨文国) 3
SUN Qin-hong (孙勤红) 2
1. Nanjing Institute of Industry Technology, Nanjing, Jiangsu 210023
2. Sanjiang University, Nanjing, Jiangsu 210012
3. Nanjing University of Chinese Medicine, Nanjing, Jiangsu 21002
Source
Journal of Anhui Agricultural Sciences (《安徽农业科学》)
CAS
2018, Issue 31, pp. 88-91, 100 (5 pages in total)
Funding
Jiangsu Provincial Youth Fund Project (BK2015100)
Keywords
Next-generation sequencing
Transcriptome
De novo assembly
GO annotation