摘要
提出一个预测基因表达水平和同义密码子使用的自洽信息聚类方法。将同义密码子分成最适密码子、非最适密码子和稀有密码子,认为三者的使用频率是调控基因表达水平的主要因素。基于这一观点,对Ecoli(121个基因)和Yeast(107个基因)两类生物的基因表达水平和密码子的使用,用自洽信息聚类方法进行了预测。发现高低表达基因明显分开,基因表达水平被分为四级:甚高表达基因(VH)、高表达基因(H)、较低表达基因(LM)和低表达基因(LL);预测结果与现有的结论和实验结果保持了很好的一致性。
We propse a self-consistal information clustering method to predict gene expression leveland synonymous codon usage. The codons are xlassified into three kinds: optimal codons.non-optimal codons and rare codons. We consider that gene expression level are controlledmainly by the usage frequencies of these three kinds of codons. On the basis of this viewpoint,for E. coli 121 genes and Yeast 107 genes, we predict the gene expression level and synonymous codon usage by the self-consistent information dustering method. It is found thatgenes for each species can be classified into four distinct groups. very high-expressed genesVH). high-expressed genes(H). moderate-expressed gene(LM) and low expressed genes(LL).The results of gene expression level and synonymous codon usage are consistent with experimental data.
出处
《生物物理学报》
CAS
CSCD
北大核心
1995年第2期237-244,共8页
Acta Biophysica Sinica
关键词
基因表达水平
自洽信息聚类
预测
同义密码子
Codon usage Gene expression Self-consistent information clustering Pridiction