Abstract
The binary digital coding of RNA sequences in high-dimensional space has the following advantages: in addition to encoding properties of the nucleotide bases such as structure, functional groups, base complementarity, and hydrogen-bond strength, it makes mathematical and logical operations convenient. This paper investigates more general and deeper operational rules for the digital coding of RNA sequences in high-dimensional space: (1) It further studies the visual dimension N_V, the digital dimension N_X, and the difference dimension N_d of an RNA sequence, and characterizes the initial segment of bases of the sequence and the range of its digital value when N_d = 0, 1, 2, 2n, or 2n+1 (n = 0, 1, 2, …). (2) It derives operational rules for multipoint "mutations" (single nucleotide polymorphisms, SNPs) of RNA sequences, covering seven kinds of multipoint mutation, and describes how the Hamming distance, the Hamming value, and the digital value of a sequence change; these results generalize the previously known ones to the general case. (3) Using the value part X_i and the location part W_i of an RNA sequence, together with their formulas, it derives the encoding rule and the operational rule for tandem repeat RNA sequences from a new point of view, thereby unifying earlier results.
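As a rough illustration of the quantities named in point (2), the following sketch encodes an RNA sequence as a binary string and compares a sequence with a multipoint mutant. The 2-bit assignment in `BASE_BITS` and the helper names are hypothetical, chosen only for this example; the abstract does not state the paper's actual base-to-bit mapping, which is designed to reflect base structure and hydrogen-bond strength.

```python
# Hypothetical 2-bit code per base (an assumption for illustration only;
# the paper's actual assignment is not given in the abstract).
BASE_BITS = {"C": "00", "U": "01", "G": "10", "A": "11"}


def encode(seq: str) -> str:
    """Return the binary string obtained by concatenating each base's code."""
    return "".join(BASE_BITS[base] for base in seq.upper())


def digital_value(seq: str) -> int:
    """Interpret the encoded sequence as an unsigned binary integer."""
    return int(encode(seq), 2)


def hamming_distance(a: str, b: str) -> int:
    """Count the bit positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(p != q for p, q in zip(encode(a), encode(b)))


original = "GACU"
mutant = "GUCA"  # two point substitutions: A->U at base 2, U->A at base 4

print(encode(original), digital_value(original))
print(encode(mutant), digital_value(mutant))
print(hamming_distance(original, mutant))
```

Under this assumed mapping each substitution flips exactly one bit, so the Hamming distance equals the number of mutated bases; with a different mapping a single substitution may flip two bits, which is why the rules depend on the chosen code.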
Source
Computer Engineering and Applications (《计算机工程与应用》)
CSCD
Peking University Core Journal (北大核心)
2004, No. 8, pp. 15-18, 128 (5 pages in total)
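Funding
Key Science and Technology Research Project of the Ministry of Education (No. 02139)
Postdoctoral Fund of Huazhong University of Science and Technology (No. AA184021)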
Keywords
Digital coding
RNA sequence
Multipoint mutation
Tandem repeat
Hamming distance