Differential co-expression relative constant row bicluster mining algorithm (cited by: 1)

Abstract: In bioinformatics, mining differential co-expression biclusters helps in studying biological processes that involve change, such as aging and carcinogenesis. Earlier definitions of a differential co-expression bicluster measured the difference only at the level of a whole gene set, so the resulting biclusters contained a lot of noise. To overcome this shortcoming, a new differential co-expression support measure, MiSupport, is proposed, which refines the difference down to the level of individual genes. Based on this definition, the MiCluster algorithm is proposed to mine maximal differential co-expression biclusters from two real-valued gene expression (gene chip) datasets. MiCluster first builds a differential weighted undirected sample-sample relational graph from the two datasets. It then mines the maximal differential co-expression biclusters from that graph by sample-growth and level-growth, using an exact candidate generation method and efficient pruning strategies. Experimental results show that MiCluster is faster and more efficient than existing algorithms. Evaluation by Mean Square Error (MSE) score and Gene Ontology (GO) analysis further shows that the mined results have greater statistical and biological significance.
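As a rough illustration of the first step named in the abstract (building a differential weighted sample-sample graph from two datasets), the following Python sketch may help. It is not the MiSupport or MiCluster definition, which the abstract does not give. Everything in it is an assumption made for illustration: the function name `differential_weight_graph`, the tolerance `eps`, the requirement that both matrices share the same genes (rows) and samples (columns), and the rule that a gene supports an edge when its two expression values are within `eps` in the first dataset but not in the second.

```python
from itertools import combinations


def differential_weight_graph(a, b, eps=0.5):
    """Hypothetical sketch, not the published MiCluster construction.

    a, b: lists of rows (genes) of expression values over the same samples.
    Returns a dict mapping each sample pair (i, j), i < j, to the set of
    gene indices that are near-constant across i and j in `a` but not in `b`.
    The edge weight is the size of that set.
    """
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("both datasets must have the same genes and samples")
    n_samples = len(a[0]) if a else 0
    graph = {}
    for i, j in combinations(range(n_samples), 2):
        # Assumed rule: a gene supports the edge if it is co-expressed
        # (difference within eps) in dataset a and not in dataset b.
        genes = {
            g
            for g, (row_a, row_b) in enumerate(zip(a, b))
            if abs(row_a[i] - row_a[j]) <= eps and abs(row_b[i] - row_b[j]) > eps
        }
        if genes:
            graph[(i, j)] = genes
    return graph


if __name__ == "__main__":
    # Toy data: 3 genes x 3 samples in each dataset (made up for the example).
    data_a = [[1.0, 1.2, 5.0], [2.0, 2.1, 2.3], [0.0, 3.0, 6.0]]
    data_b = [[1.0, 4.0, 5.0], [2.0, 2.1, 2.2], [0.0, 3.0, 6.0]]
    for edge, genes in differential_weight_graph(data_a, data_b).items():
        print(edge, sorted(genes), "weight =", len(genes))
```

Under these assumptions, a later mining stage would grow sample sets along edges whose gene sets keep a large enough intersection; how MiCluster actually generates candidates and prunes is not specified in the abstract.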

Authors: XIE Huabo (谢华博), SHANG Xuequn (尚学群), WANG Miao (王淼)

Affiliation: School of Computer Science, Northwestern Polytechnical University

Source: Journal of Computer Applications (《计算机应用》), indexed in CSCD and the Peking University Core Journals list, 2013, No. 8, pp. 2188-2193, 2239 (7 pages)

Funding: National 973 Program of China (2012CB316203); National Natural Science Foundation of China (61272121)

Keywords: gene chip; gene co-expression; bicluster; differential; constant row

Classification number: TP311 [Automation and Computer Technology: Computer Software and Theory]

