基于双索引的子图查询算法被引量：2

Subgraph Query Algorithm Based on Dual Index

下载PDF

导出

摘要传统的子图查询算法大多只在图数据库上进行一次挖掘算法,即在图数据库上建立稳定的数据库索引后将不再对索引进行更新。随着查询兴趣的改变或数据库的频繁更新,原有的数据库索引将不再能提供有用的信息来减少查询过程中候选图的数量。为此,提出一种双索引的子图查询算法,同时在数据库和查询流上挖掘频繁子图并建立索引。子图查询和查询流索引的建立同步进行,即使查询兴趣改变,查询流索引也能自适应地更新索引信息来优化查询效率。针对数据库的频繁更新,查询流索引已提供实时的有效信息,数据库索引无需重新建立。实验结果表明,双索引的结合能有效提高查询子图的处理效率。 Most traditional subgraph query algorithms only conduct a mine-at-once algorithm on the graph database.That is,after establishing a stable database index,the index is no longer be updated. This kind of algorithms may encounter such problems：with the query interest frequently changing or the database frequently updating,the original database index becomes increasingly obsolete and no longer provides useful information to effectively reduce the number of candidate graphs. Based on this consideration,this paper proposes a dual index structure which mines frequent subgraphs on the database and the query stream,and establishes index on them. The process of subgraph query and the establishment of query index are simultaneous. They complement each other. So even if the query interest changes,the query stream index can be adaptively updated to optimize the query performance. For the frequent updates of database,the database index doesnot need to be re-built,because the query stream index provides useful information in real time.Experimental results show that the dual index improves the processing efficiency of subgraph query.

作者陆慧琳黄博

机构地区复旦大学计算机科学与技术学院智能信息处理重点实验室

出处《计算机工程》 CAS CSCD 北大核心 2015年第1期44-48,共5页 Computer Engineering

关键词双索引查询流索引子图查询频繁子图图数据库子图同构 dual index query stream index subgraph query frequent subgraph graph database subgraph isomorphism

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献11

1彭佳扬,杨路明,王建新,刘振,李敏.一种高效挖掘生物网络闭合频繁子图的算法[J].高技术通讯,2009,19(2):188-193. 被引量：1
2楼宇波,马坚,周皓峰,袁晴晴,施伯乐.基于频繁链接的Web权威资源挖掘[J].计算机研究与发展,2003,40(7):1095-1103. 被引量：6
3Johnson D S,Garey M R.Computers and Intractability:A Guide to the Theory of Np-completeness[M].[S.1.]:W.H.Freeman and Company,1979.
4Giugno R,Shasha D.Graph Grep:A Fast and Universal Method for Querying Graphs[C]//Proceedings of ICPR’02.Quebec,Canada:IEEE Press,2002:123-129.
5Zhao Peixiang,Yu J X,Yu P S.Graph Indexing:Tree+delta>=graph[C]//Proceedings of VLDB’2007.[S.1.]:IEEE Press,2007:233-241.
6Klein K,Kriege N,Mutzel P.CT-Index:Fingerprint-based Graph Indexing Combining Cycles and Trees[C]//Proceedings of ICDE’11.Hannover,Germany:IEEE Press,2011:258-265.
7李先通,李建中,高宏.一种高效频繁子图挖掘算法[J].软件学报,2007,18(10):2469-2480. 被引量：35
8Yan Xifeng,Yu P S,Han Jiawei.Graph Indexing:A Frequent Structure-based Approach[C]//Proceedings of SIGMOD’04.Paris,France:ACM Press,2004:568-576.
9Cheng J,Ke Y,Ng A,et al.FG-index:Towards Verification-free Query Processing on Graph Data-bases[C]//Proceedings of SIGMOD’07.Beijing,China:[s.n.],2007:541-549.
10Yan Xifeng,Han Jiawei.g Span:Graph-based Substructure Pattern Mining[C]//Proceedings of ICDM’02.Maebashi,Japan:IEEE Press,2002:236-246.

二级参考文献52

1Agrawal R, Srikant R. Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Data Bases (VLDB' 94), Sanfiaogo di Chile,Chile, 1994. 487-499
2Yah X, Han J. gSpan: Graph-based substructure pattern mining. In: Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM'02), Maebashi City, Japan, 2002. 721-724
3Koyuturk M, Grama A, Szpankowski W. An eficient algorithm for detecting frequent subgraphs in biological networks. Bioinformatics, 2004,20(1) : i200-i207
4Hu H Y, Yan X F, Huang Y, et al. Mining coherent dense subgraphs across massive biological networks for functional discovery. Bioinformatics, 2005,21(1) : i213-i221
5Olken F. Biopathways and protein interaction databases. In: A lecture in Bioinfonnatics Tools for Comparative Genomics: A short course, Berkeley, CA, 2003
6Hart J W, Pei J, Yin Y W. Mining frequent patterns without candidate generation. In: Proceedings of the 2000 ACMSIG- MOD International Conference on Management of Data, Dallas, TX,USA, 2000. New York: ACM Press, 2000. 1-12
7Krishnamurthy L, Nadeau J, Ozsoyoglu G, et al. Pathways database system: an integrated system for biological pathways. Bioinformatics, 2003,19(8) : 930-937
8Han J W, Kamber M. Data Mining Concepts and Tech- niques. 2nd Edition. Singapore: Elsevier (Singapore) Pte Ltd, 2007. 233-249
9Altschul S F, Madden T L, Scheffer A A, et al. Gapped BLAST and PSIBLAST: a new generation of protein database search programs. Nucleic Acids Res, 1997, 25 ( 17 ) : 3389- 34O2
10Thompson J D, Higgins D G, Gibson T J. CLUSTALW: im- proving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res, 1994,22 (22), 4673-4680

共引文献39

1鲁慧民,冯博琴,宋擒豹.频繁子图挖掘研究综述[J].微电子学与计算机,2009,26(3):156-161. 被引量：1
2周敏子,周皓峰,王晨,汪卫,施伯乐.使用频繁结构提炼网络权威资源[J].计算机研究与发展,2004,41(10):1614-1620. 被引量：1
3王艳辉,吴斌,王柏.频繁子图挖掘算法综述[J].计算机科学,2005,32(10):193-196. 被引量：12
4董德民,何钦铭.面向电子商务的Web挖掘技术及其应用研究[J].计算机工程与设计,2006,27(1):95-98. 被引量：3
5高琳,覃桂敏,周晓峰.图数据中频繁模式挖掘算法研究综述[J].电子学报,2008,36(8):1603-1609. 被引量：9
6吴甲,陈崚.一种快速的频繁子图挖掘算法[J].计算机应用,2008,28(10):2533-2536. 被引量：4
7付立东,赵永刚,邓福岐.二维非线性对流扩散方程求解程序优化[J].西安科技大学学报,2009,29(1):104-108.
8赵宝华.基于Web挖掘的远程教育课件访问模式分析系统[J].计算机应用与软件,2009,26(3):149-152. 被引量：2
9刘振,杨路明,彭佳扬.基于频繁模式树的频繁连通闭图集挖掘算法[J].计算机技术与发展,2009,19(5):37-40.
10徐慧,陶宏.电子商务中的智能挖掘技术及其应用研究[J].漯河职业技术学院学报,2009,8(5):54-55.

同被引文献4

1董安国,高琳,赵建邦.图模式挖掘中的子图同构算法[J].数学的实践与认识,2011,41(13):105-112. 被引量：4
2王超珲,黄一夫.基于增量信息索引的子图查询算法[J].计算机应用与软件,2016,33(10):37-40. 被引量：1
3黄云,洪佳明,覃遵跃,钟键,李梦婷,印鉴.ERSearch:一种高效的子图查询算法[J].电子学报,2017,45(2):368-375. 被引量：2
4杨艳,纪安娜,金虎.大规模数据图上的个性化子图匹配算法[J].计算机研究与发展,2015,52(S1):48-55. 被引量：5

引证文献2

1张宇彤,王思檬,曹佳.基于邻域等价类的同构子图搜索算法[J].计算机工程,2017,43(9):7-11. 被引量：2
2施炜杰,董一鸿,王雄,潘剑飞.基于索引的子图查询技术研究进展[J].计算机应用,2019,39(1):39-45.

二级引证文献2

1任锦标.基于数据仓库及决策树算法的电网事故事件信息智能检索方法研究[J].集成电路应用,2019,36(12):86-87. 被引量：2
2王迪,黎冠,李志伟,李明宇,谢家顺.基于改进A*算法的消防机器人路径规划算法研究[J].华北科技学院学报,2022,19(1):72-79. 被引量：3

1刘雅辉,刘春阳,张铁赢,程学旗.图索引技术研究综述[J].山东大学学报（理学版）,2013,48(11):44-52.
2王超珲,黄一夫.基于增量信息索引的子图查询算法[J].计算机应用与软件,2016,33(10):37-40. 被引量：1
3邹晓红,郭聪敏,郭景峰.一种有效的图索引查询算法[J].小型微型计算机系统,2013,34(2):370-374.
4王宏志,骆吉洲,李建中.图结构XML文档上子图查询的高效处理算法[J].软件学报,2009,20(9):2436-2449. 被引量：1
5张一楠,高宏,张炜.基于双分支特征编码的子图查询处理算法[J].计算机研究与发展,2011,48(S3):114-123.
6王楠,王斌,李晓华,杨晓春.支持动态图数据的子图查询方法[J].计算机科学与探索,2014,8(2):139-149. 被引量：4
7朱青,李红.时序图上动态子图查询优化算法[J].计算机科学与探索,2014,8(11):1324-1333.
8张航,王宏志,李建中,高宏.基于2-hop优化的子图模式匹配算法[J].黑龙江大学自然科学学报,2010,27(1):78-82. 被引量：1
9黄云,洪佳明,覃遵跃,钟键,李梦婷,印鉴.ERSearch:一种高效的子图查询算法[J].电子学报,2017,45(2):368-375. 被引量：2
10解春欣,汪卫.子图同构验证算法OES[J].计算机工程,2011,37(3):30-32. 被引量：3

计算机工程

2015年第1期

浏览历史

内容加载中请稍等...

基于双索引的子图查询算法被引量：2

参考文献11

二级参考文献52

共引文献39

同被引文献4

引证文献2

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于双索引的子图查询算法 被引量：2

参考文献11

二级参考文献52

共引文献39

同被引文献4

引证文献2

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于双索引的子图查询算法被引量：2