期刊文献+

Non-Blocking Join Algorithm Based on Hash-Merge for Improving Query Response Times

Non-Blocking Join Algorithm Based on Hash-Merge for Improving Query Response Times
下载PDF
导出
摘要 In data streams or web scenarios at highly variable and unpredictable rates, a good join algorithm should be able to "hide" the delays by continuing to output join results. The non-blocking algorithms allow some tuples to be flushed onto disk, with the goal of producing results continuously when data transmission is suspended. But state-of-the-art algorithms have trouble with the constraint of allocated memory. To make better use of memory, a novel non-blocking join algorithm based on hash-merge for improving query response times is proposed. The reduced data structure of in-memory tuples helps to improve memory utility. A replacement selection tree is applied to adjust memory by expanding or shrinking the size of the tree and separates one external join transaction into multi-subtasks. In addition, a cost model to estimate task output rate is proposed to select the in-disk portion that promises to produce the fastest results in the external join stage. Experiments show that the technique, with far less memory, delivers results faster than the three non-blocking join algorithms ( XJoin, HMJ and RPJ ) , with up to almost two-fold improvement in reliable network and one order of magnitude improvement in unreliable network in terms of the number of the reported tuples. In data streams or web scenarios at highly variable and unpredictable rates, a good join algorithm should be able to "hide" the delays by continuing to output join results. The non-blocking algorithms allow some tuples to be flushed onto disk, with the goal of producing results continuously when data transmission is suspended. But state-of-the-art algorithms have trouble with the constraint of allocated memory. To make better use of memory, a novel non-blocking join algorithm based on hash-merge for improving query response times is proposed. The reduced data structure of in-memory tuples helps to improve memory utility. A replacement selection tree is applied to adjust memory by expanding or shrinking the size of the tree and separates one external join transaction into multi-subtasks. In addition, a cost model to estimate task output rate is proposed to select the in-disk portion that promises to produce the fastest results in the external join stage. Experiments show that the technique, with far less memory, delivers results faster than the three non-blocking join algorithms ( XJoin, HMJ and RPJ ) , with up to almost two-fold improvement in reliable network and one order of magnitude improvement in unreliable network in terms of the number of the reported tuples.
出处 《Journal of Southwest Jiaotong University(English Edition)》 2010年第2期160-165,共6页 西南交通大学学报(英文版)
基金 The National High Technology Research and Development Program of China(No.2007AA01Z309) the National Natural Science Foundation of China(No.60803160,No.60873030)
关键词 Hash-merge NON-BLOCKING Replacement selection tree Hash-merge Non-blocking Replacement selection tree
  • 相关文献

参考文献10

  • 1Chen G,Li G H,Yang B,etc.Progressive hash-merge join algorithm. 2008IEEE Pacific-Asia Workshop on Computational Intelligence and Industrial Application . 2008
  • 2Knuth D E.The Art of Computer Programming,Vol-ume3:Sorting and Searching. . 1998
  • 3Tao Y F,Yiu M L,Papadias Det al.RPJ:producing fast join results on streams through rate-based optimiza-tion. Proceedings of the24th ACM SIGMOD Interna-tional Conference on Management of Data . 2005
  • 4Dittrich J P,Seeger B,Taylor D S.Progressive merge join:a generic and non-blocking sort-based join algo-rithm. Proceedings of the28th International Confer-ence on Very Large Data Bases . 2002
  • 5Jermaine C,Dobra A,Arumugam Set al.The sort-merge-shrink join. ACM Transactions on Database Sys-tems . 2006
  • 6Tang G,Crcoles J E,Jing N.NSJ:an efficient non-blocking spatial join algorithm. Proceedings of the14th annual ACM international symposium on Advances in geographic information systems . 2006
  • 7Tok W H,Bressan S,Lee M L.Rrpj:Result-rate based progressive relational join. The12th International Conference on Database Systems for Advanced . 2007
  • 8Tok W H,Bressan S,Lee M L.A stratified approach to progressive approximate joins. The11th International Conference on Extending Database Technology . 2008
  • 9T. Urhan,and M. J. Franklin.XJoin: Getting Fast Answers From Slow and Burst Networks. Technical Report CS-TR-3994, UMIACS-TR-99-13 . 1999
  • 10Mokbel M F,Lu Ming,Aref W G.Hash-Merge Join:A Non-blocking Join Algorithmfor Producing Fast and Early Join Re-sults. Proceeding of the 20th International Conference onData Engineering . 2004

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部