期刊文献+

基于DHT的分布式索引技术研究与实现 被引量:8

Research and Implementation of Distributed Index Based on DHT
下载PDF
导出
摘要 针对索引创建和维护效率不高的问题,设计了一种基于DHT(Distributed Hash Table)的分布式倒排索引构建算法。该算法利用基于改进的Chord网络的分布式哈希表技术,将分词后的结果分散到多个索引服务器上并行构建索引,同时采用前驱列表定位和减少服务器定位延迟的技术,大大缩短了索引构建时间。通过采用统一调度的基于分块的增量式倒排索引更新策略,索引更新时不再需要移动已有的索引文件,提高了索引更新效率。利用周期性稳定算法和前驱列表定位提高了系统的稳定性、容错性和索引的一致性。 A distributed inverted index's building method based on DHT (Distributed Hash Table) was adopted to im- prove the index's creating and updating efficiency. The arithmetic, using the DHT technology based on improved Chord network,hashes the terms and their relational information to the distributed index servers and builds the index paralle- ly. This method reduces the index' s building time through distributing a task to many nodes. The strategies of schedu- ling the index building task through chained index management servers and the incremental distributed inverted index updating method were used,which could assure index's consistency and updating efficiency.
出处 《计算机科学》 CSCD 北大核心 2010年第2期65-70,共6页 Computer Science
基金 国家自然科学基金项目(60873225 60773191 70771043) 国家高技术研究发展计划(863计划)项目(2007AA01Z403)资助
关键词 分布式索引 分布式哈希表 CHORD网络 Distributed index,Distributed hash table,Chord network
  • 相关文献

参考文献9

  • 1程学旗,吕建明,周昭涛.基于对等网络的全文信息检索[J].计算机研究与发展,2004,41(12):2148-2155. 被引量:11
  • 2Ratnasamy S, Shenker S. Routing algorithms for DHT : Some open questions[C]//Proceedings of IPTPS. Boston, MA, 2002: 278-293.
  • 3Luu T,Kelemm F. ALVIS Peers: A Scalable Full-text Peer-to- Peer retrieval Engine[M]. Arlington, Virginia, USA, 2006: 383- 395.
  • 4Podnar I, Rajman M, Luu T, et al. Sealable peer-to-peer web retrieval with highly discriminative keys[R]. September 2006.
  • 5Zakis J D, Pudlowski Z J. The World Wide Web as Universal Medium for Scholarly Publication, Information Retrieval and Interchange[J]. Global Journal of Engineering Education, 1997,1 (3).
  • 6左朝树,刘心松,陈小辉,顾攀.DPsIR^+:一种基于动态空间槽的分布式并行空间索引树[J].计算机科学,2006,33(2):121-126. 被引量:5
  • 7Cutting D,Pedersen J. Optimization for Dynamic Inverted Index Maintenance[C] // Proceedings of the 13th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, 1990:405-411.
  • 8Tomasic A, Garcia-Molina H. Incremental Updates of Inverted Lists for Text Document Retrieval[C] // Proceedings of 1994 ACM SIGMOD International Conference on Management of Data. Minneapolis. New York: ACM Press, 1994:289-300.
  • 9吴恒山,刘兴宇,左琼.一种基于可扩展散列表的倒排索引更新策略[J].计算机工程,2004,30(8):83-84. 被引量:6

二级参考文献45

  • 1[1]Zipf G K.Human Behavior and the Principle of Least Effort. Addisonwesley Press, 1949
  • 2[2]Fagin R,Nievergelt J,Pippenger N,et al. Extendible Hashing:a Fast Aecess Method for Dynamic Files. ACM Trans.on Database Systems,1979,4(3):315-344
  • 3[3]Melnik S,Raghavan S,Yang B,et al. Building a Distributed Full-text Index for the We b. In: Proceed ings of WWW 1 0, 2001
  • 4[4]Cutting D,Pedersen J.Optimizafion for Dynamic Inverted Index Maintenance. SIGIR90,1990:405-41 l
  • 5[5]Garcia-Molina H,Tomasic A,Shoens K.Incremental Updates of Inverted Lists for Text Document Retrieval.SIGMOD94,1994,23(2):289-300
  • 6[6]Chiueh T, Huang L.Efficient Real-time Index Updates in Text Retrieval Systems. ECSL Technical Report 66,1999
  • 7Guttman A. R-Trees: A Dynamic Index Structure for Spatial Searching. In: 1984 Proc. ACM SIGMOD Conf, , 1984.47-57
  • 8Ciferri R R, Salgado A C, et al. A performance comparison among the traditional R trees, the hilbert R-tree and the SR-tree.In:2003 Proe. SCCC Conf. ,2003. 3-12
  • 9Xia Yuni,Prabhakar S. Q+Rtree: efficient indexing for moving object databases. In:2003 Proc. DASFAA Conf. ,2003. 175-182
  • 10Hu Kong-Fa,Dong Yi-Sheng, et al. DCA-Tree: a high performance structure for incremental update cube on MDDW. In: 2002 Proc. Machine Learning and Cybernetics Conf. , 2002. 20692072

共引文献19

同被引文献58

  • 1刘云生,游安.一种可行的时态数据库索引技术[J].计算机应用研究,2006,23(10):63-65. 被引量:1
  • 2李运娣,冯勇.基于DHT的P2P搜索定位技术研究[J].计算机应用研究,2006,23(10):226-228. 被引量:19
  • 3方冰,张一中.高性能FTP搜索引擎的设计[J].南京邮电大学学报(自然科学版),2007,27(3):67-70. 被引量:7
  • 4International Telecommunication Union. The Internet of things [ EB/ OL ]. (2005-11 - 17 ) [ 2011 - 11-06 ]. http ://www. itu. int/osg/spu/ publications/internetofthings/IntemetotThings_summary. pdf.
  • 5EPCglobal. The EPCglobal architecture framework [ S/OL]. (2009- 03-19 ) [ 2011-11-06 ]. http ://www. epcglobalinc. org/standards/ar- chitecture/architectur?_1 _3-framework-20090319. pdf.
  • 6EPCglobal. Object name service(ONS) 1.0.1 [ S/OL]. (2008-05- 29 ) [ 2011 - 11 - 06 ]. http ://www. epcglobaline. org/standards/ons/ ons_1_0_1-standard-20080529. pdf.
  • 7STOICA I, MORRIS R, LIBEN-NOWELL D, et al. Chord: a scalable peer-to-peer lookup protocol for Intemet applications [ J ]. IEEE/ ACM Trans on Networking,2003, 11 ( 1 ) : 17-32.
  • 8曾强,郝玉洁,郭建东.实时历史数据库库文件结构设计与分析[J].微计算机信息,2007(24):152-154. 被引量:4
  • 9Li Y, Jagadish H, Tan K L. Sprite : A Learning - based Tex- tretrieval System in Dht Networks [ C ]//ICDE 2007. IEEE 23 rd Interna- tional Conference on Data Engineering,2007. Istanbul : IEEE,2007 : 1106 -1115.
  • 10Fox G. Peer - to - Peer Network [ J ]. Computing in Science &Engineering,2001,3 ( 3 ) : 75 - 77.

引证文献8

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部