
Optimization for Heterogeneous Data Index Based on Decision Tree Classification

Cited by: 10
Abstract: Indexing is an important means of improving query efficiency over massive data in a distributed environment. To build efficient index structures, a variety of index optimization methods for heterogeneous data have been proposed. This paper presents an index optimization method based on a decision tree classification algorithm. First, an index decision tree is constructed with the decision tree classification algorithm. The tree is then used to make a decision on the attribute columns of each subspace table, producing an index table, and an index is built from the index table data. Finally, a global index is built from the indexes on the individual subspaces. The resulting two-level index structure supports fast location of index information and reduces data search time. Experimental results show that the index decision tree is a suitable method for optimizing heterogeneous data indexes.
Source: Electronic Science and Technology (《电子科技》), 2018, No. 3, pp. 48-52, 60 (6 pages)
Funding: National Natural Science Foundation of China, Young Scientists Fund (61402288)
Keywords: decision tree; index structure; big data; index optimization
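
The two-level scheme summarized in the abstract (a decision tree chooses which attribute columns to index in each subspace table, local indexes are built from those columns, and a global index is built over the local ones) can be illustrated with a small Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the column features (`selectivity`, `query_freq`), the training examples, the table layout, and all function names are hypothetical, and a plain ID3 tree stands in for whatever classification algorithm the authors actually used.

```python
import math
from collections import Counter, defaultdict

def entropy(labels):
    total = len(labels)
    return -sum(n / total * math.log2(n / total) for n in Counter(labels).values())

def build_tree(rows, labels, features):
    """ID3 on categorical features. Returns a label or (feature, branches, majority)."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or not features:
        return majority

    def gain(feature):
        groups = defaultdict(list)
        for row, label in zip(rows, labels):
            groups[row[feature]].append(label)
        remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
        return entropy(labels) - remainder

    best = max(features, key=gain)
    if gain(best) <= 0:
        return majority
    rest = [f for f in features if f != best]
    branches = {}
    for value in {row[best] for row in rows}:
        subset = [(r, l) for r, l in zip(rows, labels) if r[best] == value]
        branches[value] = build_tree([r for r, _ in subset], [l for _, l in subset], rest)
    return (best, branches, majority)

def decide(tree, row):
    """Walk the tree; unseen feature values fall back to the node's majority label."""
    while isinstance(tree, tuple):
        feature, branches, majority = tree
        tree = branches.get(row[feature], majority)
    return tree

def build_local_index(table, tree):
    """Index table for one subspace: {column: {value: [row ids]}} for selected columns."""
    index = {}
    for column, stats in table["columns"].items():
        if decide(tree, stats):  # the index decision tree says "index this column"
            postings = defaultdict(list)
            for row_id, row in enumerate(table["rows"]):
                postings[row[column]].append(row_id)
            index[column] = dict(postings)
    return index

def build_global_index(local_indexes):
    """Global level: (column, value) -> names of subspaces whose local index holds it."""
    global_index = defaultdict(set)
    for name, local in local_indexes.items():
        for column, postings in local.items():
            for value in postings:
                global_index[(column, value)].add(name)
    return global_index

def lookup(column, value, global_index, local_indexes):
    """Two-level lookup: the global index picks subspaces, local indexes give row ids."""
    return {name: local_indexes[name][column][value]
            for name in sorted(global_index.get((column, value), ()))}

# Hypothetical training data: column statistics -> "should this column be indexed?"
FEATURES = ["selectivity", "query_freq"]
TRAINING = [
    ({"selectivity": "high", "query_freq": "high"}, True),
    ({"selectivity": "high", "query_freq": "low"}, False),
    ({"selectivity": "low", "query_freq": "high"}, False),
    ({"selectivity": "low", "query_freq": "low"}, False),
]
# Hypothetical subspace tables with per-column statistics and a few rows each.
SUBSPACES = {
    "s1": {"columns": {"user_id": {"selectivity": "high", "query_freq": "high"},
                       "note": {"selectivity": "high", "query_freq": "low"}},
           "rows": [{"user_id": 7, "note": "a"}, {"user_id": 9, "note": "b"}]},
    "s2": {"columns": {"user_id": {"selectivity": "high", "query_freq": "high"},
                       "region": {"selectivity": "low", "query_freq": "high"}},
           "rows": [{"user_id": 9, "region": "east"}, {"user_id": 9, "region": "west"}]},
}

tree = build_tree([r for r, _ in TRAINING], [l for _, l in TRAINING], FEATURES)
local_indexes = {name: build_local_index(t, tree) for name, t in SUBSPACES.items()}
global_index = build_global_index(local_indexes)
print(lookup("user_id", 9, global_index, local_indexes))
```

In this toy setup only `user_id` is selected for indexing, so a query on `user_id` consults the global index first and then touches only the subspaces that actually hold the value. Columns the tree rejects, such as `note` and `region`, never enter either index level.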
