Prefetching method for Hadoop MapReduce environments
Original Chinese title: 一种适用于Hadoop MapReduce环境的数据预取方法 (cited by 5)
Abstract: Because of the data dependencies and the task execution mode in MapReduce environments, reduce tasks cause substantial remote data access delay and unnecessary resource competition, both of which degrade system performance. To solve this performance problem, we propose a pre-fetching method based on pre-scheduling. The method hides the remote data access delay caused by reduce tasks by pre-fetching their data, and reduces resource competition by controlling the resources allocated to reduce tasks. The method is implemented in Hadoop-0.20.2. Experimental results show that, compared with default Hadoop MapReduce and the Hadoop Online Prototype, the method improves system performance by more than 10%.
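The abstract does not spell out the algorithm, so the following is only a toy timing model of the general idea of hiding fetch latency, not the paper's method. It assumes a single reduce task that pulls one output per map task over a serial channel; the function names, finish times, and fetch cost are all hypothetical.

```python
def finish_without_prefetch(map_finish_times, fetch_time):
    """Fetching starts only after the last map output exists (assumed baseline)."""
    return max(map_finish_times) + fetch_time * len(map_finish_times)


def finish_with_prefetch(map_finish_times, fetch_time):
    """Each map output is fetched as soon as it is ready, one at a time."""
    clock = 0.0
    for ready in sorted(map_finish_times):
        # Wait for the output if it is not ready yet, then spend fetch_time on it.
        clock = max(clock, ready) + fetch_time
    return clock


# Hypothetical numbers: four map tasks, 5 time units to fetch each output.
map_finish_times = [10.0, 20.0, 30.0, 40.0]
fetch_time = 5.0

baseline = finish_without_prefetch(map_finish_times, fetch_time)
overlapped = finish_with_prefetch(map_finish_times, fetch_time)
print(baseline, overlapped)
```

In this model only the last fetch remains on the critical path once fetching overlaps with the map phase. The model ignores the resource-allocation side of the method, which the abstract mentions but does not detail.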
Source: Journal of Xidian University (《西安电子科技大学学报》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2014, No. 2, pp. 191-196 (6 pages)
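Funding: National Natural Science Foundation of China (51274088); Education Department of Henan Province (ITE12103); Doctoral Fund of Henan Polytechnic University (B2012-099); Provincial Key Laboratory of Mine Informatization, Henan Polytechnic University (KY2012-05)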
Keywords: MapReduce; distributed computing; pre-fetching; scheduling