基于单元集群的MapReduce中节点失效的改进被引量：1

The improvement of the failure problem of node in MapReduce based on unit cluster

下载PDF

导出

摘要针对传统MapReduce框架中任务节点和工作节点的失效问题,提出了在配置备份节点的分层主从式MapReduce框架中加入单元集群的处理方法。在改进框架中,任务处理的最小单位是单元集群,当单元集群中的某个工作节点失效或者超过时间阙值时,子任务节点则选择该单元集群中的空闲工作节点来分配任务,并且不需要重新传输任务文件分块,这既节省了工作节点重选择的时间,又降低了网络传输的压力。使用该框架针对不同数量的数据块进行实验,工作节点的灾难恢复时间均可以节省25ms左右,证明了单元集群的处理方法可以有效解决工作节点的失效问题。 Against the failure problem of Master Node and Worker Node in the traditional MapReduce framework, proposing a solution of adding unit cluster in the hierarchical master-slave MapReduce framework with Master backup nodes, in this improve- ment framework, for a sub-master node, the minimum unit of executing task is a unit cluster. When a worker node in the unit cluster failing or exceeding the time threshold, the sub-master node selects the idle nodes in this unit cluster to execute the task and does not retransmit the task file block, this not only saves the time of reselecting node, but also reduces the pressure of net- work transmission.In the experiment of using this framework, against the different number of the data blocks, the disaster recovery time of the worker node era1 save about 25 ms. The experiment results demonstrates the solution of unit cluster can effectively solve the failure problem of the worker node.

作者张乐

机构地区暨南大学信息科学与技术学院

出处《微型机与应用》 2013年第16期81-84,共4页 Microcomputer & Its Applications

关键词 Hadoop架构 MAPREDUCE框架任务节点工作节点备份节点节点失效单元集群 Hadoop architecture MapReduce framework master node worker node backup node failure node unit cluster

分类号 TP302.1 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献7

1陈康,郑纬民.云计算:系统实例与研究现状[J].软件学报,2009,20(5):1337-1348. 被引量：1312
2WHITE T.Hadoop:the definitive guide[M].California:O'Reilly Media,2012.
3DEAN J,GHEMAWAT S.MapReduce:simplified data processing on large clusters[J].Communications of the ACM,2008,51(1):107-113.
4BORTHAKUR D.HDFS architecture guide[DB/OL].Hadoop apache project.(2008-02-14).[2013-04-22].http://hadoop.apache.org/common/docs/current/hdfsdesign.pdf.
5CONDIE T,CONWAY N,ALVARO P,et al.MapReduce online[C].Proceedings of the 7th USENIX Conference on Networked Systems Design and Implementation,2010:21-21.
6李玉林,董晶.基于Hadoop的MapReduce模型的研究与改进[J].计算机工程与设计,2012,33(8):3110-3116. 被引量：36
7彭辅权,金苍宏,吴明晖,应晶.MapReduce中shuffle优化与重构[J].中国科技论文,2012,7(4):241-245. 被引量：8

二级参考文献45

1Sims K. IBM introduces ready-to-use cloud computing collaboration services get clients started with cloud computing. 2007. http://www-03.ibm.com/press/us/en/pressrelease/22613.wss
2Boss G, Malladi P, Quan D, Legregni L, Hall H. Cloud computing. IBM White Paper, 2007. http://download.boulder.ibm.com/ ibmdl/pub/software/dw/wes/hipods/Cloud_computing_wp_final_8Oct.pdf
3Zhang YX, Zhou YZ. 4VP+: A novel meta OS approach for streaming programs in ubiquitous computing. In: Proc. of IEEE the 21st Int'l Conf. on Advanced Information Networking and Applications (AINA 2007). Los Alamitos: IEEE Computer Society, 2007. 394-403.
4Zhang YX, Zhou YZ. Transparent Computing: A new paradigm for pervasive computing. In: Ma JH, Jin H, Yang LT, Tsai JJP, eds. Proc. of the 3rd Int'l Conf. on Ubiquitous Intelligence and Computing (UIC 2006). Berlin, Heidelberg: Springer-Verlag, 2006. 1-11.
5Barroso LA, Dean J, Holzle U. Web search for a planet: The Google cluster architecture. IEEE Micro, 2003,23(2):22-28.
6Brin S, Page L. The anatomy of a large-scale hypertextual Web search engine. Computer Networks, 1998,30(1-7): 107-117.
7Ghemawat S, Gobioff H, Leung ST. The Google file system. In: Proc. of the 19th ACM Symp. on Operating Systems Principles. New York: ACM Press, 2003.29-43.
8Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters. In: Proc. of the 6th Symp. on Operating System Design and Implementation. Berkeley: USENIX Association, 2004. 137-150.
9Burrows M. The chubby lock service for loosely-coupled distributed systems. In: Proc. of the 7th USENIX Symp. on Operating Systems Design and Implementation. Berkeley: USENIX Association, 2006. 335-350.
10Chang F, Dean J, Ghemawat S, Hsieh WC, Wallach DA, Burrows M, Chandra T, Fikes A, Gruber RE. Bigtable: A distributed storage system for structured data. In: Proc. of the 7th USENIX Symp. on Operating Systems Design and Implementation. Berkeley: USENIX Association, 2006. 205-218.

共引文献1347

1查伟,孙燕琼,郑继平.基于云测试架构的FIVP解决方案[J].铁路技术创新,2021(S01):82-86.
2林少伟.人工智能法律主体资格实现路径:以商事主体为视角[J].中国政法大学学报,2021(3):165-177. 被引量：6
3胡祖林,肇杰.云计算下的网盘安全[J].计算机产品与流通,2020,0(1):164-164.
4陈小样.关于数据统计的课程推荐算法在远程教育平台的应用概述[J].吉林广播电视大学学报,2021(6):21-23. 被引量：1
5张盛,任伟,王玉,黄金明,陈旭彤.基于Web的重力异常正演建模工具[J].地质论评,2023,69(S01):595-597.
6赵文韬.基于5G技术的黑龙江云计算产业发展[J].电子技术（上海）,2020,49(9):186-187.
7Longfei He,Mei Xue,Bin Gu.Internet-of-things enabled supply chain planning and coordination with big data services:Certain theoretic implications[J].Journal of Management Science and Engineering,2020,5(1):1-22. 被引量：6
8吴劲松,陈孚.云计算发展及应用研究[J].广西通信技术,2011(2):9-13. 被引量：5
9黄纬,温志萍,程初.云计算中基于K-均值聚类的虚拟机调度算法研究[J].南京理工大学学报,2013,37(6):807-812. 被引量：17
10孙凌宇,欧阳春娟,冷明,刘昌鑫,夏洁武.云计算与高等教育管理信息服务系统构建[J].山西财经大学学报,2012,34(S1). 被引量：9

同被引文献1

1刘莎,谭良.Hadoop云平台中基于信任的访问控制模型[J].计算机科学,2014,41(5):155-163. 被引量：17

引证文献1

1苏锡杰.云计算安全研究[J].硅谷,2014,7(17):52-53.

1赵红侠,姜淑娟,牟春雷.一种基于星型结构的移动代理的容错模型[J].微计算机信息,2009,25(12):134-136.
2田华,鄢喜爱.数字图书馆中的一种容灾系统的设计与实现[J].农业网络信息,2006(8):68-70.
3王明伟,尹康凯,李善平.高可用性集群中多个节点的热切换研究[J].计算机应用研究,2005,22(3):85-86. 被引量：2
4李明明,李伟.基于HDFS的高可靠性存储系统的研究[J].西安科技大学学报,2016,36(3):428-433. 被引量：7
5金岩.基于备份节点无线传感器网络设计策略[J].微纳电子技术,2007,44(7):461-464.
6徐杰,陈良彬.一种高效的P2P系统[J].漯河职业技术学院学报,2012,11(5):31-32.
7李强,陈良彬.一种基于三层聚类结构的高效P2P系统[J].信阳农业高等专科学校学报,2009,19(4):132-134.
8张亚昕.面向云计算的虚拟机动态迁移技术研究[J].计算技术与自动化,2016,35(1):82-85. 被引量：4
9胡英,娄红.物联网中考虑节点故障的报文调度方案[J].计算机工程,2016,42(11):32-37.
10康征贤.一种基于纯P2P模式下的提高数据可用性的方案[J].有线电视技术,2013,20(10):58-60.

微型机与应用

2013年第16期

浏览历史

内容加载中请稍等...

基于单元集群的MapReduce中节点失效的改进被引量：1

参考文献7

二级参考文献45

共引文献1347

同被引文献1

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于单元集群的MapReduce中节点失效的改进 被引量：1

参考文献7

二级参考文献45

共引文献1347

同被引文献1

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于单元集群的MapReduce中节点失效的改进被引量：1