基于优先级的Three-Queue调度算法研究被引量：4

Research of Three-Queue Scheduling Algorithms Based on Priority

下载PDF

导出

摘要针对Hadoop平台上调度算法存在的不足,提出了一种改进的调度算法———Triple-Queue算法。在充分考虑数据的本地性后,Triple-Queue算法设计了一种改进的优先级计算模型,以有效地区分用户作业的等级,同时又保证一定程度的公平性,进而减小作业执行时间,避免系统资源浪费。实验结果表明,随着数据量的提高,该算法执行效率明显提高,同时能够较好地解决数据本地性问题。 For solving the shortage of scheduling algorithms on the Hadoop platform,the paper proposed an improved scheduling algorithm-Triple-Queue algorithm.After taking full account of data locality,the Triple-Queue algorithm designs a improved computational model of priority,which can distinguish the user＇s job levels clearly and ensure a certain degree of fairness,so as to reduce the job execution time and avoid wasting system resources.The results of experiment show that the algorithm improves the efficiency significantly and solves the problem of data locality better with the increased amount of data.

作者顾宇周良丁秋林

机构地区南京航空航天大学计算机科学与技术学院

出处《计算机科学》 CSCD 北大核心 2011年第B10期253-256,共4页 Computer Science

关键词调度 Triple-Queue 数据本地性 MAPREDUCE Scheduling Triple-Queue Data locality Map reduce

分类号 TP393 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献11

1Dean J,Ghemawat S. MapReduee: Simplified data processing on large elusters[C]///OSDI' 04: Sixth Symposium on Operating System Design and Implementation. 2004:137-150.
2Zaharia M, Borthakur D, Sarma J S. Job seheduleing for multiuser mapreduce clusters[C]//Proceedings of the 5th European Conference IEEE. 2009 : 145-161.
3Matei Zaharia, Dhruba Borthakur and Joydeep Sen Sarma. Delay scheduling:a simple technique for achieving locality and fairness in cluster scheduleing[C]// EuroSys ' 10: Proceedings of the 5th European conference on Computer systems. 2010:265-278.
4Polo J, de Nadal D, Carrera D. Adaptive Task Scheduling for MultiJob MapReduce Environments[C] // Proceedings of the 2010 Eighth International Conference on Grid and Cooperative Computing IEEE. 2010:326-332.
5Thomas Sandholm and Kevin Lai. Dynamic proportional share scheduling in hadoop[C]//JSSPP ' 10: 15th Workshop on Job Scheduling Strategies for Parallel Processing. 2010:110-131.
6Polo J, Carrera D, Becerra Y. Performance-driven task co-scheduling for rnapr- educe environrnents[C]//Network Operations and Management Symposium(NOMS), IEEE. 2010 : 373-380.
7Tian C, Zhou H, Zha L. A dynamic MapReduce scheduler for heterogeneous workloads[C]//Proceedings of the 2009 Eighth International Con ference on Grid and Cooperative Computing. IEEE Computer Society, 2009 :218-224.
8陈全,邓倩妮.异构环境下自适应的Map-Reduce调度[J].计算机工程与科学,2009,31(A01):168-171. 被引量：21
9Apache Hadoop[OL]. http://hadoop, apache org/.
10Fair Scheduler for Hadoop[EB/OL]. http://Hadoop, apache. org/eommon/does/eurrent/Fair_scheduler, html, 2010-04-15.

二级参考文献10

1Vaquero L M, Rodero-Merino L, Caceres J, et al. A Break in the Clouds: Towards a Cloud DefinitionD]. ACM SIGCOMM Computer Communication Review, 2009, 39 ( 1 ) : 50- 55.
2Bryant R E. Data-Intensive Supercomputing: the Case for DISC[R]. CMU Technical Report CMU-CS-07-128, Department of Computer Science, Carnegie Mellon University, 2007.
3Dean J, Ghemawat S. MapReduce: Simplied Data Processing on Large Clusters[C]//Proc of OSDI '04,2004 : 137-150.
4Colbyranger, Raghuraman R, Penmetsa A. Evaluating MapReduce for Multi-Core and Multiprocessor Systems[C]//Proc of the IEEE 13th Int'l Syrup on High Performance Computer Architecture, 2007 : 13-24.
5Kruijf M D, Sankaralingam K. MapReduce for the Cell B. E. Architecture[-R]. Technical Report CS-TR-2007-1625, University of Wisconsin Computer Sciences University of Wisconsin, 2007.
6He B S, Fang W B, Luo Q, et al. Mars: A MapReduce Framework on Graphics Processors[C]//Proc of the 17th Int'l Conf on Parallel Architectures and Compilation Techniques, 2008 : 260-269.
7Apache Hadoop. Hadoop [EB/OL]. [2009-03-06]. http://hadoop, apache, org/.
8Yahoo. Yahoo! Hadoop Tutorial [EB/OL]. [2009-02-27]. http:// public, yahoo, com/gogate/hadoop-tutorial/start-tutorial, html.
9Ghemawat S, Gogioff H, Leung P T. The Google File System[C]//Proc of the 19th ACM Syrnp on Operating Systems Principles, 2003 : 29-43.
10Zaharia M, Konwinski A, Joseph A D. Improving MapReduce Performance in Heterogeneous Environments [C]//Proc of the 8th Usenix Syrup on Operating Systems Design and Implementation, 2008 : 29-42.

共引文献20

1李丽英,唐卓,李仁发.基于LATE的Hadoop数据局部性改进调度算法[J].计算机科学,2011,38(11):67-70. 被引量：17
2李鑫,张鹏.Hadoop集群公平调度算法的改进与实现[J].电脑知识与技术,2012,8(1):166-168. 被引量：6
3邹世军,赵红武.基于Hadoop集群的加权循环算法的研究[J].工业控制计算机,2012,25(10):65-66.
4杨立身,余丽萍.异构环境下增强的自适应MapReduce调度算法[J].计算机工程与应用,2013,49(19):39-43. 被引量：5
5陈吉荣,乐嘉锦.基于Hadoop生态系统的大数据解决方案综述[J].计算机工程与科学,2013,35(10):25-35. 被引量：118
6何翔,李仁发,唐卓.一种异构环境下的基于MapReduce任务调度改进机制[J].计算机应用研究,2013,30(11):3370-3373. 被引量：8
7李锋刚,魏炎炎,杨龙.基于和声算法异构Hadoop集群资源分配优化[J].计算机工程与应用,2014,50(9):98-102. 被引量：5
8李燕歌,张治斌,王娜.基于负载均衡的MapReduce后备任务上限自适应算法[J].计算机应用研究,2015,32(1):67-70. 被引量：3
9杨倩茹,黄梦醒,万兵.一种引入内存平衡的Hadoop平台作业调度算法[J].小型微型计算机系统,2014,35(12):2708-2712. 被引量：4
10张连义,杜中军,李震.Hadoop平台公平调度算法研究与优化[J].计算机时代,2014(12):45-47. 被引量：1

同被引文献92

1李振东,谢立.Web服务器群的QoS确保及其接纳控制研究[J].计算机研究与发展,2005,42(4):662-668. 被引量：9
2施朝健,张明铭.Logistic回归模型分析[J].计算机辅助工程,2005,14(3):74-78. 被引量：23
3段海滨.蚁群算法原理及其应用[M].北京:科学出版社,2006:33-35.
4Dean J, Ghemawat S. MapReduee:Simplified data processing on large clusters[ C]//Proc of Sixth Symposium on Operating System Design and Implementation. Berkeley : USENIX Asso- ciation, 2004 : 137-150.
5Zaharia M, Borthakur D, Sarma J S. Job scheduling for multi- user Mapreduce clusters[ C ]//Proceedings of the 5th Europe- an Conference. Washington : IEEE Computer Society, 2009 : 145-161.
6Zaharia M, Borthakur D, Sarma J S. Delay scheduling: a simple technique for achieving locality and fairness in cluster schedu- ling [ C ]//Proceedings of the 5th European Conference on Computer Systems. New York : ACM ,2010:265-278.
7Jorddal P, Claris C, David C, et al. Resourceaware adaptive scheduling for MapReduce clusters [ C ]//Middleware 2011 - ACM/IFIP/USENIX 12th International Middleware Confer- ence. New York : ACM ,2011 : 187-205.
8Apache Hadoop [ EB/OL]. 2012-04-16. http://hadoop, a- pache, org/.
9CloudSim [ EB/OL ]. 2012 - 02 - 11. http ://www. cloudbus. org/cloudsim/.
10Hadoop公平调度算法[EB/OL].2010-02-19.http://ha-doop.apache.org/docs/rO.20.2/fair_scheduler.html.

引证文献4

1董新华,李瑞轩,周湾湾,王聪,薛正元,廖东杰.Hadoop系统性能优化与功能增强综述[J].计算机研究与发展,2013,50(S2):1-15. 被引量：70
2秦军,张建平,王昊,魏家宾.基于蚁群优化算法的MapReduce集群调度策略[J].计算机技术与发展,2013,23(6):74-78. 被引量：2
3徐焕良,翟璐,薛卫,任守纲.Hadoop平台中MapReduce调度算法研究[J].计算机应用与软件,2015,32(5):1-6. 被引量：11
4帅仁俊,沈阳,陈平,潘静,董亚楠.基于logistic回归模型的Hadoop本地任务调度优化算法[J].计算机应用研究,2017,34(3):727-729. 被引量：7

二级引证文献88

1王少锋,伍少成,刘涛,邓琨,黄兵.对Hadoop的用电信息大数据计算服务应用分析[J].自动化与仪器仪表,2016(4):221-222. 被引量：6
2谢彦祥,刘天琪,苏学能.Hadoop架构下基于分布式粒子群算法的暂态稳定评估特征量选择[J].电网技术,2018,42(12):4107-4115. 被引量：7
3沈楠.云平台下电磁感应带钢稳定系统的自动化部署[J].消费电子,2014(16):171-171.
4任桂禾,王晶.浅谈大数据处理技术架构的演进[J].信息通信技术,2014,8(6):47-51. 被引量：3
5李冰利,刘钊远,贾威威.基于NBD的弹性云存储研究与设计[J].计算机与数字工程,2015,43(2):343-346. 被引量：2
6秦军,童毅,戴新华,林巧民.基于MapReduce数据密集型负载调度策略研究[J].计算机技术与发展,2015,25(4):48-52. 被引量：2
7戴中华,盛鸿彬,王丽莉.基于Hadoop平台的大数据分析与处理[J].通讯世界（下半月）,2015(3):59-60. 被引量：7
8刘青,鲍爱华,倪桂强.大数据技术专题讲座(二) 第3讲面向大数据处理的MapReduce优化技术[J].军事通信技术,2015,36(2):81-87. 被引量：1
9曹畋.浅析建立面向图书馆用户的HDFS云存储服务系统[J].农业图书情报学刊,2015,27(9):53-56. 被引量：1
10马艳,陈玉峰,刘兴华,郭志红,苏东亮,孙溪.面向设备状态评估的大数据底层平台设计（英文）[J].山东电力技术,2015,42(8):23-26. 被引量：1

1黄仁,王良伟.基于主题相关概念和网页分块的主题爬虫研究[J].计算机应用研究,2013,30(8):2377-2380. 被引量：9
2张霄宏,海林鹏,贾宗璞,沈记全,赵文涛.同构Hadoop环境作业执行时间计算方法[J].计算机工程与应用,2014,50(10):249-252. 被引量：1
3曹旭,张云华.Hadoop平台下计算模型中调度策略的研究[J].计算机应用与软件,2013,30(9):208-210. 被引量：5
4陈若飞,姜文红.Hadoop作业调度本地性的研究与优化[J].软件,2015,36(2):64-68. 被引量：5
5万兵,黄梦醒,段茜.一种基于资源预取的Hadoop作业调度算法[J].计算机应用研究,2014,31(6):1639-1643. 被引量：4
6孙瑞琦,杨杰,高瞻,贺志强.一种提高虚拟化Hadoop系统数据本地性的资源调度方法[J].计算机研究与发展,2014,51(S2):189-198. 被引量：5
7蒋炎华.网格环境下任务的执行时间预测技术研究[J].计算机工程与设计,2011,32(10):3428-3430. 被引量：4
8张啸,高原,王晓亮,葛以踊,杨海祥,万书鹏.绿色数据中心数据处理型框架中的数据管理[J].系统仿真学报,2016,28(3):592-599. 被引量：2
9王越峰,陈福洪.Hadoop集群环境下本地性调度算法改进[J].软件工程,2016,19(12):36-39.
10沈记全,易月婵,张霄宏.Hadoop作业执行时间在线计算方法[J].河南理工大学学报（自然科学版）,2014,33(6):776-780.

计算机科学

2011年第B10期

浏览历史

内容加载中请稍等...

基于优先级的Three-Queue调度算法研究被引量：4

参考文献11

二级参考文献10

共引文献20

同被引文献92

引证文献4

二级引证文献88

相关作者

相关机构

相关主题

浏览历史

基于优先级的Three-Queue调度算法研究 被引量：4

参考文献11

二级参考文献10

共引文献20

同被引文献92

引证文献4

二级引证文献88

相关作者

相关机构

相关主题

浏览历史

基于优先级的Three-Queue调度算法研究被引量：4