期刊文献+

一种云计算环境下大数据动态迁移策略 被引量:12

A Big Data Dynamic Migration Strategy in Cloud Computing Environment
下载PDF
导出
摘要 云计算环境中大数据应用在数据迁移方面遇到各种问题,主要表现为如何在迁移过程中减少网络访问次数,减少全局时间消耗,以及在提高效率的同时兼顾全局的负载均衡等。为此,对数据迁移进行建模,描述动态迁移策略,分别针对策略中的全局时间消耗、网络访问次数和全局负载均衡3个参数进行求解,并在云计算仿真平台Cloudsim下进行实验。结果表明,使用数据动态迁移策略后,任务完成时间比Zipf分布减少约10%,网络访问次数低于原始Zipf分布并趋于稳定;全局负载均衡方面,节点存储空间方差趋于0。 Big data applications meet various challenges in data migration in cloud computing environment. It mainly manifests in below aspects: reduce the number of network access, reduce the overall time consumption and improve the efficiency by the time of balancing the global load in the migration process and so on. Facing these challenges,it builds the problem model and descripts the dynamic migration strategy, then solves the global time consumption of data migration, the number of network access and global load balance in these three parameters. The cloud computing simulation experiment is done under Cloudsim experimental platform. The result shows that the proposed data dynamic migration strategy makes the task completion time reduced by 10% than Zipf distribution, network access number be lower than Zipf and tends to be stable. And in global load,the variance of the node' s store space is closed to zero.
出处 《计算机工程》 CAS CSCD 北大核心 2016年第5期13-17,共5页 Computer Engineering
基金 国家自然科学基金资助项目(51467007) 云南省应用基础研究计划基金资助项目(2013FZ020)
关键词 云计算 大数据 负载均衡 数据迁移 网络访问 数据集 cloud computing big data load balance data migration network access dataset
  • 相关文献

参考文献15

二级参考文献146

  • 1Deelman E,Chervenak A.Data management challenges of data-intensive scientific workflows//Proceedings of the IEEE International Symposium on Cluster Computing and the Grid(CCGRID).Lyon,France,2008:687-692.
  • 2Deelman E,Blythe J,Gil Y,Kesselman C,Mehta G,Patil S,Su M H,Vahi K,Livny M.Pegasus:Mapping scientific workflows onto the grid//Proceedings of the European Across Grids Conference(AxGrids).Nicosia,Cyprus,2004:11-20.
  • 3Ludascher B,Altintas I,Berkley C,Higgins D,Jaeger E,Jones M,Lee E A.Scientific workflow management and the Kepler system.Concurrency and Computation:Practice and Experience,2005,18(10):1039-1065.
  • 4Oinn T,Addis M,Ferris J,Marvin D,Senger M,Greenwood M,Carver T,Glover K,Pocock M R,Wipat A,Li P.Taverna:A tool for the composition and enactment of bioinformatics workflows.Bioinformatics,2004,20(17):3045-3054.
  • 5Ghemawat S,Gobioff H,Leung S T.The google file system.ACM SIGOPS Operating Systems Review,2003,37(5):29-43.
  • 6Wang L,Tao J,Kunze M,Castellanos A C,Kramer D,Karl W.Scientific cloud computing:Early definition and experience//Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications(HPCC).Dalian,China,2008:825-830.
  • 7Wieczorek M,Prodan R,Fahringer T.Scheduling of scientific workflows in the ASKALON grid environment.SIGMOD Record,2005,34(3):56-62.
  • 8Baru C,Moore R,Rajasekar A,Wan M.The SDSC storage resource broker//Proceedings of the IBMCentre for Advanced Studies Conference.Toronto,Canada,1998:1-12.
  • 9Churches D,Gombas G,Harrison A,Maassen J,Robinson C,Shields M,Taylor I,Wang I.Programming scientific and distributed workflow with Triana services.Concurrency and Computation:Practice and Experience,2006,18:1021-1037.
  • 10Chervenak A,Deelman E,Foster I,Guy L,Hoschek W,Iamnitchi A,Kesselman C,Kunszt P,Ripeanu M,Schwartzkopf B,Stockinger H,Stockinger K,Tierney B.Giggle:A framework for constructing scalable replica location services//Proceedings of the ACM/IEEE Conference on Supercomputing.Baltimore,Maryland,USA,2002:1-17.

共引文献480

同被引文献99

引证文献12

二级引证文献45

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部