Big Data Analytic Platforms: Changing the Priority from Scalability to Performance

Cited by: 5
Abstract: Existing big data analytic platforms, such as MapReduce and Spark, focus on scalability and fault tolerance at the expense of performance. We discuss the connections between performance and fault tolerance and show that they are not mutually exclusive; an important direction for big data analytic systems is to deliver both. Distributed graph processing systems are promising because their mutable data models strike a better tradeoff between performance and fault tolerance.
Source: ZTE Technology Journal (《中兴通讯技术》), 2016, No. 2, pp. 11-13 (3 pages)
Funding: National Basic Research Program of China ("973" Program) (2014CB340402); National Natural Science Foundation of China (61525202)
Keywords: big data; distributed and parallel processing; parallel programming; fault tolerance; scalability

References (13)

  • [1] DEAN J, GHEMAWAT S. MapReduce: Simplified Data Processing on Large Clusters[J]. Communications of the ACM, 2008, 51(1): 107-113. DOI: 10.1145/1327452.1327492.
  • [2] ZAHARIA M, CHOWDHURY M, DAS T, et al. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing[C]//Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation. USA: USENIX Association, 2012: 15-28.
  • [3] THUSOO A, SARMA S J, JAIN N, et al. Hive: A Warehousing Solution over a Map-Reduce Framework[J]. Proceedings of the VLDB Endowment, 2009, 2(2): 1626-1629. DOI: 10.14778/1687553.1687609.
  • [4] GROPP W, LUSK E, DOSS N, et al. A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard[J]. Parallel Computing, 1996, 22(6): 789-828. DOI: 10.1016/0167-8191(96)00024-5.
  • [5] BU Y, HOWE B, BALAZINSKA M, et al. HaLoop: Efficient Iterative Data Processing on Large Clusters[J]. Proceedings of the VLDB Endowment, 2010, 3(1): 285-296. DOI: 10.14778/1920841.1920881.
  • [6] EKANAYAKE J, et al. Twister: A Runtime for Iterative MapReduce[C]//Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing. USA: ACM, 2010: 810-818.
  • [7] MCSHERRY F, ISARD M, MURRAY D G. Scalability! But at what COST?[C]//15th Workshop on Hot Topics in Operating Systems (HotOS XV). USA: USENIX Association, 2015.
  • [8] KWAK H, et al. What is Twitter, a Social Network or a News Media?[C]//Proceedings of the 19th International Conference on World Wide Web. USA: ACM, 2010: 591-600.
  • [9] MALEWICZ G, et al. Pregel: A System for Large-Scale Graph Processing[C]//Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data. USA: ACM, 2010: 135-146.
  • [10] LOW Y, et al. Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud[J]. Proceedings of the VLDB Endowment, 2012, 5(8): 716-727.

Co-cited documents: 23

Citing documents: 5

Second-level citing documents: 24
