Design of a Big Data Analysis Platform Based on SparkR (cited by: 2)

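Abstract: When telecom operators profile user characteristics from DPI (deep packet inspection) data combined with IT system data and network element platform data, they face low efficiency in data analysis and mining. By analyzing the causes of this inefficiency and taking the characteristics of DPI data into account, this paper builds a big data analysis platform on SparkR, an open-source big data analysis and mining technology, to make user behavior analysis and mining more efficient. The platform gives telecom operators the capability to analyze and mine PB-scale data.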
Source: 《电子技术与软件工程》 (Electronic Technology & Software Engineering), 2016, Issue 21, p. 184 (1 page)
References (3)

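  • 1. A Brief Discussion on the Development and Implementation of Distributed Computing (Part 1). http://www.cnblogs.com/mushroom/p/4959904.html, 2015.
  • 2. SparkR (R on Spark). http://spark.apache.org/docs/latest/sparkr.html, 2016.
  • 3. Liu Zhiqiang, Gu Rong, Yuan Chunfeng, Huang Yihua. Research on parallelization of classification algorithms based on SparkR [J]. Journal of Frontiers of Computer Science and Technology, 2015, 9(11): 1281-1294. (cited by: 14)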
