5Dean J, Ghemawat S. MapReduce : Simplified data process- ing on large clusters [ J ]. Communications of the ACM, 2008,51 ( I ) : 107 - 113.
6Zikopoulos P, Eaton C. Understanding big data: Analytics for enterprise class Hadoop and streaming data[EB/OL]. (2011 - 10) [2015 -06 - 10]. http://public, dhe. ibm. corn/ common/ssi/ecm/im/en/im114296usen/IML14296USEN. PDF.
7Olston C,Reed B,Srivastava U,et al. Pig latin: A not-so- foreign language for data processing [ C ]//Proceedings of the 2008 ACM SIGMOD international conference on Man- agement of data. Vancouver, Canada: ACM, 2008 : 1099 - 1110.
8Thusoo A,Sarma J S, Jain N, et al. Hive-a petabyte scale data warehouse using hadoop[C]//2010 IEEE 26th In- ternational Conference on Data Engineering (ICDE). Long Beach, California, USA : IEEE, 2010 : 996 - 1005.
9Herodotou H, Babu S. Profiling, what-if analysis, and cost- based optimization of MapReduce programs [ J]. Proceed- ings of the VLDB Endowment, 2011,4 ( 11 ) : 1111 - 1122.
10Yang H L,Luan Z Z,Li W J,et al. MapReduce workload modeling with statistical approach [ J]. Journal of Grid Computing,2012,10 (2) :279 - 310.