摘要
传统数据库处理分析大量历史数据的性能有限,无法达到满意效果。针对该问题,通过对商立方体的研究,提出等价区间的概念,并利用区间之间的独立性,使商立方体能更好地适应分布式环境下的查询。同时,提出了商立方体在Spark集群上的并行查询算法,充分利用等价区间点查询面命中的特性,使在保证查询有效的情况下尽可能并行化。最后,通过实验验证了算法高效性。
Traditional database processing has limited performance in analyzing large amounts of historical data and cannot achieve satisfactory results.Aiming at this problem,through the study of the business cube,we propose the concept of equivalence interval.The independence between the intervals is used to make the quotient cube better adapt to the query in the distributed environment.At the same time,the parallel query algorithm of the business cube on the Spark cluster is proposed,which makes full use of the characteristics of the equivalent interval point query surface hit so as to ensure parallelization as much as possible while ensuring the query is valid.Finally,the efficiency of the algorithm is verified by experiments.
作者
张正凡
都仪敏
ZHANG Zheng-fan;DU Yi-min(Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming 650500,China)
出处
《软件导刊》
2018年第11期37-39,44,共4页
Software Guide