
A Preliminary Study on Building a Hadoop-Based Distributed Data Warehouse for Large Commercial Banks (cited by: 3)
Abstract: As traditional banking business expands and Internet services develop, the data volumes of commercial banks grow daily, placing ever higher demands on data storage, management, and application. By building a big data platform on Hadoop, using the distributed file system HDFS, the SQL analysis engine Inceptor, the NoSQL database Hyperbase, and the stream processing tool Stream, this paper explores how a Hadoop distributed data warehouse for a large commercial bank was constructed, and reports the migration from a traditional relational data warehouse based on a centralized storage architecture to the distributed data warehouse. The distributed data warehouse supports storage of structured and unstructured data, ETL scheduling management, historical data retrieval, interactive analysis, and streaming data processing. In practice, compared with the traditional relational data warehouse based on a centralized storage architecture, the distributed data warehouse substantially improves the efficiency of data storage and data services.
Source: Computer Applications and Software (《计算机应用与软件》), 2017, No. 8, pp. 72-75, 113 (5 pages in total)
Keywords: distributed data warehouse; Hadoop; batch data processing (ETL); historical data retrieval; interactive analysis
