期刊文献+

一种面向大规模副本存储系统的可靠性模型 被引量:7

An Analytical Model for Large-Scale Storage System with Replicated Data
下载PDF
导出
摘要 可靠性对大规模存储系统至关重要,在大规模存储系统中设备失效日趋频繁,副本技术成为提高系统可靠性的主流技术之一.基于Markov模型,针对多副本存储系统建立了度量系统可靠性的理论模型.该模型能够反应失效检测延迟对系统可靠性的影响.通过该模型还可以度量存储系统关键参数如系统规模、副本阶数、单节点容量、单节点平均失效时间、数据对象平均大小、平均修复带宽等对系统可靠性的影响,从而为存储系统的设计提供理论基础. Nowadays storage systems become larger and larger, so the number of storage devices is increasing rapidly, which makes storage device failure occur quite frequently in large scale storage systems. Data replica technology begins to be adopted prevalently to enhance storage system reliability. When designing a large scale storage system, there are many factors that could affect the reliability of the storage system, such as failure detection latency, storage node capacity selection, data object size design, replica rank selection and so on. On the other hand, system reliability can not be exactly experimented, so a theoretical model is needed to evaluate it. In this paper, an analytical framework is represented to evaluate the reliability for large scale storage systems which adopt replica technology to protect data. Based on the Markov model, this analytical model could provide quantitative answers to measure the impact of a series of storage system design factors on the reliability of storage systems, such as the rank of the replicated data, the capacity of the storage system, the capacity of storage nodes, the size of data object, the repair bandwidth, mean time failure detection latency and so on. Hence, many storage system design tradeoffs could be reasoned by this framework.
出处 《计算机研究与发展》 EI CSCD 北大核心 2009年第5期756-761,共6页 Journal of Computer Research and Development
基金 国家自然科学基金项目(90612018) 科技部"十一五"国家科技支撑计划重大项目(2006BAA02A17) 国家"九七三"重点基础研究发展计划基金项目(2004CB318205)~~
关键词 存储系统 可靠性 多副本 MARKOV模型 失效检测 storage system reliability replica Markov model failure detection
  • 相关文献

参考文献5

  • 1IDC White Paper-Sponsored by EMC. The expanding digital universes A forecast of worldwide information growth through 2010 [OL]. [2007-08-06]. http://www. eme. com/ about/destination/digitaluniverse/
  • 2Patterson D A, Gibson G, Katz R H. A case for redundant arrays of inexpensive disks (RAID)[C] //Proc of the 1988 ACM SIGMOD Int Conf on Management of Data. New York: ACM, 1988:109-116
  • 3Xin Q, Miller E L, Long D D E, et al. Reliability mechanisms for very large storage systems [C]//Proc of the 20th IEEE/11th NASA Goddard Conf on Mass Storage Systems & Technologies. Piscataway, NJ: IEEE, 2003: 146-156
  • 4Lian Q, Chen W, Zhang Z. On the impact of replica placement to the reliability of distributed brick storage systems [C]//Proc of the 25th ICDCS. Piscataway, NJ: IEEE, 2005
  • 5Ramabhadran S, Pasquale J. Analysis of long-running replicated systems [C] //Proc of INFOCOM. Piscataway, NJ: IEEE, 2006:1-9

同被引文献40

  • 1韩德志,谢长生,李怀阳.存储备份技术探析[J].计算机应用研究,2004,21(6):1-4. 被引量:49
  • 2韩德志,汪洋,李怀阳.远程备份及关键技术研究[J].计算机工程,2004,30(22):34-36. 被引量:11
  • 3张世武,吴月华,杨杰,刘际明.基于信息寻觅智能体的网络用户浏览模式研究[J].计算机研究与发展,2004,41(11):1966-1973. 被引量:6
  • 4陈宁江,魏峻,杨波,黄涛.Web应用服务器的适应性失效检测[J].软件学报,2005,16(11):1929-1938. 被引量:18
  • 5LINDHORST T,LUKAS G,NETT E,et al.Data-mining-based link failure detection for wireless mesh networks[C] // Proceedings of the 29th IEEE International Symposium on Reliable Distributed Systems.Piscataway,NJ:IEEE Press,2010:353-357.
  • 6TSAI W,SHAO Q,SUN X,ELSTON J.Real-time service-oriented cloud computing[C] // Proceedings of the 6th World Congress on Services (SERVICES-1).Piscataway,NJ:IEEE Press,2010:473-478.
  • 7GREVE F,SENS P,ARANTES L,et al.A failure detector for wireless networks with unknown membership[C] // Proceedings of the 17th International Conference on Parallel Processing.Berlin:Springer-Verlag,2011,Ⅱ:27-38.
  • 8DING X,HOU Y,GU Z,et al.A failure detection model based on message delay prediction[C] // GCC'09:Proceedings of the 2009Eighth International Conference on Grid and Cooperative Computing.Washington,DC:IEEE Computer Society,2009:24-30.
  • 9CHEN W,TOUEG S,AGUILERA M K.On the quality of service of failure detectors[J].IEEE Transactions on Computers,2002,51(5):561-580.
  • 10HAYASHIBARA N,DEFAGO X,YARED R,et al.The (ψ) accrual failure detector[C] //SRDS'04:Proceedings of the 23rd IEEE International Symposium on Reliable Distributed Systems.Washington,DC:IEEE Computer Society,2004:66-78.

引证文献7

二级引证文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部