期刊文献+

一种网格环境下的动态故障检测算法 被引量:9

A Dynamic Fault Detection Algorithm under Grid Environments
下载PDF
导出
摘要 针对现有网格系统出错几率较大、已有故障检测算法不能有效满足网格系统需求问题,提出了一种网格环境下的动态故障检测算法.根据网格系统的特点,基于不可靠故障检测思想,建立了网格系统模型和故障检测模型;结合心跳(heartbeat)策略和灰色预测方法,设计了一种动态心跳机制,并给出了预测模型和实时预测策略;提出了基于该动态心跳机制的网格故障检测算法,分析了算法的可靠性.仿真实验结果表明,该算法是正确、有效的,可用于网格环境下的故障检测. Aimed at the problems that grids are more prone to failures, and existing failure detection algorithms can not satisfy the unique requirement of grids, a dynamic failure detection algorithms is presented. According to the characteristics of grids and unreliable failure detection theory, the authors establish models for grid systems and fault detection respectively; Combining heartbeat strategy with grey prediction method, they also design a dynamic heartbeat strategy, and present the prediction model, as well as a real-time prediction strategy. Furthermore, on the basis of the dynamic heartbeat strategy, a fault detection algorithm is put forward for grid environments, and its dependability is analyzed. Finally, simulation results shows that the algorithm is valid and effective, and can be used for fault detection under grid environments.
出处 《计算机研究与发展》 EI CSCD 北大核心 2006年第11期1870-1875,共6页 Journal of Computer Research and Development
基金 教育部跨世纪优秀人才支持计划基金项目(NCET-04-0843) 重庆市自然科学基金项目(2005BB2192)
关键词 网格 故障检测 灰色预测 心跳机制 grid fault detection grey prediction heartbeat mechanism
  • 相关文献

参考文献11

  • 1I Foster.The Grid:A new infrastructure for 21st century science[J].Physics Today,2002,55(22):42-47
  • 2R Medeiros,W Cirne,F Brasileiro.Faults in grids:Why are they so bad and what can be done about it[C].In:Proc of the 4th Int'l Workshop on Grid Computing.Los Alamitos,CA:IEEE Computer Society Press,2003.18-24
  • 3S Hwang,C Kesselman.A flexible framework for fault tolerance in the grid[J].Journal of Grid Computing,2003,1(3):251-272
  • 4P Stelling,C Dematteis,I Foster,et al.A fault detection service for wide area distributed computations[J].Cluster Computing,1999,(2):117-128
  • 5J H Abawajy.Fault detection service architecture for grid computing systems[G].In:Proc of ICCSA 2004,Lecture Note in Computer Science 3044.Berlin:Springer,2004.107-115
  • 6A Jain,R K Shyamasundar.Failure detection and membership in grid environments[C].In:Proc of the 5th IEEE/ACM Int'l Workshop on Grid Computing (GRID'04).Los Alamitos,CA:IEEE Computer Society Press,2004.44-52
  • 7T D Chandra,S Toueg.Unreliable failure detectors for reliable distributed systems[J].Journal of ACM,1996,43(2):225-267
  • 8W Chen,S Toueg,M K Aguilera.On the quality of service of failure detectors[J].IEEE Trans on Computers,2002,51(2):13-32
  • 9M Bertier,O Marin,P Sens.Implementation and performance evaluation of an adaptable failure detector[C].In:Proc of IEEE Int'l Conf on Dependable Systems and Networks (DSN'02).Los Alamitos,CA:IEEE Computer Society Press,2002.354-363
  • 10N Hayashibara,X Défago,R Yared,et al.The φ accrual failure detector[C].In:Proc of the 23rd IEEE Int'l Symp on Reliable Distributed Systems (SRDS'04).Los Alamitos,CA:IEEE Computer Society Press,2004.66-78

同被引文献87

引证文献9

二级引证文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部