摘要
检查点机制在现代并行分布式计算中有着重要的应用。本文介绍了一种基于Linux的检查点系统的设计和实现方法 ,它对系统容错、进程迁移和动态负载平衡的研究都具有重要的意义。
Checkpoint and recovery technology play an important role in parallel and distributed computing. In this paper, a checkpoint system based on Linux is implemented. It is valuable for the research on fault-tolerance, process migration and dynamic load-balance.
出处
《计算机与数字工程》
2004年第4期6-9,共4页
Computer & Digital Engineering
关键词
检查点
容错
恢复
checkpoint, fault-tolerance, recovery