On 02/23/2011 05:52 PM, Anthony Liguori wrote:
I still don't see.  What would you do with thousands of checkpoints?


Er, hit send to quickly.

HPC is a big space where checkpointing is actually useful. An HPC workload may take weeks to run to completion. If something fails during the run, it's a huge waste of time. However, if you do regularl checkpointing, a failure may only lose a few minutes of work instead of the entire weeks worth of work.

The trick is to delete snapshot N-M after taking snapshot N (for a small constant M).

--
error compiling committee.c: too many arguments to function


Reply via email to