Hello there,

Is there a way to combine preemption, checkpointing and automatic requeue?

The behavior I try to configure is:

- Submit a batch job with checkpointing (based on BLCR)
- On preemption - take a checkpoint and kill (like the CHECKPOINT mechanism
in PreemptMode)
- Resubmit the  job (which should start from the last checkpoint)

The documentation is clear about "Checkpointed jobs are not automatically
restarted." so PreemptMode=CHECKPOINT isn't a solution.

Is there anyway to hook into the process and resubmit the jobs?

Thank you,
  Yoel

Reply via email to