Thank you Chris. That means every user has to put --no-kill in their sbatch command. It would be nice if there are options in slurm configuration to implement that.
Warm regards, Teshome ________________________________________ From: Chris Samuel <[email protected]> Sent: Monday, May 19, 2014 2:10 PM To: slurm-dev Subject: [slurm-dev] Re: Requeue and resubmit after networking issue On Mon, 19 May 2014 04:37:03 AM Teshome Dagne Mulugeta wrote: > Is there a way to keep the running jobs continue after a netwokring issue > between slurm daemon and nodes? I suspect the answer is the --no-kill option for sbatch. Best of luck! Chris -- Christopher Samuel Senior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiative Email: [email protected] Phone: +61 (0)3 903 55545 http://www.vlsci.org.au/ http://twitter.com/vlsci
