[slurm-dev] Re: How to restart a job "(launch failed requeued held)"

2016-10-27 Thread Lachlan Musicman
On 28 October 2016 at 09:20, Christopher Samuel wrote: > > On 28/10/16 08:44, Lachlan Musicman wrote: > > > So I checked the system, noticed that one node was drained, resumed it. > > Then I tried both > > > > scontrol requeue 230591 > > scontrol resume 230591 > > What

[slurm-dev] Re: How to restart a job "(launch failed requeued held)"

2016-10-27 Thread Christopher Samuel
On 28/10/16 08:44, Lachlan Musicman wrote: > So I checked the system, noticed that one node was drained, resumed it. > Then I tried both > > scontrol requeue 230591 > scontrol resume 230591 What happens if you "scontrol hold" it first before "scontrol release"'ing it? -- Christopher Samuel