Hello,

Have you tried upgrading to Slurm 16.05? This commit, which made it into
16.05, looks promising:
https://github.com/SchedMD/slurm/commit/33bf4d29aa72a29ea40d5202a50efdfeeb9f4425

Best regards,
Maciej

2016-06-13 8:58 GMT+02:00 Loris Bennett <loris.benn...@fu-berlin.de>:

>
> Hi,
>
> One of our users was carrying out some tests and running some very short
> jobs with a TimeLimit of 60s.  However, because one of the nodes had to
> be booted, which takes a couple of minutes, the jobs were terminated
> with TIMEOUT as the state.
>
> I am aware that we can set BatchStartTimeout to a larger value, but
> wouldn't it make more sense if the run-time for the job only started to
> accumulate once the slurmd on the node became available?
>
> Cheers,
>
> Loris
>
> --
> Dr. Loris Bennett (Mr.)
> ZEDAT, Freie Universität Berlin         Email loris.benn...@fu-berlin.de
>
