Try setting it less than 5 min.

If that works then it might be the max_switch_wait setting for the cluster.
I forget exactly where to set it now but I'm having a  strong sense of deja
vu reading this and I remember thinking that's a very short default when I
changed it to 24 hours.

Antony
On 26 Jan 2016 10:08, "Danny Rotscher" <[email protected]> wrote:

>
> Hello,
>
> there seems to be a bug with the lua job submit plugin.
> I want to change the variable wait4switch to a specific value, but my
> changes are ignored by Slurm.
> The value is always 5 minutes.
>
> job_submit.lua.all:
>
> [...]
> function slurm_job_submit(job_desc, part_list, submit_uid)
>     [...]
>     job_desc.wait4switch=6000
>     [...]
>
> scontrol output of the job:
>
> JobId=391 JobName=sleep
>    UserId=rotscher(19426) GroupId=rotscher(19426)
>    Priority=25291 Nice=0 Account=rotscher QOS=normal
>    JobState=RUNNING Reason=None Dependency=(null)
>    Requeue=1 Restarts=0 BatchFlag=0 Reboot=0 ExitCode=0:0
>    RunTime=00:00:33 TimeLimit=08:00:00 TimeMin=N/A
>    SubmitTime=2016-01-26T10:38:39 EligibleTime=2016-01-26T10:38:39
>    StartTime=2016-01-26T10:38:39 EndTime=2016-01-26T18:38:39
>    PreemptTime=None SuspendTime=None SecsPreSuspend=0
>    Partition=sandy2 AllocNode:Sid=sandbox-login:13646
>    ReqNodeList=(null) ExcNodeList=(null)
>    NodeList=sandbox-node1
>    BatchHost=sandbox-node1
>    NumNodes=1 NumCPUs=4 CPUs/Task=1 ReqB:S:C:T=0:0:*:*
>    TRES=cpu=4,mem=40,node=1
>    Socks/Node=* NtasksPerN:B:S:C=0:0:*:1 CoreSpec=*
>    MinCPUsNode=1 MinMemoryCPU=10M MinTmpDiskNode=0
>    Features=(null) Gres=(null) Reservation=(null)
>    Shared=0 Contiguous=0 Licenses=(null) Network=(null)
>    Command=sleep
>    WorkDir=/tmp
>    Switches=1@00:05:00
>
>    Power= SICP=0
>
> I created some debug output which prints out the job_desc.wait4switch
> variable at the end of the job submit script.
> The output shows me the correct value.
>
> Thank you for any help!
>
> Kind regards,
> Danny
>

Reply via email to