Re: [slurm-users] Slurm 17.11 X11 support questions

2018-01-19 Thread Mjelde, Matthew J
Thanks Marcus. I thought I had installed that dependency, but I guess I installed it somewhere other than where I was compiling slurm at. Once installed it compiled with x11 support. However, now I have another problem. I am able to start an x11 job using the command: `srun --pty --x11 bash`

Re: [slurm-users] execute job regardless the exit status of dependent jobs

2018-01-19 Thread Michael Robbert
George, I haven't tested or used this, but why won't afterany do what you want?   afterany:job_id[:jobid...] This job can begin execution after the specified jobs have terminated. Mike On 1/19/18 11:09 AM, Hwa, George wrote: I have a “reaper” job that harve

[slurm-users] execute job regardless the exit status of dependent jobs

2018-01-19 Thread Hwa, George
I have a "reaper" job that harvests the results from a set of proceeding jobs, regardless their exit status. However, none of the dependency options (After, AfterOk, AfterNotok, etc) seems to do what I want. Any idea how to get around this? Thanks George [KLA-Tencor%20Confidential%20Need-to

[slurm-users] Requirement to use QOS?

2018-01-19 Thread Loris Bennett
Hi, Some while ago I defined several QOS thus: Name Priority MaxWall MaxJobs MaxSubmit -- -- --- --- - normal 0 short 1003:00:00 1020 medium 1 3-00:00:00 1

Re: [slurm-users] Nagios or Other Monitoring Plugins

2018-01-19 Thread John Hearns
Not specifically Slurm, but it can be useful to have alerts on jobs which either will never start or which are 'stalled'. You might want to have an alert on jobs which (say) request more slots or nodes than physicall exist, so the users job will never run. Or you can look for 'stalled' jobs where t