There were changes in Slurm version 2.6 with respect to lock handling
which may affect this. If you are using an earlier version of Slurm,
that would be a reason to upgrade.
Quoting Paul Edmon <[email protected]>:
I will have to try a few of those tweaks to the configuration we
have. They may help a lot.
We are running on bare metal hardware, though we are logging quite a
bit, so that likely doesn't help.
I would say high throughput would be 100 jobs completing
simultaneously, and then it trying to schedule those cores again only
to have them become available again immediately. Essentially the
master gets so busy that it won't respond to any outside probing. The
only way to get any info is to watch the log roll by, as sdiag is
also unresponsive.
Again, we will have to try some of that machine tuning stuff. It
should be helpful.
-Paul Edmon-
On 1/26/2014 7:21 PM, Moe Jette wrote:
A great deal depends upon your hardware and configuration. Slurm
should be able to handle a few hundred jobs per second when tuned
for high throughput as described here:
http://slurm.schedmd.com/high_throughput.html
If not tuned for high throughput, say with lots of logging, running
on a virtual machine, etc., then the slurmctld daemon will
definitely bog down. What sort of throughput were you seeing? Did
the jobs just exit right away?
Moe Jette
SchedMD
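[As a rough illustration of the kind of slurm.conf tuning that guide
describes, a minimal sketch might look like the excerpt below. The
parameter names are real slurm.conf options, but the values are
placeholder assumptions rather than recommendations from this thread,
and not every option or value format is available in every Slurm
release:

    # Keep controller and slurmd logging light; verbose logging is a
    # common cause of slurmctld bogging down under high job throughput.
    SlurmctldDebug=error
    SlurmdDebug=error

    # Purge completed job records after five minutes and cap how many
    # jobs slurmctld keeps in memory at once.
    MinJobAge=300
    MaxJobCount=20000

    # Defer per-job scheduling at submit time and batch-schedule new
    # jobs instead, which helps when many short jobs start and exit.
    SchedulerParameters=defer,batch_sched_delay=10
]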
Quoting Paul Edmon <[email protected]>:
So I've found that if someone submits a ton of jobs that have a
very short runtime, Slurm tends to thrash, as jobs are launching and
exiting pretty much constantly. Is there an easy way to enforce a
minimum runtime?
-Paul Edmon-
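[The thread itself does not show a built-in minimum-runtime setting.
As one hedged illustration of how such a floor could be approximated
outside of Slurm, the sketch below wraps sbatch and rejects
submissions whose requested --time falls under an assumed ten-minute
minimum. The wrapper name, the ten-minute floor, the sbatch path, and
the simple --time parsing are all assumptions for illustration, and
jobs submitted by other means (srun, salloc, or the real sbatch
directly) would bypass it:

    #!/usr/bin/env python
    # Illustrative wrapper around sbatch: refuse jobs that request
    # less than a minimum walltime.  Install it under a different
    # name, or point SBATCH at the real binary's full path, so the
    # wrapper does not end up calling itself.
    import re
    import subprocess
    import sys

    MIN_MINUTES = 10            # assumed site policy, not a Slurm default
    SBATCH = "/usr/bin/sbatch"  # assumed location of the real sbatch

    def requested_minutes(args):
        # Return the requested --time in minutes, or None if it was not
        # given or uses a format this sketch does not parse.
        for i, arg in enumerate(args):
            if arg in ("-t", "--time"):
                value = args[i + 1] if i + 1 < len(args) else ""
            elif arg.startswith("--time="):
                value = arg.split("=", 1)[1]
            else:
                continue
            m = re.match(r"^(\d+)$", value)              # minutes
            if m:
                return int(m.group(1))
            m = re.match(r"^(\d+):(\d+):(\d+)$", value)  # hours:min:sec
            if m:
                hours, minutes, seconds = (int(x) for x in m.groups())
                return hours * 60 + minutes + (1 if seconds else 0)
            return None  # unrecognized form; let sbatch validate it
        return None

    def main():
        args = sys.argv[1:]
        minutes = requested_minutes(args)
        if minutes is not None and minutes < MIN_MINUTES:
            sys.exit("error: jobs must request at least %d minutes of "
                     "walltime (--time)" % MIN_MINUTES)
        # Pass everything else through to the real sbatch unchanged.
        sys.exit(subprocess.call([SBATCH] + args))

    if __name__ == "__main__":
        main()
]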