I will have to try a few of those tweaks to the configuration we have;
they may help a lot. We are running on bare metal hardware, though we are
logging quite a bit, so that likely doesn't help.
I would say high throughput for us is on the order of 100 jobs completing
simultaneously, then the scheduler trying to fill those cores again, only
to have them become available again almost immediately. Essentially the
master gets so busy that it won't respond to any outside probing. The only
way to get any info is to watch the log roll by, as sdiag is also
unresponsive.
Again, we will have to try some of that tuning; it should be helpful.
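
For the record, the sort of change I have in mind is roughly the following
in slurm.conf. The values are just placeholders I pulled from skimming that
high-throughput page, not anything we have tested here yet:

# Cut logging back while we test
SlurmctldDebug=info
SlurmdDebug=info
# How long completed job records stay in slurmctld memory (300 is the default)
MinJobAge=300
# Give RPCs longer before clients time out against a busy slurmctld
MessageTimeout=30
# Don't try to schedule each job individually at submit time
SchedulerParameters=defer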
-Paul Edmon-
On 1/26/2014 7:21 PM, Moe Jette wrote:
A great deal depends upon your hardware and configuration. Slurm
should be able to handle a few hundred jobs per second when tuned for
high throughput as described here:
http://slurm.schedmd.com/high_throughput.html
If it is not tuned for high throughput (say, with lots of logging, or
running on a virtual machine), then the slurmctld daemon will definitely bog
down. What sort of throughput were you seeing? Did the jobs just exit
right away?
Moe Jette
SchedMD
Quoting Paul Edmon <[email protected]>:
So I've found that if someone submits a ton of jobs that have a very
short runtime, Slurm tends to thrash, as jobs are launching and exiting
pretty much constantly. Is there an easy way to enforce a minimum
runtime?
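
To make the question concrete, the sort of enforcement I am picturing is
something like the little sbatch wrapper sketched below. The wrapper idea,
the script itself, and the 10-minute threshold are purely my own
illustration, not anything Slurm ships; a server-side submit filter (a
job_submit plugin, say) would presumably be a more robust place to do this.

#!/usr/bin/env python3
# Hypothetical sbatch wrapper that refuses jobs asking for a very short
# time limit.  The script, the wrapper idea and MIN_MINUTES are all
# assumptions for illustration; they are not part of Slurm.
import os
import sys

MIN_MINUTES = 10  # assumed site policy: refuse anything shorter than this


def requested_minutes(argv):
    """Return the --time/-t request in minutes, or None if absent/unparsable.

    Handles the common forms "M", "M:S", "H:M:S" and "D-H[:M[:S]]"; a real
    filter would need to cover every format sbatch accepts (e.g. UNLIMITED).
    """
    value = None
    for i, arg in enumerate(argv):
        if arg.startswith("--time="):
            value = arg.split("=", 1)[1]
        elif arg in ("--time", "-t") and i + 1 < len(argv):
            value = argv[i + 1]
    if value is None:
        return None
    try:
        days = 0
        rest = value
        if "-" in value:
            day_part, rest = value.split("-", 1)
            days = int(day_part)
        parts = [int(p) for p in rest.split(":")]
        if "-" in value:                      # D-H, D-H:M or D-H:M:S
            hours = parts[0]
            minutes = parts[1] if len(parts) > 1 else 0
        elif len(parts) == 3:                 # H:M:S
            hours, minutes = parts[0], parts[1]
        else:                                 # M or M:S
            hours, minutes = 0, parts[0]
    except ValueError:
        return None
    return days * 1440 + hours * 60 + minutes


def main():
    args = sys.argv[1:]
    minutes = requested_minutes(args)
    if minutes is not None and minutes < MIN_MINUTES:
        sys.exit("Requested time limit (%d min) is below the %d min site "
                 "minimum; please batch short tasks together."
                 % (minutes, MIN_MINUTES))
    # Otherwise hand the submission off to the real sbatch unchanged.
    os.execvp("sbatch", ["sbatch"] + args)


if __name__ == "__main__":
    main()

A wrapper like that would at least stop the flood at submission time,
though it obviously does nothing about jobs that request a long limit and
then exit right away.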
-Paul Edmon-