[slurm-users] SLURM Install

2020-07-13 Thread Bas van der Vlies
This is maybe a little off topic, but I saw some posts on this list about Slurm installations. We at SURF(sara) have developed a services framework for CFEngine3 that can install, configure, and maintain services with the aid of templates (mustache) and JSON files. Everything can be configured with external data

Re: [slurm-users] CPU allocation for the GPU jobs.

2020-07-13 Thread navin srivastava
Thanks Renfro. My scheduling policy is below.

SchedulerType=sched/builtin
SelectType=select/cons_res
SelectTypeParameters=CR_Core
AccountingStorageEnforce=associations
AccountingStorageHost=192.168.150.223
AccountingStorageType=accounting_storage/slurmdbd
ClusterName=hpc
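
For anyone who wants to check settings like these programmatically, here is a minimal sketch that splits whitespace-separated key=value settings into a dictionary. The function name parse_slurm_settings is hypothetical and not part of Slurm. The sketch assumes simple single-valued settings like the ones quoted in this message; it does not handle comments, include files, or multi-field lines such as NodeName definitions.

```python
def parse_slurm_settings(text):
    """Split whitespace-separated key=value tokens into a dict.

    Hypothetical helper for illustration only; it is not a full
    slurm.conf parser.
    """
    settings = {}
    for token in text.split():
        key, sep, value = token.partition("=")
        if sep:  # skip tokens that contain no "="
            settings[key] = value
    return settings


# Settings copied from the message above.
quoted = (
    "SchedulerType=sched/builtin SelectType=select/cons_res "
    "SelectTypeParameters=CR_Core AccountingStorageEnforce=associations"
)
settings = parse_slurm_settings(quoted)
print(settings["SchedulerType"])
```

With the values from this message, the lookup shows that the cluster is configured with sched/builtin rather than sched/backfill.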

Re: [slurm-users] CPU allocation for the GPU jobs.

2020-07-13 Thread Renfro, Michael
“The SchedulerType configuration parameter specifies the scheduler plugin to use. Options are sched/backfill, which performs backfill scheduling, and sched/builtin, which attempts to schedule jobs in a strict priority order within each partition/queue.”
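
To make the difference described in that quote concrete, here is a toy sketch, not Slurm's actual algorithm. It assumes a single partition and a single pool of free CPUs, and it ignores time limits. Real backfill only starts a lower-priority job if doing so does not delay the expected start of higher-priority jobs, which this sketch does not model. The Job class and both function names are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    cpus: int
    priority: int


def strict_priority(jobs, free_cpus):
    """Start jobs in priority order; stop at the first job that does not fit."""
    started = []
    for job in sorted(jobs, key=lambda j: -j.priority):
        if job.cpus > free_cpus:
            break  # every lower-priority job waits behind the blocked job
        free_cpus -= job.cpus
        started.append(job.name)
    return started


def simple_backfill(jobs, free_cpus):
    """Walk jobs in priority order, but let jobs that fit skip a blocked one."""
    started = []
    for job in sorted(jobs, key=lambda j: -j.priority):
        if job.cpus <= free_cpus:
            free_cpus -= job.cpus
            started.append(job.name)
    return started


# Hypothetical queue: a high-priority job that needs more CPUs than are free,
# followed by a smaller, lower-priority job.
queue = [Job("big", 8, 100), Job("small", 2, 50)]
print(strict_priority(queue, free_cpus=4))
print(simple_backfill(queue, free_cpus=4))
```

In this toy model the strict-order pass starts nothing, because the highest-priority job blocks the queue, while the backfill-style pass starts the small job on the idle CPUs.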

[slurm-users] Is preempt_reorder_count compatible with preempt_strict_order ?

2020-07-13 Thread Marc Odunlami
Hello, I would like to know if "preempt_reorder_count=#" is really used when "preempt_strict_order" is set. In my understanding:
* "preempt_reorder_count=#" sets the number of iterations used to reorder the preemption job list based on job size. This makes it possible to minimize the number of preempted

[slurm-users] CPU allocation for the GPU jobs.

2020-07-13 Thread navin srivastava
Hi Team, We have separate partitions for the GPU nodes and the CPU-only nodes. Scenario: the jobs submitted in our environment are 4CPU+1GPU as well as 4CPU only, in nodeGPUsmall and nodeGPUbig. So when all the GPUs are exhausted and the remaining jobs are in the queue waiting for the availability of GPU

Re: [slurm-users] squeue reports ReqNodeNotAvail but node is available

2020-07-13 Thread Ole Holm Nielsen
Hi Janna, If you're running an old Slurm version, there may be bugs already resolved in the later versions. You can search for bugs with ReqNodeNotAvail in the title: https://bugs.schedmd.com/buglist.cgi?quicksearch=ReqNodeNotAvail For example, this one might be relevant: