[slurm-dev] Re: Query number of cores allocated per node for a job

2016-10-26 Thread Kaizaad Bilimorya
not Linux/hardware CPU ids (as reported by, for example, /proc/cpuinfo). http://slurm.schedmd.com/cpu_management.html#Section2 So if you want actually core ids, I guess you will have to query the cgroup hierarchy. hth -k -- Kaizaad Bilimorya Systems Administrator - SHARCNET | http://www.s

[slurm-dev] gres.conf and CPUs

2016-03-08 Thread Kaizaad Bilimorya
Hello, We have slurm 15.08.8 running and seemed to have run into an issue with GPUs and the CPUs that they are suppose to have access to. Our nodes are 2 socket with each socket having 4 GPU devices. We would like to limit which CPUs each GPU device has access to. According to the gres.conf

[slurm-dev] Re: Using another database host

2015-12-04 Thread Kaizaad Bilimorya
What is "ControlMachine" set to? -k On Fri, 4 Dec 2015, Vladislav Falfushinsky wrote: Dear comnunity, I am trying to move slurm database from one host to another. I am not using slurmdbd service. Before slurm controller had been on one host with database. Now when I installed a database on

[slurm-dev] --ntasks 16 --cpus-per-task 8 results in error: _compute_c_b_task_dist oversubscribe for job

2015-09-01 Thread Kaizaad Bilimorya
think the issue here is the "SLURM_TASKS_PER_NODE" which seems to be calculated by the scheduler. It assigns a node that only has 20 cores (mos20) and hence we see the "_compute_c_b_task_dist oversubscribe for job" error. thanks -k -- Kaizaad Bilimorya Systems Administrat