Hi Eva,

if you don't want to use the controller node for jobs, the easiest way
is to not configure it as node at all. Meaning you don't need a line like

NodeName=hpc-0-5 RealMemory=....

for the controller.


A program/user can find out which nodes are allocated by looking into
the environment variables. Try running salloc and then

$ env | grep SLURM

Here is an example output:

SLURM_NODELIST=n523601
SLURM_NODE_ALIASES=(null)
SLURM_NNODES=1
SLURM_JOBID=6437
SLURM_TASKS_PER_NODE=40
SLURM_JOB_ID=6437
SLURM_SUBMIT_DIR=/nfs/admins/adm17
SLURM_JOB_NODELIST=n523601
SLURM_JOB_CPUS_PER_NODE=40
SLURM_SUBMIT_HOST=frontend
SLURM_JOB_PARTITION=foo
SLURM_JOB_NUM_NODES=1



Regards,

        Uwe



Am 12.09.2014 um 00:45 schrieb Eva Hocks:
> 
> 
> 
> I am trying to configure the latest slurm 14.03 and am running into
> problem to prevent slurm from running jobs on the control node.
> 
> sinfo shows 3 nodes configure in the slurm.conf:
> active       up    2:00:00      1  down* hpc-0-5
> active       up    2:00:00      1    mix hpc-0-4
> active       up    2:00:00      1   idle hpc-0-6
> 
> 
> but when I use salloc I end up on the head node
> 
> 
> $ salloc -N 1 -p active sh
> salloc: Granted job allocation 16
> sh-4.1$ hostname
> hpcdev-005.sdsc.edu
> 
> 
> That node is not part of the "active" partition but slurm still uses it.
> How? The allocation btw is for  NodeList=hpc-0-4
> and the user can login to that node without a problem but slurm doesn't
> run the sh on that node for the user.
> 
> Also how can a user find out what nodes are allocated without having to
> run the scontrol command? Is there an option in salloc to return the
> host names?
> 
> Thanks
> Eva
> 

Reply via email to