yes, that's exactly the problem. I assumed salloc was used to run an
interactive job but I was wrong. I guess I have to use srun for that.

I am still trying to figure how to do the user guide to explain when
to use salloc and then srun instead of srun in the first place.

Thanks a bunch for your help
Eva

On Fri, 12 Sep 2014, Bruce Roberts wrote:

>
> I think the problem you have here is the salloc you ran doesn't
> automatically send you to a node in your allocation.  I am gussing you
> ran your salloc from hpcdev-005 that is why hostname by itself returns
> that.  If you ran srun hostname inside of your salloc you would get on
> the node in your allocation.
>
> On 09/11/2014 03:44 PM, Eva Hocks wrote:
> >
> >
> > I am trying to configure the latest slurm 14.03 and am running into
> > problem to prevent slurm from running jobs on the control node.
> >
> > sinfo shows 3 nodes configure in the slurm.conf:
> > active       up    2:00:00      1  down* hpc-0-5
> > active       up    2:00:00      1    mix hpc-0-4
> > active       up    2:00:00      1   idle hpc-0-6
> >
> >
> > but when I use salloc I end up on the head node
> >
> >
> > $ salloc -N 1 -p active sh
> > salloc: Granted job allocation 16
> > sh-4.1$ hostname
> > hpcdev-005.sdsc.edu
> >
> >
> > That node is not part of the "active" partition but slurm still uses it.
> > How? The allocation btw is for  NodeList=hpc-0-4
> > and the user can login to that node without a problem but slurm doesn't
> > run the sh on that node for the user.
> >
> > Also how can a user find out what nodes are allocated without having to
> > run the scontrol command? Is there an option in salloc to return the
> > host names?
> >
> > Thanks
> > Eva
> >
>

Reply via email to