On 20.01.2016 at 11:00, Benjamin Redling wrote:
On 19.01.2016 at 20:37, Andrus, Brian Contractor wrote:
I am testing Slurm as a replacement for our torque/moab setup here.
The issue I have is getting all our node names into the NodeName
and PartitionName entries.
In our cluster, we name our nodes compute-<rack>-<row>.
That seems to be problem enough given what Slurm's range syntax can
express, but it is compounded by the fact that folks racked the nodes
leaving 1U of space in between.
So I have compute-1-[1,3,5,7,9,11...41]
Why not simply use a comma-separated list _generated_ from your
inventory / DNS / /etc/hosts / etc.?
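A minimal sketch of that idea, assuming the node names can be read from
/etc/hosts-style text and all start with "compute-". The function names
and the prefix default are made up for illustration, not part of Slurm:

```python
import re


def node_names_from_hosts(hosts_text, prefix="compute-"):
    """Collect short host names starting with `prefix` from /etc/hosts-style text."""
    names = set()
    for line in hosts_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        fields = line.split()
        # fields[0] is the address; the remaining fields are names and aliases
        for name in fields[1:]:
            short = name.split(".", 1)[0]  # strip any domain part
            if short.startswith(prefix):
                names.add(short)

    def natural_key(name):
        # Sort embedded numbers numerically: compute-1-3 before compute-1-11
        return [int(p) if p.isdigit() else p for p in re.split(r"(\d+)", name)]

    return sorted(names, key=natural_key)


def nodename_line(names):
    """Build a comma-separated NodeName= entry (hypothetical helper)."""
    return "NodeName=" + ",".join(names)
```

Feeding it the contents of /etc/hosts, e.g.
nodename_line(node_names_from_hosts(open("/etc/hosts").read())), gives a
line to paste into slurm.conf; any further node parameters would still
have to be appended by hand or by the generator.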
< --- 8< --->
P.S.
I totally forgot: you can configure a NodeName that differs from the
node's NodeHostname:
<quote>
Node names can have up to three name specifications: NodeName is the
name used by all Slurm tools when referring to the node, NodeAddr is the
name or IP address Slurm uses to communicate with the node, and
NodeHostname is the name returned by the command /bin/hostname -s. Only
NodeName is required (the others default to the same name), although
supporting all three parameters provides complete control over naming
and addressing the nodes. See the slurm.conf man page for details on all
configuration parameters.
</quote>
But I wouldn't do that. IMHO, when a node misbehaves it is just one
more level of indirection, which makes finding the culprit cumbersome.
Then again, my host names don't depend on rack units.
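For completeness, here is a rough sketch of how the NodeName /
NodeHostname split from the quote could be generated. The sequential
"nodeNN" naming scheme and the function name are assumptions made for
illustration; only the NodeName, NodeHostname and NodeAddr parameters
come from the Slurm documentation:

```python
def node_definitions(hostnames, name_format="node{:02d}"):
    """Map each real host name to a sequential Slurm node name.

    Returns one slurm.conf-style line per host. The naming scheme
    (node01, node02, ...) is a hypothetical choice for this example.
    """
    lines = []
    for index, hostname in enumerate(hostnames, start=1):
        name = name_format.format(index)
        # NodeAddr is set to the real host name here; an IP would also work
        lines.append(
            f"NodeName={name} NodeHostname={hostname} NodeAddr={hostname}"
        )
    return lines


if __name__ == "__main__":
    # Nodes racked in every other unit, as in the original question
    hosts = [f"compute-1-{unit}" for unit in range(1, 42, 2)]
    print("\n".join(node_definitions(hosts)))
```

With contiguous Slurm names the partition entry can use a plain range,
at the price of the extra indirection mentioned above.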
Regards, Benjamin
--
FSU Jena | JULIELab.de/Staff/Benjamin+Redling.html
vox: +49 3641 9 44323 | fax: +49 3641 9 44321