Thanks for the link - there's some great info there!
It appears that the issue we saw is due to the fact that
/etc/rc.d/rc.local (where HT is disabled) runs *after* slurmd has probed
the hardware. I used a simple hack to test this hypothesis: I added a
"service slurm restart" at the end of rc.local. We have not seen any
issues since.
So there appear to be three options:
1) Disable HT as part of the init sequence, but make sure that this
happens before slurmd is started.
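For the record, here's a rough, untested sketch of what that could look like (assuming a Linux sysfs CPU topology and root privileges, run early in the init sequence before slurmd; the siblings_to_offline helper is my own, not a standard tool). It prints the planned changes by default; set APPLY=1 to actually offline the sibling threads:

```shell
#!/bin/sh
# Sketch: offline hyperthread siblings via sysfs before slurmd starts.
# Dry-run by default; APPLY=1 performs the writes (root required).

# Given a thread_siblings_list value such as "0,24" or "0-1", print
# every CPU id except the first -- those are the HT siblings.
siblings_to_offline() {
    echo "$1" | tr ',' '\n' | awk -F- '
        NF == 2 { for (i = $1; i <= $2; i++) print i; next }
        { print }
    ' | tail -n +2
}

for cpu in /sys/devices/system/cpu/cpu[0-9]*; do
    list="$cpu/topology/thread_siblings_list"
    [ -r "$list" ] || continue
    sibs=$(cat "$list")
    # Only process each sibling group once, from its first member.
    [ "${cpu##*cpu}" = "${sibs%%[,-]*}" ] || continue
    for sib in $(siblings_to_offline "$sibs"); do
        if [ "${APPLY:-0}" = 1 ]; then
            echo 0 > "/sys/devices/system/cpu/cpu$sib/online"
        else
            echo "would offline cpu$sib"
        fi
    done
done
```

Once the siblings are offline, (re)starting slurmd makes it probe the reduced topology, which is the ordering that bit us with rc.local.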
2) Disable in the BIOS. On a Dell PowerEdge running CentOS, this can
be done with Dell OpenManage. You don't have to install all 67 RPMs in
the current srvadmin-all package, BTW. You only really need
srvadmin-server-cli:
#!/bin/sh -e
# Set up Dell's repo, install just the CLI tools, start the
# srvadmin services, and disable logical processor (HT) support.
# The BIOS change takes effect on the next reboot.
wget -q -O - http://linux.dell.com/repo/hardware/dsu/bootstrap.cgi | bash
yum install -y srvadmin-server-cli
/opt/dell/srvadmin/sbin/srvadmin-services.sh start || true
/opt/dell/srvadmin/sbin/omconfig chassis biossetup \
    attribute=logicalproc setting=disabled
To remove:
/opt/dell/srvadmin/sbin/srvadmin-services.sh stop
yum erase -y srvadmin\*
3) While the noht option in a grub.conf kernel entry has been removed
from recent Linux systems, one can apparently achieve the same effect
using maxcpus=<number of real cores>. I suspect grub.conf
customizations like this won't survive a Yum kernel update, though, so
it would take some additional cleverness to keep grub.conf up-to-date
and avoid unpleasant surprises.
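One way around the kernel-update problem might be grubby, which rewrites every installed boot entry in one shot. A sketch (the core-counting awk is my own and assumes an x86-style /proc/cpuinfo with "physical id" and "core id" lines; the grubby line is left commented out so the snippet only prints the computed value):

```shell
# Count unique (physical id, core id) pairs = number of real cores.
real_cores=$(( $(awk -F': ' '/^physical id/ {p = $2}
                             /^core id/     {print p ":" $2}' /proc/cpuinfo \
                  | sort -u | wc -l) ))
echo "maxcpus=${real_cores}"
# Apply to all installed kernel entries (run as root):
# grubby --update-kernel=ALL --args="maxcpus=${real_cores}"
```

This would still need re-running (or hooking into the kernel %post) if grubby's behavior differs across releases, so I'd treat it as a starting point rather than a turnkey fix.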
It looks like disabling it in the BIOS is the cleanest solution, so
we'll probably go with that.
Regards,
Jason
On 05/18/16 11:38, Davide Vanzo wrote:
The thing is that disabling HT via the OS may not be the same as disabling it
via the BIOS, as you can see in this thread:
https://software.intel.com/en-us/forums/software-tuning-performance-optimization-platform-monitoring/topic/480007
Moreover, I wouldn't be surprised if hwloc (which SLURM uses for
affinity binding) were "insensitive" to OS-disabled HT. However,
when you disable it via the BIOS there is no ambiguity.
Davide
On Wed, 2016-05-18 at 07:58 -0700, Jason Bacon wrote:
No, opted against that in case we want to experiment with hyperthreading
in the future without having to reboot.
How might that affect SLURM?
Thanks,
JB
On 05/18/16 09:24, Davide Vanzo wrote:
Jason, have you tried disabling HT from the BIOS instead of doing it from
the OS?
Davide
On Wed, 2016-05-18 at 06:02 -0700, Jason Bacon wrote:
Just leaving a trail for future Googlers. My colleague did an
extensive search for answers and came up empty.
We ran into an issue after disabling hyperthreading on one of our
CentOS clusters. Here's the scenario:
- Our compute nodes had hyperthreading enabled while we evaluated the
costs and benefits.
- SLURM was configured to schedule only one job per real core. For
example, nodes with 24 cores / 48 virtual are configured as follows:
NodeName=compute-[029-083] RealMemory=64000 Sockets=2
CoresPerSocket=12 ThreadsPerCore=1 State=UNKNOWN
- I added a command to /etc/rc.d/rc.local to disable hyperthreading on
the next reboot.
- No changes were made to slurm.conf.
- After rebooting with hyperthreading disabled, certain jobs landing
on the node would fail with the following error:
slurmstepd: Failed task affinity setup
- Restarting the scheduler cleared up the issue.
Does anybody know what would cause this? My best hypothesis is that
slurmctld is caching some probed hardware info from slurmd that
changed when hyperthreading was disabled.
Cheers,
Jason
--
All wars are civil wars, because all men are brothers ... Each one owes
infinitely more to the human race than to the particular country in
which he was born.
-- Francois Fenelon