Hi!

We have a problem with task/affinity and srun --exclusive.

If I submit a job with sbatch that runs srun --exclusive, the output of hwloc-bind --get suggests that the cores are allocated (and bound) before task/affinity gets a chance to distribute them according to the cpu_bind setting.

In the example below I use 'sbatch --exclusive' and get 48 cores in total.

srun -n1 -c6 --cpu_bind=rank_ldom sh -c "hwloc-bind --get | ./hex2bin"
results in:
000000 000000 000000 000000 000000 000000 000000 111111  = 0x0000003f

srun -n1 -c6 --exclusive --cpu_bind=rank_ldom sh -c "hwloc-bind --get | ./hex2bin"
results in:
000000 000000 000001 000001 000001 000001 000001 000001  = 0x41041041

This also looks like the bitmask that task/affinity gets from Slurm.
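
For anyone who wants to reproduce the output format above, here is a minimal sketch of what a helper like ./hex2bin could look like. It is not the actual script used here. The names parse_hwloc_mask and format_mask, the 48-core width, and the grouping into 6 cores per locality domain are assumptions taken from the example output. It also assumes that wider masks arrive as comma-separated 32-bit hex chunks with the most significant chunk first, and that the mask fits in 48 bits.

```python
import sys


def parse_hwloc_mask(text):
    """Turn a mask such as '0x0000003f' into an integer.

    Assumption: masks wider than 32 bits are written as comma-separated
    32-bit chunks, most significant first (e.g. '0x0000ffff,0xffffffff').
    """
    value = 0
    for chunk in text.strip().split(","):
        value = (value << 32) | int(chunk, 16)
    return value


def format_mask(text, total_cores=48, group_size=6):
    """Render the mask as binary digits, one space-separated group per domain.

    total_cores and group_size are assumptions matching the node in this
    example (48 cores, 6 cores per locality domain). The lowest-numbered
    core is the rightmost digit.
    """
    value = parse_hwloc_mask(text)
    bits = format(value, "0{}b".format(total_cores))
    groups = [bits[i:i + group_size] for i in range(0, len(bits), group_size)]
    return " ".join(groups)


if __name__ == "__main__":
    # Read one mask per line from stdin, as in: hwloc-bind --get | ./hex2bin
    for line in sys.stdin:
        if line.strip():
            print(format_mask(line), " =", line.strip())
```

Read this way, the first mask has all six cores of the lowest domain set, while the second has one core set in each of six different domains.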

Best regards,
Magnus

--
Magnus Jonsson, Developer, HPC2N, Umeå Universitet


