On 28/05/2014 15:46, Craig Kapfer wrote:
> Wait, I'm sorry, I must be missing something, please bear with me!
>
> By the way, your discussion of groups 1 and 2 below is wrong.
> Group 2 doesn't say that NUMA node == socket, and it doesn't
> report 8 sockets of 8 cores each. It reports 4 sockets containing
> 2 NUMA nodes each containing 8 cores each, and that's likely what
> you have here (AMD Opteron 6300 or 6200 processors?).
>
> Output of lstopo from nodes of both BIOS versions seem to indicate
> that there are 4 sockets, but slurm is reporting on numa nodes, no?
> If not, which version of the BIOS is correct?

Ah right, I misread group 1. Group 1 reports 4 sockets = 4 NUMA nodes containing 16 cores each. That's wrong. There are 2 NUMA nodes in each socket, and 8 cores in each NUMA node (instead of 1 NUMA node in each socket, and 16 cores in each NUMA node).

Slurm is indeed saying something wrong. I wonder if it confuses NUMA nodes and sockets, but I can't find any report of this on Google. On Intel that doesn't matter. On AMD it does.

Anyway, group 2 is correct, so its BIOS may be less buggy than group 1's. Which BIOS is more recent? Try updating the BIOS on one of the group 1 machines to see if that fixes the issue.

Brice
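
To make the comparison above concrete, here is a minimal Python sketch of the arithmetic. The `Layout` class and its field names are hypothetical, chosen only for illustration; they are not part of hwloc or Slurm. The numbers are the ones quoted in this thread for the two groups.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Layout:
    """Hypothetical description of a machine topology (illustration only)."""
    sockets: int
    numa_per_socket: int
    cores_per_numa: int

    @property
    def numa_nodes(self) -> int:
        # Total NUMA nodes in the machine.
        return self.sockets * self.numa_per_socket

    @property
    def cores(self) -> int:
        # Total cores in the machine.
        return self.numa_nodes * self.cores_per_numa


# Group 1 as reported: 1 NUMA node per socket, 16 cores per NUMA node.
g1 = Layout(sockets=4, numa_per_socket=1, cores_per_numa=16)
# Group 2 as reported: 2 NUMA nodes per socket, 8 cores per NUMA node.
g2 = Layout(sockets=4, numa_per_socket=2, cores_per_numa=8)

for name, layout in (("group 1", g1), ("group 2", g2)):
    print(f"{name}: {layout.cores} cores, {layout.numa_nodes} NUMA nodes")
```

Both layouts add up to the same number of sockets and cores, so a check on those totals alone cannot tell the two groups apart. Only the NUMA node count (4 versus 8) differs.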