You have found what we found (also in other areas of OpenMPI) – that Slurm
has some “interesting” behaviors.

 

If it was easy, anyone could do it …

 

Ken

==========================

Kenneth A. Lloyd, Jr.

CEO - Director, Systems Science

Watt Systems Technologies Inc.

 

 

From: hwloc-users [mailto:hwloc-users-boun...@open-mpi.org] On Behalf Of
Brice Goglin
Sent: Wednesday, May 28, 2014 7:01 AM
To: Craig Kapfer; Hardware locality user list
Subject: Re: [hwloc-users] node configuration differs form hardware

 

Le 28/05/2014 14:57, Craig Kapfer a écrit :

 


Hmm ... the slurm config defines that all nodes have 4 sockets with 16 cores
per socket (which corresponds to the hardware--all nodes are the same).
Slurm node config is as follows:

 

NodeName=n[001-008] RealMemory=258452 Sockets=4 CoresPerSocket=16
ThreadsPerCore=1 State=UNKNOWN Port=[17001-17008]

 

But we get this error--so I suspect it's a parsing error on the slurm side?


No, it's slurm properly reading info from hwloc, but that info doesn't match
the actual hardware because the BIOS is buggy.

Brice

Reply via email to