Hello
So processor sets are not taken into account when Solaris reports
topology information in kstat etc.
Do you know if hwloc can query processor sets from the C interface?
If so, we could apply the processor set mask to hwloc object cpusets
during discovery to avoid your error.
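
To make the idea concrete, here is a minimal sketch in plain C of the condition the error message describes and of what masking would do to it. It is not hwloc's actual code: the helper names `intersects_without_inclusion` and `restrict_to_pset` are hypothetical, cpusets are simplified to 64-bit masks, and the processor set mask used in the usage note is made up for illustration (it is not the psrset from the report).

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper: true when two cpusets share at least one CPU
 * but neither one fully contains the other. This is the situation the
 * "intersects ... without inclusion" message complains about. */
static bool intersects_without_inclusion(uint64_t a, uint64_t b)
{
    uint64_t common = a & b;
    return common != 0 && common != a && common != b;
}

/* Hypothetical helper: keep only the CPUs of an object's cpuset that
 * are also present in the processor set mask. */
static uint64_t restrict_to_pset(uint64_t object_cpuset, uint64_t pset_mask)
{
    return object_cpuset & pset_mask;
}
```

With the values from the report, `intersects_without_inclusion(0x00001001, 0x0003c001)` is true, because the two masks share only CPU 0. If both cpusets were first restricted with an illustrative mask such as `0x00000fff`, each would reduce to `0x00000001` and the check would no longer fire.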
Brice




On 05/01/2016 10:18, Karl Behler wrote:
> There was a processor set defined (command psrset) on this machine.
> Having removed the psrset, hwloc-info produces a result without error
> messages:
>
> hwloc-info -v
> depth 0:        1 Machine (type #1)
>  depth 1:       2 NUMANode (type #2)
>   depth 2:      2 Package (type #3)
>    depth 3:     12 Core (type #5)
>     depth 4:    24 PU (type #6)
>
> It seems the concept of defining a psrset is in contradiction to what
> hwloc and/or openmpi expects/allows.
>
>
> On 04.01.16 18:16, Karl Behler wrote:
>> We used to run our MPI application with the SUNWhpc implementation
>> from Sun/Oracle. (This was derived from openmpi 1.5.)
>> However, the Oracle HPC implementation fails for the new Solaris 11.3
>> platform.
>> So we downloaded and made openmpi 1.10.1 on this platform from scratch.
>>
>> All seems fine and a simple test application runs fine.
>> However, with the real application we are running into a hwloc problem.
>>
>> So we also downloaded and made the hwloc package 1.11.2.
>>
>> Now examining hardware locality we get the following error:
>>
>> hwloc-info -v --whole-io
>> ****************************************************************************
>> * hwloc 1.11.2 has encountered what looks like an error from the operating system.
>> *
>> * Core (P#0 cpuset 0x00001001) intersects with NUMANode (P#1 cpuset 0x0003c001) without inclusion!
>> * Error occurred in topology.c line 1046
>> *
>> * The following FAQ entry in the hwloc documentation may help:
>> *   What should I do when hwloc reports "operating system" warnings?
>> * Otherwise please report this error message to the hwloc user's mailing list,
>> * along with any relevant topology information from your platform.
>> ****************************************************************************
>>
>> depth 0:        1 Machine (type #1)
>>  depth 1:       2 Package (type #3)
>>   depth 2:      2 NUMANode (type #2)
>>    depth 3:     1 Core (type #5)
>>     depth 4:    24 PU (type #6)
>>
>> Since I could not find the mentioned FAQ topic I'm asking the list
>> for advice.
>>
>> Our system runs Oracle Solaris 11.3 (latest patch level) on an
>> Intel hardware platform from Sun.
>>
>> output of uname -a -> SunOS sxaug28 5.11 11.3 i86pc i386 i86pc
>> output of psrinfo -v ->
>>
>> Status of virtual processor 0 as of: 01/04/2016 17:10:17
>>   on-line since 01/04/2016 14:44:28.
>>   The i386 processor operates at 1600 MHz,
>>         and has an i387 compatible floating point processor.
>> Status of virtual processor 1 as of: 01/04/2016 17:10:17
>>   on-line since 01/04/2016 14:45:10.
>>   The i386 processor operates at 1600 MHz,
>>         and has an i387 compatible floating point processor.
>> .
>> . (similar lines removed)
>> .
>> Status of virtual processor 23 as of: 01/04/2016 17:10:17
>>   on-line since 01/04/2016 14:45:11.
>>   The i386 processor operates at 1600 MHz,
>>         and has an i387 compatible floating point processor.
>>
>> The following script was used to build hwloc (compiler: Sunstudio
>> 12.4; see config.log in the bz2 attachment):
>>
>> setenv CFLAGS "-m64 -xtarget=generic -xarch=sse2 -xprefetch -xprefetch_level=2 -xvector=simd -xdepend=yes -xbuiltin=%all -xO5"
>> setenv CXXFLAGS "$CFLAGS"
>> setenv FCFLAGS "-m64 -xtarget=generic -xarch=sse2 -xprefetch -xprefetch_level=2 -xvector=simd -stackvar -xO5"
>> setenv FFLAGS "$FCFLAGS"
>> setenv PREFIX /usr/openmpi/hwloc-1.11.2
>> ./configure --prefix=$PREFIX --disable-debug
>> dmake -j 12
>> # as root: make install
>> #        : cp -p config.status $PREFIX/config.status
>>
>> Any advice much appreciated.
>>
>> Karl
>>
>>
>> _______________________________________________
>> hwloc-users mailing list
>> hwloc-us...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/hwloc-users
>> Searchable archives: 
>> http://www.open-mpi.org/community/lists/hwloc-users/2016/01/1236.php
>
>
> -- 
> Dr. Karl Behler       
> CODAC & IT services ASDEX Upgrade
> phon +49 89 3299-1351 fax 3299-961351
>
>
>