Re: [hwloc-users] hwloc errors on program startup

2014-01-17 Thread Doug Roberts
The cluster is running CentOS 6.3 (2.6.32-279.5.2) and will be updated to the latest CentOS 6.5 (2.6.32-431.3.1) kernel towards the end of next week. I will reply to let you know if it worked, thanks very much! -Doug On Fri, 17 Jan 2014, hwloc-users-requ...@open-mpi.org wrote: Hello, Li

Re: [hwloc-users] hwloc errors on program startup

2014-01-17 Thread Brice Goglin
Hello, Linux says socket 0 contains processors 0-7 and socket 1 contains 8-15, while NUMA node 0 contains processors 0-3+8-11 and NUMA node 1 contains processors 4-7+12-15. Given what I read about Opteron 6320 online, the problem is that NUMA 0 should be replaced with two NUMA nodes with processors
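
To make the inconsistency described above concrete, here is a minimal Python sketch that takes the CPU lists quoted in the message and reports every socket/NUMA-node pair whose processor sets overlap without one containing the other. The helper names (parse_cpulist, find_conflicts) are hypothetical, and this is only an illustration of the nesting problem, not how hwloc itself performs its check.

```python
# Illustrative sketch only: helper names are hypothetical, and the CPU lists
# are the ones quoted in the message above.

def parse_cpulist(text):
    """Expand a Linux-style CPU list such as "0-3,8-11" into a set of ints."""
    cpus = set()
    for part in text.split(","):
        lo, _, hi = part.partition("-")
        cpus.update(range(int(lo), int(hi or lo) + 1))
    return cpus


def find_conflicts(sockets, numa_nodes):
    """Return (socket, node) pairs whose CPU sets overlap without nesting."""
    conflicts = []
    for s, s_cpus in sockets.items():
        for n, n_cpus in numa_nodes.items():
            overlap = s_cpus & n_cpus
            nested = s_cpus <= n_cpus or n_cpus <= s_cpus
            if overlap and not nested:
                conflicts.append((s, n))
    return conflicts


# What Linux reports on the affected nodes, per the message.
sockets = {0: parse_cpulist("0-7"), 1: parse_cpulist("8-15")}
numa_nodes = {0: parse_cpulist("0-3,8-11"), 1: parse_cpulist("4-7,12-15")}

print(find_conflicts(sockets, numa_nodes))
```

With the reported values, every socket partially overlaps every NUMA node, so no consistent tree of objects can be built from them. A layout in which each NUMA node sits entirely inside one socket would produce an empty list.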

[hwloc-users] hwloc errors on program startup

2014-01-17 Thread Doug Roberts
1) We are getting hwloc topology errors when programs start up on some new compute nodes recently added to our cluster ... [roberpj@bro127:~/samples/mpi_test] /opt/sharcnet/openmpi/1.6.5/intel/bin/mpirun -np 2 --mca btl tcp,sm,self --host bro127,bro127 ./a.out ***