Hi,

I updated openmpi from version 1.5.4 to 1.5.5.
Then, an execution speed of my application becomes quite slower than
before,
due to wrong core bindings. As far as I checked, it seems that
openmpi-1.5.4
gives correct core bindings for my magnycore based machine.

1) my script is as follows:
export OMP_NUM_THREADS=4
mpirun -machinefile pbs_hosts \
       -np 8 \
       -x OMP_NUM_THREADS \
       -bind-to-core \
       -cpus-per-proc ${OMP_NUM_THREADS} \
       -report-bindings \
       ./Solver

2)binding reports are as follows:
openmpi-1.5.4:
[node03.cluster:21706] [[55518,0],0] odls:default:fork binding child
[[55518,1],3] to cpus 22220000
[node03.cluster:21706] [[55518,0],0] odls:default:fork binding child
[[55518,1],4] to cpus 4444
[node03.cluster:21706] [[55518,0],0] odls:default:fork binding child
[[55518,1],5] to cpus 44440000
[node03.cluster:21706] [[55518,0],0] odls:default:fork binding child
[[55518,1],6] to cpus 8888
[node03.cluster:21706] [[55518,0],0] odls:default:fork binding child
[[55518,1],7] to cpus 88880000
[node03.cluster:21706] [[55518,0],0] odls:default:fork binding child
[[55518,1],0] to cpus 1111
[node03.cluster:21706] [[55518,0],0] odls:default:fork binding child
[[55518,1],1] to cpus 11110000
[node03.cluster:21706] [[55518,0],0] odls:default:fork binding child
[[55518,1],2] to cpus 2222
openmpi-1.5.5:
[node03.cluster:04706] [[40566,0],0] odls:default:fork binding child
[[40566,1],3] to cpus f000
[node03.cluster:04706] [[40566,0],0] odls:default:fork binding child
[[40566,1],4] to cpus 000f0000
[node03.cluster:04706] [[40566,0],0] odls:default:fork binding child
[[40566,1],5] to cpus 00f00000
[node03.cluster:04706] [[40566,0],0] odls:default:fork binding child
[[40566,1],6] to cpus 0f000000
[node03.cluster:04706] [[40566,0],0] odls:default:fork binding child
[[40566,1],7] to cpus f0000000
[node03.cluster:04706] [[40566,0],0] odls:default:fork binding child
[[40566,1],0] to cpus 000f
[node03.cluster:04706] [[40566,0],0] odls:default:fork binding child
[[40566,1],1] to cpus 00f0
[node03.cluster:04706] [[40566,0],0] odls:default:fork binding child
[[40566,1],2] to cpus 0f00

3)node03 has 32 cores with 4 magnycores(8cores/cpu-type).

Regards,
Tetsuya Mishima

Reply via email to