Hi Bibrak,
The message about malloc looks like a MX message. Which interconnects did you
compile support for?
If you are using MX, does it appear when you run with:
$ mpirun --mca pml cm -np 4 ./exec 98
which uses the MX MTL instead of MX BTL.
Scott
On Jul 18, 2010, at 9:23 AM, Bibrak Qamar wrote:
> Hello,
>
> I have developed a code which I tested on MPICH2, it working fine.
>
> But when I compile and run it with OpenMPI, its not working.
>
> The result of the program with the errors by OpenMPI is below ..
>
> --
>
>
> bibrak@barq:~/XXX> mpirun -np 4 ./exec 98
>
>
> warning:regcache incompatible with malloc
> warning:regcache incompatible with malloc
> warning:regcache incompatible with malloc
> warning:regcache incompatible with malloc
> Send count -- >> 25
> Send count -- >> 25
> Send count -- >> 24
> Send count -- >> 24
> Dis -- >> 0
> Dis -- >> 25
> Dis -- >> 50
> Dis -- >> 74
>
>
>
>
> 0 d[0] = -14.025975
> 1 d[0] = -14.025975
> -- 1 --
> 2 d[0] = -14.025975
> -- 2 --
> -- 0 --
> 3 d[0] = -14.025975
> --3 --
> [barq:27118] *** Process received signal ***
> [barq:27118] Signal: Segmentation fault (11)
> [barq:27118] Signal code: Address not mapped (1)
> [barq:27118] Failing at address: 0x51681f96
> [barq:27121] *** Process received signal ***
> [barq:27121] Signal: Segmentation fault (11)
> [barq:27121] Signal code: Address not mapped (1)
> [barq:27121] Failing at address: 0x77b5685
> [barq:27118] [ 0] [0xe410]
> [barq:27118] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7d20f3c]
> [barq:27118] [ 2] ./exec(main+0x2214) [0x804ad8d]
> [barq:27118] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7cc9705]
> [barq:27121] [ 0] [0xe410]
> [barq:27121] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7d0ef3c]
> [barq:27121] [ 2] ./exec(main+0x2214) [0x804ad8d]
> [barq:27121] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7cb7705]
> [barq:27121] [ 4] ./exec [0x8048b01]
> [barq:27121] *** End of error message ***
> [barq:27118] [ 4] ./exec [0x8048b01]
> [barq:27118] *** End of error message ***
> --
> mpirun noticed that process rank 3 with PID 27121 on node barq exited on
> signal 11 (Segmentation fault).
> --
> [barq:27120] *** Process received signal ***
> [barq:27120] Signal: Segmentation fault (11)
> [barq:27120] Signal code: Address not mapped (1)
> [barq:27120] Failing at address: 0x4bd1ca3e
> [barq:27120] [ 0] [0xe410]
> [barq:27120] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7c97f3c]
> [barq:27120] [ 2] ./exec(main+0x2214) [0x804ad8d]
> [barq:27120] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7c40705]
> [barq:27120] [ 4] ./exec [0x8048b01]
> [barq:27120] *** End of error message ***
>
>
>
>
> Because of the warning:regcache incompatible with malloc warning I did
> > bibrak@barq:~/XXX> export MX_RCACHE=2
>
> And now ignored the warning, but the error still remains
>
> I shall appreciate any help.
>
> Bibrak Qamar
> NUST-SEECS
> ___
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users