Hi Bibrak,

The message about malloc looks like a MX message. Which interconnects did you 
compile support for?

If you are using MX, does it appear when you run with:

$ mpirun --mca pml cm -np 4 ./exec 98

which uses the MX MTL instead of MX BTL.

Scott

On Jul 18, 2010, at 9:23 AM, Bibrak Qamar wrote:

> Hello,
> 
> I have developed a code which I tested on MPICH2, it working fine.
> 
> But when I compile and run it with OpenMPI, its not working.
> 
> The result of the program with the errors by OpenMPI is below ..
> 
> --------------
> 
> 
> bibrak@barq:~/XXX> mpirun -np 4 ./exec 98
> 
> 
> warning:regcache incompatible with malloc
> warning:regcache incompatible with malloc
> warning:regcache incompatible with malloc
> warning:regcache incompatible with malloc
> Send count -- >> 25 
> Send count -- >> 25 
> Send count -- >> 24 
> Send count -- >> 24 
> Dis -- >> 0 
> Dis -- >> 25 
> Dis -- >> 50 
> Dis -- >> 74 
> 
> 
> 
> 
>  0 d[0] = -14.025975 
>  1 d[0] = -14.025975 
> -- 1 -- 
>  2 d[0] = -14.025975 
> -- 2 -- 
> -- 0 -- 
>  3 d[0] = -14.025975 
>  --3 --
> [barq:27118] *** Process received signal ***
> [barq:27118] Signal: Segmentation fault (11)
> [barq:27118] Signal code: Address not mapped (1)
> [barq:27118] Failing at address: 0x51681f96
> [barq:27121] *** Process received signal ***
> [barq:27121] Signal: Segmentation fault (11)
> [barq:27121] Signal code: Address not mapped (1)
> [barq:27121] Failing at address: 0x77b5685
> [barq:27118] [ 0] [0xffffe410]
> [barq:27118] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7d20f3c]
> [barq:27118] [ 2] ./exec(main+0x2214) [0x804ad8d]
> [barq:27118] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7cc9705]
> [barq:27121] [ 0] [0xffffe410]
> [barq:27121] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7d0ef3c]
> [barq:27121] [ 2] ./exec(main+0x2214) [0x804ad8d]
> [barq:27121] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7cb7705]
> [barq:27121] [ 4] ./exec [0x8048b01]
> [barq:27121] *** End of error message ***
> [barq:27118] [ 4] ./exec [0x8048b01]
> [barq:27118] *** End of error message ***
> --------------------------------------------------------------------------
> mpirun noticed that process rank 3 with PID 27121 on node barq exited on 
> signal 11 (Segmentation fault).
> --------------------------------------------------------------------------
> [barq:27120] *** Process received signal ***
> [barq:27120] Signal: Segmentation fault (11)
> [barq:27120] Signal code: Address not mapped (1)
> [barq:27120] Failing at address: 0x4bd1ca3e
> [barq:27120] [ 0] [0xffffe410]
> [barq:27120] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7c97f3c]
> [barq:27120] [ 2] ./exec(main+0x2214) [0x804ad8d]
> [barq:27120] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7c40705]
> [barq:27120] [ 4] ./exec [0x8048b01]
> [barq:27120] *** End of error message ***
> 
> 
> ----------------
> 
> Because of the warning:regcache incompatible with malloc warning I did 
> >  bibrak@barq:~/XXX> export MX_RCACHE=2
> 
> And now ignored the warning, but the error still remains
> 
> I shall appreciate any help.
> 
> Bibrak Qamar
> NUST-SEECS
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users


Reply via email to