Hi Bibrak, The message about malloc looks like a MX message. Which interconnects did you compile support for?
If you are using MX, does it appear when you run with: $ mpirun --mca pml cm -np 4 ./exec 98 which uses the MX MTL instead of MX BTL. Scott On Jul 18, 2010, at 9:23 AM, Bibrak Qamar wrote: > Hello, > > I have developed a code which I tested on MPICH2, it working fine. > > But when I compile and run it with OpenMPI, its not working. > > The result of the program with the errors by OpenMPI is below .. > > -------------- > > > bibrak@barq:~/XXX> mpirun -np 4 ./exec 98 > > > warning:regcache incompatible with malloc > warning:regcache incompatible with malloc > warning:regcache incompatible with malloc > warning:regcache incompatible with malloc > Send count -- >> 25 > Send count -- >> 25 > Send count -- >> 24 > Send count -- >> 24 > Dis -- >> 0 > Dis -- >> 25 > Dis -- >> 50 > Dis -- >> 74 > > > > > 0 d[0] = -14.025975 > 1 d[0] = -14.025975 > -- 1 -- > 2 d[0] = -14.025975 > -- 2 -- > -- 0 -- > 3 d[0] = -14.025975 > --3 -- > [barq:27118] *** Process received signal *** > [barq:27118] Signal: Segmentation fault (11) > [barq:27118] Signal code: Address not mapped (1) > [barq:27118] Failing at address: 0x51681f96 > [barq:27121] *** Process received signal *** > [barq:27121] Signal: Segmentation fault (11) > [barq:27121] Signal code: Address not mapped (1) > [barq:27121] Failing at address: 0x77b5685 > [barq:27118] [ 0] [0xffffe410] > [barq:27118] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7d20f3c] > [barq:27118] [ 2] ./exec(main+0x2214) [0x804ad8d] > [barq:27118] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7cc9705] > [barq:27121] [ 0] [0xffffe410] > [barq:27121] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7d0ef3c] > [barq:27121] [ 2] ./exec(main+0x2214) [0x804ad8d] > [barq:27121] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7cb7705] > [barq:27121] [ 4] ./exec [0x8048b01] > [barq:27121] *** End of error message *** > [barq:27118] [ 4] ./exec [0x8048b01] > [barq:27118] *** End of error message *** > -------------------------------------------------------------------------- > mpirun noticed that process rank 3 with PID 27121 on node barq exited on > signal 11 (Segmentation fault). > -------------------------------------------------------------------------- > [barq:27120] *** Process received signal *** > [barq:27120] Signal: Segmentation fault (11) > [barq:27120] Signal code: Address not mapped (1) > [barq:27120] Failing at address: 0x4bd1ca3e > [barq:27120] [ 0] [0xffffe410] > [barq:27120] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7c97f3c] > [barq:27120] [ 2] ./exec(main+0x2214) [0x804ad8d] > [barq:27120] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7c40705] > [barq:27120] [ 4] ./exec [0x8048b01] > [barq:27120] *** End of error message *** > > > ---------------- > > Because of the warning:regcache incompatible with malloc warning I did > > bibrak@barq:~/XXX> export MX_RCACHE=2 > > And now ignored the warning, but the error still remains > > I shall appreciate any help. > > Bibrak Qamar > NUST-SEECS > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users