Wow, it appears everything aborts when opal_event_loop() is called. Did you make any changes to the event library code in opal/event/? If not, that might indicate a mismatch between the binaries and libraries (ie, binaries from one build vs. libraries from another). This will cause random segfaults, possibly like this.

If that's no help, can you run ompi_info under gdb and generate a detailed stack trace?

Thanks,

Brian

On Apr 15, 2007, at 11:40 AM, chaitali dherange wrote:

I have downloaded the developer version of source code by downloading a
nightly Subversion snapshot tarball.And have installed the openmpi.
Using

./configure --prefix=/net/hc293/chaitali/openmpi_dev
(lots of output... without errors)
make all install.
(lots of output... without errors)

then I have tried to run the example provided in this version of source code... the ring_c.c file... I first copied it to my home directory... /net/hc293/chaitali
now when inside my home directory... i did

set path=($path /net.hc293/chaitali/openmpi_dev/bin)
set $LD_LIBRARY_PATH = ( /net/hc293/chaitali/dev_openmpi/lib )
mpicc -o chaitali_test ring_c.c
(This gave no errors at all)
mpirun --prefix /net/hc293/chaitali/openmpi_dev -np 3 --hostfile / net/hc293/chaitali/machinefile ./test_chaitali
(This gave foll errors..)
[oolong:09783] *** Process received signal ***
[oolong:09783] Signal: Segmentation fault (11)
[oolong:09783] Signal code:  (128)
[oolong:09783] Failing at address: (nil)
[oolong:09783] [ 0] /lib64/tls/libpthread.so.0 [0x2a95e01430]
[oolong:09783] [ 1] /net/hc293/chaitali/openmpi_dev/lib/libopen- pal.so.0(opal_event_init+0x166) [0x2a957d9e16] [oolong:09783] [ 2] /net/hc293/chaitali/openmpi_dev/lib/libopen- rte.so.0(orte_init_stage1+0x168) [0x2a95680638] [oolong:09783] [ 3] /net/hc293/chaitali/openmpi_dev/lib/libopen- rte.so.0(orte_system_init+0xa) [0x2a9568375a] [oolong:09783] [ 4] /net/hc293/chaitali/openmpi_dev/lib/libopen- rte.so.0(orte_init+0x49) [0x2a95680329]
[oolong:09783] [ 5] mpirun(orterun+0x155) [0x4029fd]
[oolong:09783] [ 6] mpirun(main+0x1b) [0x4028a3]
[oolong:09783] [ 7] /lib64/tls/libc.so.6(__libc_start_main+0xdb) [0x2a95f273fb]
[oolong:09783] [ 8] mpirun [0x4027fa]
[oolong:09783] *** End of error message ***
Segmentation fault

I understand that the [5] and [6] are the actual errors. But dont understand why? or how to overcome this error?

Please find attached the foll files:
- 'ring_c.c' file which I am trying to run.
- 'config.log' file from the openmpi-1.2.1a0r14362 folder
- 'ompi_info --all.txt' which is the the output of ompi_info -- all... This contains the above mentioned errors.

Thanks and Regards,
Chaitali
<doubt.rar>
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel

Reply via email to