Wow, it appears everything aborts when opal_event_loop() is called.
Did you make any changes to the event library code in opal/event/?
If not, that might indicate a mismatch between the binaries and
libraries (i.e., binaries from one build vs. libraries from another).
Such a mismatch can cause seemingly random segfaults, possibly like
this one.
If that's no help, can you run ompi_info under gdb and generate a
detailed stack trace?
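One way to check for the binary/library mismatch described above is to
compare the install prefix of mpirun with the prefix of the Open MPI
libraries it resolves at run time. The following is only a sketch in
Bourne shell (not csh). The helper names prefix_of and same_prefix are
made up for illustration, and it assumes that ldd is available and that
binaries live in <prefix>/bin and libraries in <prefix>/lib.

```shell
#!/bin/sh
# Hypothetical helpers for illustration; not part of Open MPI.

# prefix_of /some/prefix/bin/mpirun  ->  /some/prefix
prefix_of() {
    dirname "$(dirname "$1")"
}

# Succeeds (exit 0) when the binary and the library share an install prefix.
same_prefix() {
    [ "$(prefix_of "$1")" = "$(prefix_of "$2")" ]
}

# The live check only runs when mpirun and ldd are actually on the PATH.
if command -v mpirun >/dev/null 2>&1 && command -v ldd >/dev/null 2>&1; then
    bin=$(command -v mpirun)
    echo "mpirun found at: $bin"
    # ldd prints lines like "libopen-pal.so.0 => /path/to/lib (0x...)";
    # field 3 is the resolved path. Unresolved libraries are skipped.
    ldd "$bin" | awk '/libopen-(pal|rte)/ && $3 ~ /^\// {print $3}' |
    while read -r lib; do
        if same_prefix "$bin" "$lib"; then
            echo "ok:       $lib"
        else
            echo "MISMATCH: $lib"
        fi
    done
fi
```

Any MISMATCH line would suggest that mpirun is picking up libraries
from a different build than the one it came from. For the stack trace,
starting "gdb ompi_info" and typing "run", then "bt" after the crash,
is the usual route.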
Thanks,
Brian
On Apr 15, 2007, at 11:40 AM, chaitali dherange wrote:
I have downloaded the developer version of the source code as a
nightly Subversion snapshot tarball and installed Open MPI using:
./configure --prefix=/net/hc293/chaitali/openmpi_dev
(lots of output... without errors)
make all install.
(lots of output... without errors)
Then I tried to run the ring_c.c example provided with this version of
the source code. I first copied it to my home directory,
/net/hc293/chaitali. From inside my home directory, I ran:
set path=($path /net.hc293/chaitali/openmpi_dev/bin)
set $LD_LIBRARY_PATH = ( /net/hc293/chaitali/dev_openmpi/lib )
mpicc -o chaitali_test ring_c.c
(This gave no errors at all)
mpirun --prefix /net/hc293/chaitali/openmpi_dev -np 3 --hostfile /net/hc293/chaitali/machinefile ./test_chaitali
(This gave the following errors:)
[oolong:09783] *** Process received signal ***
[oolong:09783] Signal: Segmentation fault (11)
[oolong:09783] Signal code: (128)
[oolong:09783] Failing at address: (nil)
[oolong:09783] [ 0] /lib64/tls/libpthread.so.0 [0x2a95e01430]
[oolong:09783] [ 1] /net/hc293/chaitali/openmpi_dev/lib/libopen-pal.so.0(opal_event_init+0x166) [0x2a957d9e16]
[oolong:09783] [ 2] /net/hc293/chaitali/openmpi_dev/lib/libopen-rte.so.0(orte_init_stage1+0x168) [0x2a95680638]
[oolong:09783] [ 3] /net/hc293/chaitali/openmpi_dev/lib/libopen-rte.so.0(orte_system_init+0xa) [0x2a9568375a]
[oolong:09783] [ 4] /net/hc293/chaitali/openmpi_dev/lib/libopen-rte.so.0(orte_init+0x49) [0x2a95680329]
[oolong:09783] [ 5] mpirun(orterun+0x155) [0x4029fd]
[oolong:09783] [ 6] mpirun(main+0x1b) [0x4028a3]
[oolong:09783] [ 7] /lib64/tls/libc.so.6(__libc_start_main+0xdb) [0x2a95f273fb]
[oolong:09783] [ 8] mpirun [0x4027fa]
[oolong:09783] *** End of error message ***
Segmentation fault
I understand that frames [5] and [6] are the actual errors, but I
don't understand why they occur or how to overcome them.
Please find attached the following files:
- 'ring_c.c', the file I am trying to run.
- 'config.log' from the openmpi-1.2.1a0r14362 folder.
- 'ompi_info --all.txt', which is the output of ompi_info --all.
  This contains the above-mentioned errors.
Thanks and Regards,
Chaitali
<doubt.rar>