Alex,
+1 vote for the core file. It is a good starting point.

* If you can't (for some reason) generate the core file, you can drop a 
while(1) loop somewhere in the init code and attach gdb later.
* If you are looking for a more user-friendly experience, you can try Allinea 
DDT (they have a 30-day trial version).
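
A minimal sketch of the while(1) idea, in case it helps. The helper name 
wait_for_debugger and its parameters are made up for illustration and are not 
part of Open MPI; the poll limit exists only so the loop cannot hang forever 
by accident.

```c
#include <assert.h>
#include <stdio.h>
#include <unistd.h>

/*
 * Hypothetical helper (not an Open MPI API): print our PID, then poll
 * until a debugger clears *hold or until max_polls polls have elapsed.
 * Pass max_polls < 0 to wait indefinitely.
 * Returns the number of polls that were performed.
 */
int wait_for_debugger(volatile int *hold, int max_polls, unsigned poll_seconds)
{
    int polls = 0;

    assert(hold != NULL);
    fprintf(stderr, "PID %d is waiting for a debugger (gdb -p %d)\n",
            (int)getpid(), (int)getpid());

    /* volatile forces a fresh read of *hold on every iteration, so a
     * value written from the debugger is actually observed. */
    while (*hold && (max_polls < 0 || polls < max_polls)) {
        sleep(poll_seconds);
        polls++;
    }
    return polls;
}
```

To use it, declare "volatile int hold = 1;" early in the init code and call 
wait_for_debugger(&hold, -1, 5). Then attach with "gdb -p <PID>", select the 
wait_for_debugger frame, run "set var *hold = 0", and "continue" to step 
toward the segfault.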

Regards,
Pasha.

> Another thing to try is to load up the core file in gdb and see if that gives 
> you a valid stack trace of where exactly the segv occurred.
>
>
> On Apr 25, 2012, at 9:30 AM, Alex Margolin wrote:
>
>> On 04/25/2012 02:57 PM, Ralph Castain wrote:
>>> Strange that your code didn't generate any symbols - is that a mosix thing? 
>>> Have you tried just adding opal_output (so it goes to a special diagnostic 
>>> output channel) statements in your code to see where the segfault is 
>>> occurring?
>>>
>>> It looks like you are getting thru orte_init. You could add -mca 
>>> grpcomm_base_verbose 5 to see if you are getting in/thru the modex - if so, 
>>> then you are probably failing in add_procs.
>>>
>> I guess the symbols are a mosix thing, but it should still show some sort of 
>> segmentation fault trace, no? Maybe only the assembly opcode... It seems 
>> that the SEGV is detected, rather than caught. This may also be related to 
>> mosix - I'll check it with the mosix developer.
>>
>> I added the parameter you suggested and appended the output. Modex seems to 
>> be working because I use it to exchange the IP and PID, and as you can see 
>> at the bottom these are received OK. I'll try debug printouts specifically 
>> in add_procs. Thanks for the advice!
>>
>> alex@singularity:~/huji/benchmarks/mpi/npb$ mpirun -mca grpcomm_base_verbose 
>> 5 -mca btl self,mosix -mca btl_base_verbose 100 -n 4 ft.S.4
>> [singularity:08915] mca:base:select:(grpcomm) Querying component [bad]
>> [singularity:08915] mca:base:select:(grpcomm) Query of component [bad] set 
>> priority to 10
>> [singularity:08915] mca:base:select:(grpcomm) Selected component [bad]
>> [singularity:08915] [[37778,0],0] grpcomm:base:receive start comm
>> [singularity:08915] [[37778,0],0] grpcomm:bad:xcast sent to job [37778,0] 
>> tag 1
>> [singularity:08915] [[37778,0],0] grpcomm:xcast:recv:send_relay
>> [singularity:08915] [[37778,0],0] grpcomm:base:xcast updating nidmap
>> [singularity:08915] [[37778,0],0] orte:daemon:send_relay - recipient list is 
>> empty!
>> [singularity:08916] mca:base:select:(grpcomm) Querying component [bad]
>> [singularity:08916] mca:base:select:(grpcomm) Query of component [bad] set 
>> priority to 10
>> [singularity:08916] mca:base:select:(grpcomm) Selected component [bad]
>> [singularity:08916] [[37778,1],0] grpcomm:base:receive start comm
>> [singularity:08919] mca:base:select:(grpcomm) Querying component [bad]
>> [singularity:08919] mca:base:select:(grpcomm) Query of component [bad] set 
>> priority to 10
>> [singularity:08919] mca:base:select:(grpcomm) Selected component [bad]
>> [singularity:08919] [[37778,1],2] grpcomm:base:receive start comm
>> [singularity:08917] mca:base:select:(grpcomm) Querying component [bad]
>> [singularity:08917] mca:base:select:(grpcomm) Query of component [bad] set 
>> priority to 10
>> [singularity:08917] mca:base:select:(grpcomm) Selected component [bad]
>> [singularity:08917] [[37778,1],1] grpcomm:base:receive start comm
>> [singularity:08921] mca:base:select:(grpcomm) Querying component [bad]
>> [singularity:08921] mca:base:select:(grpcomm) Query of component [bad] set 
>> priority to 10
>> [singularity:08921] mca:base:select:(grpcomm) Selected component [bad]
>> [singularity:08921] [[37778,1],3] grpcomm:base:receive start comm
>> [singularity:08916] [[37778,1],0] grpcomm:set_proc_attr: setting attribute 
>> MPI_THREAD_LEVEL data size 1
>> [singularity:08916] [[37778,1],0] grpcomm:set_proc_attr: setting attribute 
>> OMPI_ARCH data size 11
>> [singularity:08919] [[37778,1],2] grpcomm:set_proc_attr: setting attribute 
>> MPI_THREAD_LEVEL data size 1
>> [singularity:08919] [[37778,1],2] grpcomm:set_proc_attr: setting attribute 
>> OMPI_ARCH data size 11
>> [singularity:08917] [[37778,1],1] grpcomm:set_proc_attr: setting attribute 
>> MPI_THREAD_LEVEL data size 1
>> [singularity:08917] [[37778,1],1] grpcomm:set_proc_attr: setting attribute 
>> OMPI_ARCH data size 11
>> [singularity:08921] [[37778,1],3] grpcomm:set_proc_attr: setting attribute 
>> MPI_THREAD_LEVEL data size 1
>> [singularity:08921] [[37778,1],3] grpcomm:set_proc_attr: setting attribute 
>> OMPI_ARCH data size 11
>> [singularity:08916] mca: base: components_open: Looking for btl components
>> [singularity:08916] mca: base: components_open: opening btl components
>> [singularity:08916] mca: base: components_open: found loaded component mosix
>> [singularity:08916] mca: base: components_open: component mosix register 
>> function successful
>> [singularity:08916] mca: base: components_open: component mosix open 
>> function successful
>> [singularity:08916] mca: base: components_open: found loaded component self
>> [singularity:08916] mca: base: components_open: component self has no 
>> register function
>> [singularity:08916] mca: base: components_open: component self open function 
>> successful
>> [singularity:08919] mca: base: components_open: Looking for btl components
>> [singularity:08917] mca: base: components_open: Looking for btl components
>> [singularity:08919] mca: base: components_open: opening btl components
>> [singularity:08919] mca: base: components_open: found loaded component mosix
>> [singularity:08919] mca: base: components_open: component mosix register 
>> function successful
>> [singularity:08919] mca: base: components_open: component mosix open 
>> function successful
>> [singularity:08919] mca: base: components_open: found loaded component self
>> [singularity:08919] mca: base: components_open: component self has no 
>> register function
>> [singularity:08919] mca: base: components_open: component self open function 
>> successful
>> [singularity:08921] mca: base: components_open: Looking for btl components
>> [singularity:08917] mca: base: components_open: opening btl components
>> [singularity:08917] mca: base: components_open: found loaded component mosix
>> [singularity:08917] mca: base: components_open: component mosix register 
>> function successful
>> [singularity:08917] mca: base: components_open: component mosix open 
>> function successful
>> [singularity:08917] mca: base: components_open: found loaded component self
>> [singularity:08917] mca: base: components_open: component self has no 
>> register function
>> [singularity:08917] mca: base: components_open: component self open function 
>> successful
>> [singularity:08921] mca: base: components_open: opening btl components
>> [singularity:08921] mca: base: components_open: found loaded component mosix
>> [singularity:08921] mca: base: components_open: component mosix register 
>> function successful
>> [singularity:08921] mca: base: components_open: component mosix open 
>> function successful
>> [singularity:08921] mca: base: components_open: found loaded component self
>> [singularity:08921] mca: base: components_open: component self has no 
>> register function
>> [singularity:08921] mca: base: components_open: component self open function 
>> successful
>> [singularity:08916] select: initializing btl component mosix
>> [singularity:08916] [[37778,1],0] grpcomm:set_proc_attr: setting attribute 
>> btl.mosix.1.7 data size 20
>> [singularity:08919] select: initializing btl component mosix
>> [singularity:08916] select: init of component mosix returned success
>> [singularity:08916] select: initializing btl component self
>> [singularity:08916] select: init of component self returned success
>> [singularity:08916] [[37778,1],0] grpcomm:base:modex: performing modex
>> [singularity:08916] [[37778,1],0] grpcomm:base:pack_modex: reporting 3 
>> entries
>> [singularity:08916] [[37778,1],0] grpcomm:base:full:modex: executing 
>> allgather
>> [singularity:08916] [[37778,1],0] grpcomm:bad entering allgather
>> [singularity:08915] [[37778,0],0] COLLECTIVE RECVD FROM [[37778,1],0]
>> [singularity:08915] [[37778,0],0] WORKING COLLECTIVE 0
>> [singularity:08915] [[37778,0],0] ADDING [[37778,1],WILDCARD] TO PARTICIPANTS
>> [singularity:08915] [[37778,0],0] PROGRESSING COLLECTIVE 0
>> [singularity:08915] [[37778,0],0] PROGRESSING COLL id 0
>> [singularity:08915] [[37778,0],0] ALL LOCAL PROCS CONTRIBUTE 4
>> [singularity:08916] [[37778,1],0] grpcomm:bad allgather underway
>> [singularity:08916] [[37778,1],0] grpcomm:base:modex: modex posted
>> [singularity:08919] [[37778,1],2] grpcomm:set_proc_attr: setting attribute 
>> btl.mosix.1.7 data size 20
>> [singularity:08917] select: initializing btl component mosix
>> [singularity:08917] [[37778,1],1] grpcomm:set_proc_attr: setting attribute 
>> btl.mosix.1.7 data size 20
>> [singularity:08921] select: initializing btl component mosix
>> [singularity:08921] [[37778,1],3] grpcomm:set_proc_attr: setting attribute 
>> btl.mosix.1.7 data size 20
>> [singularity:08919] select: init of component mosix returned success
>> [singularity:08919] select: initializing btl component self
>> [singularity:08919] select: init of component self returned success
>> [singularity:08919] [[37778,1],2] grpcomm:base:modex: performing modex
>> [singularity:08919] [[37778,1],2] grpcomm:base:pack_modex: reporting 3 
>> entries
>> [singularity:08919] [[37778,1],2] grpcomm:base:full:modex: executing 
>> allgather
>> [singularity:08919] [[37778,1],2] grpcomm:bad entering allgather
>> [singularity:08915] [[37778,0],0] COLLECTIVE RECVD FROM [[37778,1],2]
>> [singularity:08915] [[37778,0],0] WORKING COLLECTIVE 0
>> [singularity:08915] [[37778,0],0] PROGRESSING COLLECTIVE 0
>> [singularity:08915] [[37778,0],0] PROGRESSING COLL id 0
>> [singularity:08915] [[37778,0],0] ALL LOCAL PROCS CONTRIBUTE 4
>> [singularity:08919] [[37778,1],2] grpcomm:bad allgather underway
>> [singularity:08919] [[37778,1],2] grpcomm:base:modex: modex posted
>> [singularity:08917] select: init of component mosix returned success
>> [singularity:08917] select: initializing btl component self
>> [singularity:08917] select: init of component self returned success
>> [singularity:08917] [[37778,1],1] grpcomm:base:modex: performing modex
>> [singularity:08917] [[37778,1],1] grpcomm:base:pack_modex: reporting 3 
>> entries
>> [singularity:08917] [[37778,1],1] grpcomm:base:full:modex: executing 
>> allgather
>> [singularity:08917] [[37778,1],1] grpcomm:bad entering allgather
>> [singularity:08915] [[37778,0],0] COLLECTIVE RECVD FROM [[37778,1],1]
>> [singularity:08915] [[37778,0],0] WORKING COLLECTIVE 0
>> [singularity:08915] [[37778,0],0] PROGRESSING COLLECTIVE 0
>> [singularity:08915] [[37778,0],0] PROGRESSING COLL id 0
>> [singularity:08915] [[37778,0],0] ALL LOCAL PROCS CONTRIBUTE 4
>> [singularity:08917] [[37778,1],1] grpcomm:bad allgather underway
>> [singularity:08917] [[37778,1],1] grpcomm:base:modex: modex posted
>> [singularity:08921] select: init of component mosix returned success
>> [singularity:08921] select: initializing btl component self
>> [singularity:08921] select: init of component self returned success
>> [singularity:08921] [[37778,1],3] grpcomm:base:modex: performing modex
>> [singularity:08921] [[37778,1],3] grpcomm:base:pack_modex: reporting 3 
>> entries
>> [singularity:08921] [[37778,1],3] grpcomm:base:full:modex: executing 
>> allgather
>> [singularity:08921] [[37778,1],3] grpcomm:bad entering allgather
>> [singularity:08915] [[37778,0],0] COLLECTIVE RECVD FROM [[37778,1],3]
>> [singularity:08915] [[37778,0],0] WORKING COLLECTIVE 0
>> [singularity:08915] [[37778,0],0] PROGRESSING COLLECTIVE 0
>> [singularity:08915] [[37778,0],0] PROGRESSING COLL id 0
>> [singularity:08915] [[37778,0],0] ALL LOCAL PROCS CONTRIBUTE 4
>> [singularity:08915] [[37778,0],0] COLLECTIVE 0 LOCALLY COMPLETE - SENDING TO 
>> GLOBAL COLLECTIVE
>> [singularity:08915] [[37778,0],0] grpcomm:base:daemon_coll: daemon 
>> collective recvd from [[37778,0],0]
>> [singularity:08915] [[37778,0],0] grpcomm:base:daemon_coll: WORKING 
>> COLLECTIVE 0
>> [singularity:08915] [[37778,0],0] grpcomm:base:daemon_coll: NUM CONTRIBS: 4
>> [singularity:08915] [[37778,0],0] grpcomm:bad:xcast sent to job [37778,1] 
>> tag 30
>> [singularity:08915] [[37778,0],0] grpcomm:xcast:recv:send_relay
>> [singularity:08915] [[37778,0],0] orte:daemon:send_relay - recipient list is 
>> empty!
>> [singularity:08921] [[37778,1],3] grpcomm:bad allgather underway
>> [singularity:08921] [[37778,1],3] grpcomm:base:modex: modex posted
>> [singularity:08921] [[37778,1],3] grpcomm:base:receive processing collective 
>> return for id 0
>> [singularity:08921] [[37778,1],3] CHECKING COLL id 0
>> [singularity:08921] [[37778,1],3] STORING MODEX DATA
>> [singularity:08921] [[37778,1],3] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],0]
>> [singularity:08921] [[37778,1],3] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],0]
>> [singularity:08921] [[37778,1],3] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],2]
>> [singularity:08917] [[37778,1],1] grpcomm:base:receive processing collective 
>> return for id 0
>> [singularity:08916] [[37778,1],0] grpcomm:base:receive processing collective 
>> return for id 0
>> [singularity:08916] [[37778,1],0] CHECKING COLL id 0
>> [singularity:08917] [[37778,1],1] CHECKING COLL id 0
>> [singularity:08916] [[37778,1],0] STORING MODEX DATA
>> [singularity:08917] [[37778,1],1] STORING MODEX DATA
>> [singularity:08921] [[37778,1],3] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],2]
>> [singularity:08916] [[37778,1],0] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],0]
>> [singularity:08917] [[37778,1],1] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],0]
>> [singularity:08921] [[37778,1],3] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],1]
>> [singularity:08916] [[37778,1],0] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],0]
>> [singularity:08917] [[37778,1],1] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],0]
>> [singularity:08921] [[37778,1],3] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],1]
>> [singularity:08916] [[37778,1],0] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],2]
>> [singularity:08917] [[37778,1],1] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],2]
>> [singularity:08917] [[37778,1],1] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],2]
>> [singularity:08916] [[37778,1],0] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],2]
>> [singularity:08917] [[37778,1],1] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],1]
>> [singularity:08916] [[37778,1],0] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],1]
>> [singularity:08917] [[37778,1],1] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],1]
>> [singularity:08916] [[37778,1],0] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],1]
>> [singularity:08917] [[37778,1],1] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],3]
>> [singularity:08916] [[37778,1],0] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],3]
>> [singularity:08917] [[37778,1],1] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],3]
>> [singularity:08916] [[37778,1],0] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],3]
>> [singularity:08921] [[37778,1],3] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],3]
>> [singularity:08921] [[37778,1],3] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],3]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],0]
>> [singularity:08915] [[37778,0],0] COLLECTIVE RECVD FROM [[37778,1],3]
>> [singularity:08915] [[37778,0],0] WORKING COLLECTIVE 1
>> [singularity:08915] [[37778,0],0] ADDING [[37778,1],WILDCARD] TO PARTICIPANTS
>> [singularity:08915] [[37778,0],0] PROGRESSING COLLECTIVE 1
>> [singularity:08915] [[37778,0],0] PROGRESSING COLL id 1
>> [singularity:08915] [[37778,0],0] ALL LOCAL PROCS CONTRIBUTE 4
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],0]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],0]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],1]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],1]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],2]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],2]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],0]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],0]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],1]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],1]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],2]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],2]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],3]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],3]
>> [singularity:08921] [[37778,1],3] grpcomm:bad entering barrier
>> [singularity:08921] [[37778,1],3] grpcomm:bad barrier underway
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],1]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],1]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],2]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],2]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],3]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],3]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],0]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],0]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],1]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],1]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],2]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],0]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],2]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],2]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],3]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],3]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],0]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],0]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],1]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],1]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],2]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],2]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],3]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],3]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],2]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],3]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],3]
>> [singularity:08916] [[37778,1],0] grpcomm:bad entering barrier
>> [singularity:08917] [[37778,1],1] grpcomm:bad entering barrier
>> [singularity:08915] [[37778,0],0] COLLECTIVE RECVD FROM [[37778,1],0]
>> [singularity:08915] [[37778,0],0] WORKING COLLECTIVE 1
>> [singularity:08915] [[37778,0],0] PROGRESSING COLLECTIVE 1
>> [singularity:08915] [[37778,0],0] PROGRESSING COLL id 1
>> [singularity:08915] [[37778,0],0] ALL LOCAL PROCS CONTRIBUTE 4
>> [singularity:08915] [[37778,0],0] COLLECTIVE RECVD FROM [[37778,1],1]
>> [singularity:08915] [[37778,0],0] WORKING COLLECTIVE 1
>> [singularity:08915] [[37778,0],0] PROGRESSING COLLECTIVE 1
>> [singularity:08915] [[37778,0],0] PROGRESSING COLL id 1
>> [singularity:08915] [[37778,0],0] ALL LOCAL PROCS CONTRIBUTE 4
>> [singularity:08917] [[37778,1],1] grpcomm:bad barrier underway
>> [singularity:08916] [[37778,1],0] grpcomm:bad barrier underway
>> [singularity:08919] [[37778,1],2] grpcomm:base:receive processing collective 
>> return for id 0
>> [singularity:08919] [[37778,1],2] CHECKING COLL id 0
>> [singularity:08919] [[37778,1],2] STORING MODEX DATA
>> [singularity:08919] [[37778,1],2] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],0]
>> [singularity:08919] [[37778,1],2] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],0]
>> [singularity:08919] [[37778,1],2] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],2]
>> [singularity:08919] [[37778,1],2] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],2]
>> [singularity:08919] [[37778,1],2] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],1]
>> [singularity:08919] [[37778,1],2] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],1]
>> [singularity:08919] [[37778,1],2] grpcomm:base:store_modex adding modex 
>> entry for proc [[37778,1],3]
>> [singularity:08919] [[37778,1],2] grpcomm:base:update_modex_entries: adding 
>> 3 entries for proc [[37778,1],3]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],0]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],0]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],1]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],1]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: searching for attr 
>> OMPI_ARCH on proc [[37778,1],3]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: found 11 bytes for 
>> attr OMPI_ARCH on proc [[37778,1],3]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],0]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],0]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],1]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],1]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],2]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],2]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: searching for attr 
>> btl.mosix.1.7 on proc [[37778,1],3]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: found 20 bytes for 
>> attr btl.mosix.1.7 on proc [[37778,1],3]
>> [singularity:08919] [[37778,1],2] grpcomm:bad entering barrier
>> [singularity:08915] [[37778,0],0] COLLECTIVE RECVD FROM [[37778,1],2]
>> [singularity:08915] [[37778,0],0] WORKING COLLECTIVE 1
>> [singularity:08915] [[37778,0],0] PROGRESSING COLLECTIVE 1
>> [singularity:08915] [[37778,0],0] PROGRESSING COLL id 1
>> [singularity:08915] [[37778,0],0] ALL LOCAL PROCS CONTRIBUTE 4
>> [singularity:08915] [[37778,0],0] COLLECTIVE 1 LOCALLY COMPLETE - SENDING TO 
>> GLOBAL COLLECTIVE
>> [singularity:08915] [[37778,0],0] grpcomm:base:daemon_coll: daemon 
>> collective recvd from [[37778,0],0]
>> [singularity:08915] [[37778,0],0] grpcomm:base:daemon_coll: WORKING 
>> COLLECTIVE 1
>> [singularity:08915] [[37778,0],0] grpcomm:base:daemon_coll: NUM CONTRIBS: 4
>> [singularity:08915] [[37778,0],0] grpcomm:bad:xcast sent to job [37778,1] 
>> tag 30
>> [singularity:08915] [[37778,0],0] grpcomm:xcast:recv:send_relay
>> [singularity:08915] [[37778,0],0] orte:daemon:send_relay - recipient list is 
>> empty!
>> [singularity:08919] [[37778,1],2] grpcomm:bad barrier underway
>> [singularity:08916] [[37778,1],0] grpcomm:base:receive processing collective 
>> return for id 1
>> [singularity:08916] [[37778,1],0] CHECKING COLL id 1
>> [singularity:08917] [[37778,1],1] grpcomm:base:receive processing collective 
>> return for id 1
>> [singularity:08921] [[37778,1],3] grpcomm:base:receive processing collective 
>> return for id 1
>> [singularity:08921] [[37778,1],3] CHECKING COLL id 1
>> [singularity:08917] [[37778,1],1] CHECKING COLL id 1
>> [singularity:08919] [[37778,1],2] grpcomm:base:receive processing collective 
>> return for id 1
>> [singularity:08919] [[37778,1],2] CHECKING COLL id 1
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],0]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],0]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],1]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],1]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],2]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],2]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],3]
>> [singularity:08919] [[37778,1],2] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],3]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],0]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],0]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],1]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],1]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],2]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],2]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],3]
>> [singularity:08921] [[37778,1],3] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],3]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],0]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],0]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],0]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],1]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],1]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],2]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],2]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],3]
>> [singularity:08917] [[37778,1],1] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],3]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],0]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],1]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],1]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],2]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],2]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: searching for attr 
>> MPI_THREAD_LEVEL on proc [[37778,1],3]
>> [singularity:08916] [[37778,1],0] grpcomm:get_proc_attr: found 1 bytes for 
>> attr MPI_THREAD_LEVEL on proc [[37778,1],3]
>>
>>
>> NAS Parallel Benchmarks 3.3 -- FT Benchmark
>>
>> No input file inputft.data. Using compiled defaults
>> Size                :   64x  64x  64
>> Iterations          :              6
>> Number of processes :              4
>> Processor array     :         1x   4
>> Layout type         :             1D
>> [singularity:08916] btl: mosix: Establishind TCP link to address 127.0.0.1 
>> and PID #8917
>> [singularity:08917] btl: mosix: Establishind TCP link to address 127.0.0.1 
>> and PID #8921
>> [singularity:08916] btl: mosix: Establishind TCP link to address 127.0.0.1 
>> and PID #8919
>> [singularity:08919] btl: mosix: Establishind TCP link to address 127.0.0.1 
>> and PID #8921
>> [singularity:08921] btl: mosix: Establishind TCP link to address 127.0.0.1 
>> and PID #8919
>> [singularity:08917] btl: mosix: Establishind TCP link to address 127.0.0.1 
>> and PID #8916
>> [singularity:08921] btl: mosix: Establishind TCP link to address 127.0.0.1 
>> and PID #8917
>> [singularity:08915] [[37778,0],0] grpcomm:bad:xcast sent to job [37778,0] 
>> tag 1
>> [singularity:08915] [[37778,0],0] grpcomm:xcast:recv:send_relay
>> [singularity:08915] [[37778,0],0] orte:daemon:send_relay - recipient list is 
>> empty!
>> --------------------------------------------------------------------------
>> mpirun noticed that process rank 2 with PID 8919 on node singularity exited 
>> on signal 11 (Segmentation fault).
>> --------------------------------------------------------------------------
>> [singularity:08915] [[37778,0],0] grpcomm:bad:xcast sent to job [37778,0] 
>> tag 1
>> [singularity:08915] [[37778,0],0] grpcomm:xcast:recv:send_relay
>> [singularity:08915] [[37778,0],0] orte:daemon:send_relay - recipient list is 
>> empty!
>> alex@singularity:~/huji/benchmarks/mpi/npb$
>>
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>
>
> --
> Jeff Squyres
> jsquy...@cisco.com
> For corporate legal information go to: 
> http://www.cisco.com/web/about/doing_business/legal/cri/
>
>
