Hi,

I've seen this issue for a payload node in another post which was attributed to 
a configuration error which was resolved by a reboot (?).
I have rebooted my payload node, just in case, but to no effect.

The logs in /var/log/messages when issuing the "systemctl start 
opensafd.service" command:

Oct  6 09:38:35 linux-9qkx opensafd: Starting OpenSAF Services
Oct  6 09:38:35 linux-9qkx osafdtmd[2987]: Started
Oct  6 09:38:35 linux-9qkx osafimmnd[2999]: Started
Oct  6 09:40:05 linux-9qkx systemd[1]: opensafd.service operation timed out. 
Terminating.
Oct  6 09:40:05 linux-9qkx osafimmnd[2999]: MDTM:socket_recv() = 0, conn lost 
with dh server, exiting library err :Success
Oct  6 09:40:05 linux-9qkx systemd[1]: Unit opensafd.service entered failed 
state.

I had enabled the tracing in immnd.conf which caused these in 
/var/log/opensaf/osafimmnd:

Oct  6  9:38:35.142143 osafimmnd [2999:immnd_main.c:0113] >> immnd_initialize
Oct  6  9:38:35.142188 osafimmnd [2999:osaf_secutil.c:0193] >> 
osaf_auth_server_create
Oct  6  9:38:35.142260 osafimmnd [2999:osaf_secutil.c:0215] << 
osaf_auth_server_create
Oct  6  9:38:35.142270 osafimmnd [2999:ncs_main_pub.c:0223] TR
NCS:PROCESS_ID=2999
Oct  6  9:38:35.142273 osafimmnd [2999:sysf_def.c:0090] TR INITIALIZING LEAP 
ENVIRONMENT
Oct  6  9:38:35.142962 osafimmnd [2999:sysf_def.c:0123] TR DONE INITIALIZING 
LEAP ENVIRONMENT
Oct  6  9:38:35.143088 osafimmnd [2999:ncs_main_pub.c:0755] TR 
NCS:NODE_ID=0x0002030F
Oct  6  9:38:35.143309 osafimmnd [2999:mbcsv_dl_api.c:0059] >> mbcsv_lib_req
Oct  6  9:38:35.143318 osafimmnd [2999:mbcsv_dl_api.c:0096] >> mbcsv_lib_init
Oct  6  9:38:35.143322 osafimmnd [2999:mbcsv_mbx.c:0166] >> 
mbcsv_initialize_mbx_list
Oct  6  9:38:35.143324 osafimmnd [2999:mbcsv_mbx.c:0180] << 
mbcsv_initialize_mbx_list
Oct  6  9:38:35.143328 osafimmnd [2999:mbcsv_pwe_anc.c:0162] >> 
mbcsv_initialize_peer_list
Oct  6  9:38:35.143331 osafimmnd [2999:mbcsv_pwe_anc.c:0176] << 
mbcsv_initialize_peer_list
Oct  6  9:38:35.143332 osafimmnd [2999:mbcsv_dl_api.c:0075] << mbcsv_lib_req
Oct  6  9:38:35.143334 osafimmnd [2999:ncs_main_pub.c:0393] TR
MBCSV:MBCA:ON
Oct  6  9:38:35.143342 osafimmnd [2999:immnd_main.c:0187] T2 Dir:/etc/opensaf 
File:imm.xml ExpectedNodes:3 WaitSecs:3
Oct  6  9:38:35.143352 osafimmnd [2999:immnd_mds.c:0127] >> immnd_mds_register
Oct  6  9:38:35.143457 osafimmnd [2999:immnd_mds.c:0192] T2 cb->node_id:2030f
Oct  6  9:38:35.143461 osafimmnd [2999:immnd_mds.c:0194] << immnd_mds_register
Oct  6  9:38:35.143469 osafimmnd [2999:immnd_main.c:0238] << immnd_initialize
Oct  6  9:38:35.143478 osafimmnd [2999:osaf_secutil.c:0166] >> auth_server_main
Oct  6  9:38:35.244792 osafimmnd [2999:ImmModel.cc:3381] << protocol43Allowed
Oct  6  9:38:35.244836 osafimmnd [2999:immnd_proc.c:1626] T5 tmout:100 ste:1 
ME:0 RE:0 crd:0 rim:FROM_FILE 4.3A:0 2Pbe:0 VetA/B: 0/0 othsc:0/0
Oct  6  9:38:35.244847 osafimmnd [2999:immnd_proc.c:0413] TR Possibly extended 
intro from this IMMND pbeEnabled: 2  dirsize:0
Oct  6  9:38:35.344974 osafimmnd [2999:immnd_proc.c:0413] TR Possibly extended 
intro from this IMMND pbeEnabled: 2  dirsize:0
Oct  6  9:38:35.445934 osafimmnd [2999:immnd_proc.c:0413] TR Possibly extended 
intro from this IMMND pbeEnabled: 2  dirsize:0
Oct  6  9:38:35.546974 osafimmnd [2999:immnd_proc.c:0413] TR Possibly extended 
intro from this IMMND pbeEnabled: 2  dirsize:0
.
.
.
Oct  6  9:40:04.794307 osafimmnd [2999:immnd_proc.c:0413] TR Possibly extended 
intro from this IMMND pbeEnabled: 2  dirsize:0
Oct  6  9:40:04.895424 osafimmnd [2999:immnd_proc.c:0413] TR Possibly extended 
intro from this IMMND pbeEnabled: 2  dirsize:0
Oct  6  9:40:04.996499 osafimmnd [2999:immnd_proc.c:0413] TR Possibly extended 
intro from this IMMND pbeEnabled: 2  dirsize:0
Oct  6  9:40:05.081315 osafimmnd [2999:mds_dt_trans.c:0671] >> 
mdtm_process_poll_recv_data_tcp

The start of opensafd.service eventually timed out and failed. It appears the 
function immnd_introduceMe in immnd_proc.c continually
fails. If the problem is due to pbe, I don't understand why that would happen 
on a payload node. I thought pbe was just on system
controller nodes.

This is a 3 node cluster with SC-1, SC-2, and PL-3. The controller nodes (SC-1, 
SC-2) start up okay, but not the payload node (PL-3).
These nodes are running on openSUSE 12.1 VirtualBox VMs.

I have no application interacting with openSAF, just openSAF itself installed.

Any assistance on this would be appreciated. Thanks in advance!


Jeremy Matthews


------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most 
engaging tech sites, SlashDot.org! http://sdm.link/slashdot
_______________________________________________
Opensaf-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-users

Reply via email to