- **status**: review --> fixed
- **Comment**:

changeset:   6452:5cb9e55c30a1
tag:         tip
parent:      6448:f3f0141f63cf
user:        Anders Bjornerstedt <[email protected]>
date:        Thu Apr 09 06:46:10 2015 +0200
summary:     IMM: Halt sync with SIGKILL insted of SIGTERM and replace assert 
with error [#1295]

changeset:   6451:01c7ec2eb9d2
branch:      opensaf-4.6.x
parent:      6446:528d42effd30
user:        Anders Bjornerstedt <[email protected]>
date:        Thu Apr 09 06:46:10 2015 +0200
summary:     IMM: Halt sync with SIGKILL insted of SIGTERM and replace assert 
with error [#1295]

changeset:   6450:1b3668b778c3
branch:      opensaf-4.5.x
parent:      6445:62c33d14251b
user:        Anders Bjornerstedt <[email protected]>
date:        Thu Apr 09 06:46:10 2015 +0200
summary:     IMM: Halt sync with SIGKILL insted of SIGTERM and replace assert 
with error [#1295]

changeset:   6449:7752fcc35b07
branch:      opensaf-4.4.x
parent:      6444:bf87928b267c
user:        Anders Bjornerstedt <[email protected]>
date:        Thu Apr 09 06:46:10 2015 +0200
summary:     IMM: Halt sync with SIGKILL insted of SIGTERM and replace assert 
with error [#1295]
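
For readers who have not opened the changesets, the "replace assert with error" part of the summary can be pictured roughly as below. This is only a sketch, not the committed code: `ObjectInfo`, `SyncStep`, `checkObjectForSync` and the bit value given to `IMM_RT_UPDATE_LOCK` are hypothetical stand-ins; only the flag name and `mObjFlags` come from the assertion text in the ticket.

```cpp
#include <cstdint>

// Hypothetical stand-ins for illustration; the real definitions are in the
// IMM sources and may differ.
constexpr std::uint32_t IMM_RT_UPDATE_LOCK = 0x1;  // assumed bit value

enum class SyncStep { Ok, AbortSync };

struct ObjectInfo {
  std::uint32_t mObjFlags = 0;
};

// Rather than assert(!(obj.mObjFlags & IMM_RT_UPDATE_LOCK)), which takes the
// whole process down, report the condition so the caller can abort the sync.
SyncStep checkObjectForSync(const ObjectInfo& obj) {
  if (obj.mObjFlags & IMM_RT_UPDATE_LOCK) {
    return SyncStep::AbortSync;
  }
  return SyncStep::Ok;
}
```

The design point is that a locked runtime object found during sync becomes a recoverable condition for that sync attempt instead of a crash of the process that hit it.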




---

**[tickets:#1295] IMM: ImmModel.cc:10908: nextSyncResult: Assertion '!(obj->mObjFlags & IMM_RT_UPDATE_LOCK)' failed**

**Status:** fixed
**Milestone:** 4.4.2
**Created:** Wed Apr 01, 2015 09:45 AM UTC by Sirisha Alla
**Last Updated:** Thu Apr 09, 2015 05:12 AM UTC
**Owner:** Anders Bjornerstedt

This issue is seen on 4.6 FC, changeset 6377. The setup has a single PBE 
enabled, with 50k objects.

A switchover is triggered and OpenSAF services on the payload are started.

syslog on SC-1:

Apr  1 12:38:39 SLES-64BIT-SLOT1 kernel: [ 7362.223601] TIPC: Established link 
<1.1.1:eth0-1.1.3:eth0> on network plane A
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafamfnd[2526]: NO Assigning 
'safSi=SC-2N,safApp=OpenSAF' ACTIVE to 'safSu=SC-1,safSg=2N,safApp=OpenSAF'
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: WA IMMD not re-electing coord 
for switch-over (si-swap) coord at (2010f)
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafntfimcnd[6497]: NO exiting on signal 15
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Implementer connected: 41 
(safMsgGrpService) <313, 2010f>
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Implementer connected: 42 
(safLogService) <6, 2010f>
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Implementer connected: 43 
(safCheckPointService) <317, 2010f>
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Implementer disconnected 
38 <702, 2010f> (@OpenSafImmReplicatorA)
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Backup create cmd = 
/usr/lib64/opensaf/smf-backup-create
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Bundle check cmd = 
/usr/lib64/opensaf/smf-bundle-check
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Node check cmd = 
/usr/lib64/opensaf/smf-node-check
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO SMF repository check cmd = 
/usr/lib64/opensaf/smf-repository-check
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Cluster reboot cmd = 
/usr/lib64/opensaf/smf-cluster-reboot
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Admin Op Timeout = 
600000000000
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Cli Timeout = 600000000000
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Reboot Timeout = 
600000000000
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO SMF will use the STEP 
standard set of actions.
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Implementer connected: 44 
(safLckService) <315, 2010f>
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO DN for si_swap operation = 
safSi=SC-2N,safApp=OpenSAF
..........

Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: WA Got error on non local rt 
object update err: 6
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: WA Failed RtObject update has 
to abort sync
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO NODE STATE-> 
IMM_NODE_FULLY_AVAILABLE (2484)
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Epoch set to 36 in ImmModel
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: NO ACT: New Epoch for IMMND 
process at node 2010f old epoch: 35  new epoch:36
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Coord broadcasting 
ABORT_SYNC, epoch:36
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmpbed: NO Update epoch 36 committing 
with ccbId:10000000e/4294967310
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: WA Global ABORT SYNC received 
for epoch 36
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: WA Successfully aborted sync. 
Epoch:36
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: NO ACT: New Epoch for IMMND 
process at node 2040f old epoch: 35  new epoch:36
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: NO ACT: New Epoch for IMMND 
process at node 2020f old epoch: 35  new epoch:36
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: NO Node 2030f request sync 
sync-pid:5629 epoch:0
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmloadd: logtrace: trace enabled to file 
/var/log/opensaf/osafimmnd, mask=0xffffffff
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmloadd: NO Sync starting
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO SERVER STATE: 
IMM_SERVER_SYNC_SERVER --> IMM SERVER READY
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: ImmModel.cc:10908: 
nextSyncResult: Assertion '!(obj->mObjFlags & IMM_RT_UPDATE_LOCK)' failed.
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafimmpbed: WA PBE lost contact with parent 
IMMND - Exiting
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafamfnd[2526]: NO 
'safSu=SC-1,safSg=NoRed,safApp=OpenSAF' component restart probation timer 
started (timeout: 60000000000 ns)
Apr  1 12:38:40 SLES-64BIT-SLOT1 osafntfimcnd[6520]: NO saImmOiDispatch() Fail 
SA_AIS_ERR_BAD_HANDLE (9)


CLM initialization returned ERR_TRY_AGAIN and the node went for reboot:

Apr  1 12:38:55 SLES-64BIT-SLOT1 osafamfd[2516]: NO Switching StandBy --> 
Active State
Apr  1 12:39:01 SLES-64BIT-SLOT1 osafclmd[2497]: ER saImmOiInitialize_2 failed 
6, exiting
Apr  1 12:39:01 SLES-64BIT-SLOT1 osafamfnd[2526]: NO 
'safComp=CLM,safSu=SC-1,safSg=2N,safApp=OpenSAF' faulted due to 'avaDown' : 
Recovery is 'nodeFailfast'
Apr  1 12:39:01 SLES-64BIT-SLOT1 osafamfnd[2526]: ER 
safComp=CLM,safSu=SC-1,safSg=2N,safApp=OpenSAF Faulted due to:avaDown Recovery 
is:nodeFailfast
Apr  1 12:39:01 SLES-64BIT-SLOT1 osafamfnd[2526]: Rebooting OpenSAF NodeId = 
131343 EE Name = , Reason: Component faulted: recovery is node failfast, 
OwnNodeId = 131343, SupervisionTime = 60
Apr  1 12:39:01 SLES-64BIT-SLOT1 opensaf_reboot: Rebooting local node; 
timeout=60

Syslog and immnd traces of the controllers are attached.
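
A note for anyone cross-referencing the excerpts above: the IMM lines print node ids in hex (for example `2010f`), while the AMF reboot line prints `NodeId = 131343` in decimal. The two values are numerically equal, and the same holds for the two halves of `ccbId:10000000e/4294967310`. A small helper along these lines (the name `toHex` is made up for this sketch) can be used to check such pairs:

```cpp
#include <cstdint>
#include <sstream>
#include <string>

// Format a numeric id as lowercase hex without a prefix, matching how the
// ids appear in the IMM log lines quoted in this ticket.
std::string toHex(std::uint64_t value) {
  std::ostringstream out;
  out << std::hex << value;
  return out.str();
}
```

For instance, `toHex(131343)` yields `2010f` and `toHex(4294967310ULL)` yields `10000000e`.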


