Hi AndersBj,
You can accept the ticket.
/Neel
On Wednesday 08 April 2015 02:34 PM, Anders Bjornerstedt wrote:
>
> The problem is that the abortSync arrives before the sync process (immloader)
> attaches to the local IMMND coord. So the sync is aborted even before it has
> managed to start.
>
> The sync process should then be aborted, its requests rejected, or both.
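
To make the ordering problem concrete, here is a minimal sketch of the idea, assuming a simple state machine on the coord side. All names (SyncCoordinator, requestSync, attachSyncProcess, nextSyncBatch) are hypothetical and are not taken from ImmModel.cc or any other OpenSAF source. It only illustrates remembering an abort that arrives before the sync process attaches, so that the late attach and any later sync requests are rejected.

```cpp
#include <cassert>

// Hypothetical illustration only; none of these names exist in OpenSAF.
enum class SyncState { Idle, Requested, Running, Aborted };

class SyncCoordinator {
public:
    // A node has requested sync; the sync process is about to be spawned.
    void requestSync() { state_ = SyncState::Requested; }

    // The abort may arrive before the sync process has attached.
    // Record it rather than dropping it.
    void abortSync() {
        if (state_ == SyncState::Requested || state_ == SyncState::Running)
            state_ = SyncState::Aborted;
    }

    // The sync process attaches. Returns false if the sync it belongs to
    // is no longer pending (for example, it was aborted before attach).
    bool attachSyncProcess() {
        if (state_ != SyncState::Requested) return false;
        state_ = SyncState::Running;
        return true;
    }

    // Requests from the sync process are served only while a sync is running.
    bool nextSyncBatch() const { return state_ == SyncState::Running; }

private:
    SyncState state_ = SyncState::Idle;
};

// Usage: the abort wins the race, so the late attach and its requests fail.
inline void demoAbortBeforeAttach() {
    SyncCoordinator coord;
    coord.requestSync();
    coord.abortSync();                  // arrives before the attach
    assert(!coord.attachSyncProcess()); // late attach is refused
    assert(!coord.nextSyncBatch());     // and its requests are rejected
}
```

In this sketch the rejection is a plain boolean return; how a real fix would report the error to the sync process is left open.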
>
> Neel, do you mind if I accept the ticket?
>
> ------------------------------------------------------------------------
>
> *[tickets:#1295] <http://sourceforge.net/p/opensaf/tickets/1295> IMM:
> ImmModel.cc:10908: nextSyncResult: Assertion '!(obj->mObjFlags &
> IMM_RT_UPDATE_LOCK)' failed*
>
> *Status:* accepted
> *Milestone:* 4.4.2
> *Created:* Wed Apr 01, 2015 09:45 AM UTC by Sirisha Alla
> *Last Updated:* Wed Apr 08, 2015 07:02 AM UTC
> *Owner:* Neelakanta Reddy
>
> This issue is seen on 4.6 FC Changeset 6377. The setup is single pbe
> enabled with 50k objects.
>
> Switchover is triggered and opensaf services on the payload are started.
>
> syslog on SC-1:
>
> Apr 1 12:38:39 SLES-64BIT-SLOT1 kernel: [ 7362.223601] TIPC:
> Established link <1.1.1:eth0-1.1.3:eth0> on network plane A
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafamfnd[2526]: NO Assigning
> 'safSi=SC-2N,safApp=OpenSAF' ACTIVE to
> 'safSu=SC-1,safSg=2N,safApp=OpenSAF'
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: WA IMMD not
> re-electing coord for switch-over (si-swap) coord at (2010f)
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafntfimcnd[6497]: NO exiting on
> signal 15
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Implementer
> connected: 41 (safMsgGrpService) <313, 2010f>
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Implementer
> connected: 42 (safLogService) <6, 2010f>
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Implementer
> connected: 43 (safCheckPointService) <317, 2010f>
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Implementer
> disconnected 38 <702, 2010f> (@OpenSafImmReplicatorA)
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Backup create cmd =
> /usr/lib64/opensaf/smf-backup-create
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Bundle check cmd =
> /usr/lib64/opensaf/smf-bundle-check
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Node check cmd =
> /usr/lib64/opensaf/smf-node-check
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO SMF repository
> check cmd = /usr/lib64/opensaf/smf-repository-check
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Cluster reboot cmd
> = /usr/lib64/opensaf/smf-cluster-reboot
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Admin Op Timeout =
> 600000000000
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Cli Timeout =
> 600000000000
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO Reboot Timeout =
> 600000000000
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO SMF will use the
> STEP standard set of actions.
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Implementer
> connected: 44 (safLckService) <315, 2010f>
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafsmfd[2545]: NO DN for si_swap
> operation = safSi=SC-2N,safApp=OpenSAF
> ..........
>
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: WA Got error on non
> local rt object update err: 6
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: WA Failed RtObject
> update has to abort sync
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO NODE STATE->
> IMM_NODE_FULLY_AVAILABLE (2484)
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Epoch set to 36 in
> ImmModel
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: NO ACT: New Epoch for
> IMMND process at node 2010f old epoch: 35 new epoch:36
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO Coord broadcasting
> ABORT_SYNC, epoch:36
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmpbed: NO Update epoch 36
> committing with ccbId:10000000e/4294967310
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: WA Global ABORT SYNC
> received for epoch 36
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: WA Successfully
> aborted sync. Epoch:36
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: NO ACT: New Epoch for
> IMMND process at node 2040f old epoch: 35 new epoch:36
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: NO ACT: New Epoch for
> IMMND process at node 2020f old epoch: 35 new epoch:36
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmd[2431]: NO Node 2030f request
> sync sync-pid:5629 epoch:0
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmloadd: logtrace: trace enabled
> to file /var/log/opensaf/osafimmnd, mask=0xffffffff
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmloadd: NO Sync starting
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: NO SERVER STATE:
> IMM_SERVER_SYNC_SERVER --> IMM SERVER READY
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmnd[2441]: ImmModel.cc:10908:
> nextSyncResult: Assertion '!(obj->mObjFlags & IMM_RT_UPDATE_LOCK)' failed.
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafimmpbed: WA PBE lost contact with
> parent IMMND - Exiting
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafamfnd[2526]: NO
> 'safSu=SC-1,safSg=NoRed,safApp=OpenSAF' component restart probation
> timer started (timeout: 60000000000 ns)
> Apr 1 12:38:40 SLES-64BIT-SLOT1 osafntfimcnd[6520]: NO
> saImmOiDispatch() Fail SA_AIS_ERR_BAD_HANDLE (9)
>
> CLM initialize returned ERR_TRY_AGAIN and node went for reboot:
>
> Apr 1 12:38:55 SLES-64BIT-SLOT1 osafamfd[2516]: NO Switching StandBy
> --> Active State
> Apr 1 12:39:01 SLES-64BIT-SLOT1 osafclmd[2497]: ER saImmOiInitialize_2
> failed 6, exiting
> Apr 1 12:39:01 SLES-64BIT-SLOT1 osafamfnd[2526]: NO
> 'safComp=CLM,safSu=SC-1,safSg=2N,safApp=OpenSAF' faulted due to
> 'avaDown' : Recovery is 'nodeFailfast'
> Apr 1 12:39:01 SLES-64BIT-SLOT1 osafamfnd[2526]: ER
> safComp=CLM,safSu=SC-1,safSg=2N,safApp=OpenSAF Faulted due to:avaDown
> Recovery is:nodeFailfast
> Apr 1 12:39:01 SLES-64BIT-SLOT1 osafamfnd[2526]: Rebooting OpenSAF
> NodeId = 131343 EE Name = , Reason: Component faulted: recovery is
> node failfast, OwnNodeId = 131343, SupervisionTime = 60
> Apr 1 12:39:01 SLES-64BIT-SLOT1 opensaf_reboot: Rebooting local node;
> timeout=60
>
> syslog and immnd traces of the controllers are attached.
>
---
** [tickets:#1295] IMM: ImmModel.cc:10908: nextSyncResult: Assertion
'!(obj->mObjFlags & IMM_RT_UPDATE_LOCK)' failed**
**Status:** accepted
**Milestone:** 4.4.2
**Created:** Wed Apr 01, 2015 09:45 AM UTC by Sirisha Alla
**Last Updated:** Wed Apr 08, 2015 09:04 AM UTC
**Owner:** Neelakanta Reddy