commit 412f5da2a9c5cc60a349be6672f13238306c9785 (HEAD -> develop,
origin/develop, ticket-3315)
Author: hieu.h.hoang <[email protected]>
Date: Mon May 2 13:20:08 2022 +0700
imm: Remove dead admin owner [#3315]
While imm is synchronizing, the function to delete admin owner
only changes the state of admin owner to dying instead of deleting
it. If imm synchronizes successful, those admin owners will be
removed. If the synchronization is aborted, those admin owner
won't be deleted until next synchronization. Solution is to
delete all dead admin owners after aborting a synchronization.
---
** [tickets:#3315] imm : Redundancy admin owner after aborting sync**
**Status:** review
**Milestone:** 5.22.04
**Created:** Mon May 02, 2022 03:54 PM UTC by Hieu Hong Hoang
**Last Updated:** Tue May 03, 2022 07:38 AM UTC
**Owner:** Hieu Hong Hoang
While imm is synchronizing, the function to delete admin owner only changes
the state of admin owner to dying instead of deleting it. If imm aborts sync,
those admin owners won't be deleted until next sync. We should clean it after
imm aborted sync.
Syslog:
~~~
2022-05-02 16:55:24.257 SC-1 osafimmnd[425]: NO Announce sync, epoch:17
2022-05-02 16:55:24.258 SC-1 osafimmnd[425]: NO SERVER STATE: IMM_SERVER_READY
--> IMM_SERVER_SYNC_SERVER
2022-05-02 16:55:24.259 SC-1 osafimmnd[425]: NO NODE STATE->
IMM_NODE_R_AVAILABLE
2022-05-02 16:55:24.259 SC-1 osafimmd[411]: NO Successfully announced sync. New
ruling epoch:17
2022-05-02 16:55:24.259 SC-1 osafimmnd[425]: WA Postponing hard delete of admin
owner with id:3942 when imm is not writable state
2022-05-02 16:55:24.260 SC-1 osafimmnd[425]: WA Failed in hard remove of admin
owner 3942
2022-05-02 16:55:24.359 SC-1 osafimmnd[425]: WA Apparent deadlock detected
between IMMND sync and restarting PBE, with Ccbs in critical. Aborting this
sync attempt
2022-05-02 16:55:24.360 SC-1 osafimmnd[425]: NO NODE STATE->
IMM_NODE_FULLY_AVAILABLE (2728)
2022-05-02 16:55:24.360 SC-1 osafimmnd[425]: NO Epoch set to 17 in ImmModel
2022-05-02 16:55:24.360 SC-1 osafimmnd[425]: NO Coord broadcasting ABORT_SYNC,
epoch:17
~~~
Source code:
if (immNotWritable() || is_sync_aborting()) {
if (hard) {
unsigned int siz = (unsigned int)(*i)->mTouchedObjects.size();
if (siz >= IMMSV_MAX_OBJECTS) {
LOG_WA(
"Forcing immediate hard delete of large (%u) admin owner with
id:%u "
"to clear way for sync",
siz, ownerId);
goto forced;
}
if (sImmNodeState > IMM_NODE_UNKNOWN) {
LOG_WA(
"Postponing hard delete of admin owner with id:%u "
"when imm is not writable state",
ownerId);
}
(*i)->mDying = true;
err = SA_AIS_ERR_BUSY;
} else {
err = SA_AIS_ERR_TRY_AGAIN;
}
---
Sent from sourceforge.net because [email protected] is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list._______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets