Attached is the patch that pushes notifications as job by making imm_ntf_job as 
common queue for both imm and ntf. It handles TRY_AGAIN and TIMEOUT in the same 
way as it is handled for IMM jobs. Thus AMFD will not get blocked for retrying 
to send notification and also it will not log then as error.
There is a discussion in some tickets that notifications should be sent almost 
at the same time when AMF updates IMM. Patch improves this behaviour but not 
completely.

I think there is already a separate enhacement ticket for moving IMM, NTF, LOG 
related things to  separate thread(s).  So sending notification in separate 
could be done in future a part of that enhancement ticket.


Attachments:

- 
[314_notificaion.patch](https://sourceforge.net/p/opensaf/tickets/_discuss/thread/d49f7047/3035/attachment/314_notificaion.patch)
 (15.7 kB; application/octet-stream)


---

** [tickets:#314] AMF looses alarms and notifications during switch-over**

**Status:** assigned
**Milestone:** future
**Created:** Fri May 24, 2013 08:34 AM UTC by Nagendra Kumar
**Last Updated:** Mon Oct 03, 2016 08:26 AM UTC
**Owner:** Praveen
**Attachments:**

- [messages](https://sourceforge.net/p/opensaf/tickets/314/attachment/messages) 
(41.9 kB; application/octet-stream)
- [osafamfd](https://sourceforge.net/p/opensaf/tickets/314/attachment/osafamfd) 
(5.7 MB; application/octet-stream)


Migrated from http://devel.opensaf.org/ticket/3051

Background: http://devel.opensaf.org/ticket/3028


If another node (payload) leaves the cluster in the middle of switch-over, amfd 
logs this:


Mar 8 10:18:21 SC-1 osafamfd[304]: ER sendStateChangeNotificationAvd: 
saNtfNotificationSend Failed (6)
Mar 8 10:18:21 SC-1 osafamfd[304]: ER sendAlarmNotificationAvd: 
saNtfNotificationSend Failed (6)


These logs means that amfd failed to send an alarm and a notification due to 
TRYAGAIN returned from NTF (in NOACTIVE state)


AMF needs to store the alarms/notifications produced in the NOACTIVE state and 
send them at the end of the switch-over. Or with using a separate thread that 
can block forever (?) on TRYAGAIN.


The problem exist in all opensaf releases





---

Sent from sourceforge.net because [email protected] is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most 
engaging tech sites, SlashDot.org! http://sdm.link/slashdot
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to