- **Milestone**: future --> never
---
** [tickets:#323] amf: Reboot of the node is not happening when amfnd is
killed.**
**Status:** not-reproducible
**Milestone:** never
**Created:** Fri May 24, 2013 09:14 AM UTC by Praveen
**Last Updated:** Wed Aug 27, 2014 03:13 AM UTC
**Owner:** Nagendra Kumar
Migrated from http://devel.opensaf.org/ticket/2195.
Setup:
RHEL 6.0 VM setup. The transport used is TCP. Testing is done on changeset
2775.n
Scenario:
==========
Observed that when amfnd is killed on the active controller, the node did not
go for reboot.
From /var/log/messages:
=============================================================
Sep 28 00:37:35 SLOT_1 root: killing osafamfnd from invoke_failover.sh
Sep 28 00:37:35 SLOT_1 osafclmd[1528]: AMF Node Director is down, terminate
this process
Sep 28 00:37:35 SLOT_1 app_comp: AMF Node Director is down, terminate this
process
Sep 28 00:37:35 SLOT_1 osafntfd[1517]: AMF Node Director is down, terminate
this process
Sep 28 00:37:35 SLOT_1 osaflogd[1506]: AMF Node Director is down, terminate
this process
Sep 28 00:37:35 SLOT_1 osafrded[1456]: AMF Node Director is down, terminate
this process
Sep 28 00:37:35 SLOT_1 osaffmd[1467]: AMF Node Director is down, terminate
this process
Sep 28 00:37:35 SLOT_1 osafevtd[1750]: AMF Node Director is down, terminate
this process
Sep 28 00:37:35 SLOT_1 osafckptnd[1736]: AMF Node Director is down, terminate
this process
Sep 28 00:37:36 SLOT_1 osafsmfd[1720]: AMF Node Director is down, terminate
this process
Sep 28 00:37:36 SLOT_1 osafmsgnd[1715]: AMF Node Director is down, terminate
this process
Sep 28 00:37:36 SLOT_1 osaflcknd[1691]: AMF Node Director is down, terminate
this process
Sep 28 00:37:36 SLOT_1 osafmsgd[1688]: AMF Node Director is down, terminate
this process
Sep 28 00:37:36 SLOT_1 osafsmfnd[1666]: AMF Node Director is down, terminate
this process
Sep 28 00:37:36 SLOT_1 osafimmd[1477]: AMF Node Director is down, terminate
this process
Sep 28 00:37:36 SLOT_1 osaflckd[1644]: AMF Node Director is down, terminate
this process
Sep 28 00:37:36 SLOT_1 osafimmnd[1488]: AMF Node Director is down, terminate
this process
Sep 28 00:37:36 SLOT_1 osafamfwd[1618]: Rebooting OpenSAF NodeId? = 0 EE Name
= No EE Mapped, Reason: AMF unexpectedly crashed
Sep 28 00:37:40 SLOT_1 osafamfd[1549]: avd_mds_send: failed 2, to
13@2010f0000061a
Sep 28 00:37:40 SLOT_1 osafamfd[1549]: avd_tmr_snd_hb_evh failed to send HB msg
Sep 28 00:37:50 SLOT_1 osafamfd[1549]: avd_mds_send: failed 2, to
13@2010f0000061a
Sep 28 00:37:50 SLOT_1 osafamfd[1549]: avd_tmr_snd_hb_evh failed to send HB msg
Sep 28 00:38:00 SLOT_1 osafamfd[1549]: avd_mds_send: failed 2, to
13@2010f0000061a
Sep 28 00:38:00 SLOT_1 osafamfd[1549]: avd_tmr_snd_hb_evh failed to send HB msg
Sep 28 00:38:10 SLOT_1 osafamfd[1549]: avd_mds_send: failed 2, to
13@2010f0000061a
Sep 28 00:38:10 SLOT_1 osafamfd[1549]: avd_tmr_snd_hb_evh failed to send HB msg
Sep 28 00:38:21 SLOT_1 osafamfd[1549]: avd_mds_send: failed 2, to
13@2010f0000061a
Sep 28 00:38:21 SLOT_1 osafamfd[1549]: avd_tmr_snd_hb_evh failed to send HB msg
Sep 28 00:38:31 SLOT_1 osafamfwd[1618]: TIMEOUT receiving AMF health check
request, generating core for amfnd
Sep 28 00:38:31 SLOT_1 osafamfwd[1618]: Last received healthcheck cnt=51 at
Wed Sep 28 00:37:31 2011
Sep 28 00:38:31 SLOT_1 osafamfwd[1618]: ordering system reboot
==================================================================
Amfnd, Immnd traces are of huge size and hence not attaching them.
Changed 20 months ago by hafe
You are running OpenSAF as the opensaf user and you haven't given the opensaf
user sudo permission for the needed commands.
But agree that we need to log this issue to make it clear that's the problem.
Basically OpenSAF needs better logs in the opensaf_reboot script as well in the
opensaf_reboot function.
Changed 19 months ago by surya
■milestone changed from 4.2.0.GA to 4.2.1
Changed 19 months ago by surya
■status changed from new to closed
■resolution set to duplicate
Duplicate of 2106 TICKET
Changed 18 months ago by manu
■status changed from closed to reopened
■resolutionduplicate deleted
■description modified (diff)
On SLES11 64bit PC setup, with changeset 3065, observed that after killing
amfnd, node is not going for reboot.
Changed 13 months ago by ehsjoar
■milestone changed from 4.2.1 to future_releases
---
Sent from sourceforge.net because [email protected] is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Don't Limit Your Business. Reach for the Cloud.
GigeNET's Cloud Solutions provide you with the tools and support that
you need to offload your IT needs and focus on growing your business.
Configured For All Businesses. Start Your Cloud Today.
https://www.gigenetcloud.com/
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets