I analysed this from the AMF perspective and found that CPND is not calling the
registration API after it starts.
AMF detected CPND going down at:
Apr 6 10:52:00.226409 osafamfnd [3015:err.cc:0317] >> avnd_err_process:
Comp:'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' esc_rcvr:'2'
AMF then instantiated CPND and waited for it to register:
Apr 6 10:52:00.317144 osafamfnd [3015:clc.cc:0325] TR
'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF', command
type:AVND_COMP_CLC_CMD_TYPE_INSTANTIATE(1)
Apr 6 10:52:00.317180 osafamfnd [3015:tmr.cc:0088] TR CLC comp register timer
started
But CPND did not register, and the timer expired:
Apr 6 10:52:00.919377 osafamfnd [3015:cbq.cc:0240] >> avnd_evt_ava_resp_evh
Apr 6 10:52:00.919391 osafamfnd [3015:proxy.cc:0501] TR
safComp=AMFWDOG,safSu=PL-4,safSg=NoRed,safApp=OpenSAF: Type=15
Apr 6 10:52:00.919404 osafamfnd [3015:proxy.cc:0604] >> avnd_int_ext_comp_val:
safComp=AMFWDOG,safSu=PL-4,safSg=NoRed,safApp=OpenSAF
Apr 6 10:52:00.919424 osafamfnd [3015:tmr.cc:0126] TR callback response timer
stopped
Apr 6 10:52:00.919439 osafamfnd [3015:cbq.cc:0526] << avnd_evt_ava_resp_evh
Apr 6 10:52:00.919451 osafamfnd [3015:main.cc:0633] TR Evt Type:32 success
Apr 6 10:52:00.919464 osafamfnd [3015:main.cc:0638] << avnd_evt_process
Apr 6 10:52:10.419794 osafamfnd [3015:main.cc:0610] >> avnd_evt_process
Apr 6 10:52:10.419886 osafamfnd [3015:main.cc:0627] TR Evt type:36
Apr 6 10:52:10.419903 osafamfnd [3015:clc.cc:0484] >>
avnd_evt_tmr_clc_comp_reg_evh
Apr 6 10:52:10.419978 osafamfnd [3015:clc.cc:0494] NO Instantiation of
'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' failed
Apr 6 10:52:10.420011 osafamfnd [3015:clc.cc:0495] NO Reason: component
registration timer expired
You can see above that the other health checks are working fine for AMFWDOG, so
there is no issue with amfnd itself.
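To put a number on how long amfnd waited, here is a small sketch that computes the gap between the registration timer start and the expiry handler from the trace above. The two trace lines are copied from this mail with the wrapped lines rejoined; the helper names (`parse_stamp`, `elapsed_between`) are made up for illustration, and the year 2016 is an assumption taken from the ticket date because the trace itself carries no year.

```python
import re
from datetime import datetime

# Two lines from the osafamfnd trace quoted above (line wraps rejoined).
TRACE = [
    "Apr 6 10:52:00.317180 osafamfnd [3015:tmr.cc:0088] TR CLC comp register timer started",
    "Apr 6 10:52:10.419903 osafamfnd [3015:clc.cc:0484] >> avnd_evt_tmr_clc_comp_reg_evh",
]

# Leading "Mon D HH:MM:SS.ffffff" timestamp of a trace line.
STAMP = re.compile(r"^(\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2}\.\d+)")


def parse_stamp(line, year=2016):
    """Return the timestamp of a trace line (year is assumed, not logged)."""
    match = STAMP.match(line)
    if match is None:
        raise ValueError(f"no timestamp in line: {line!r}")
    return datetime.strptime(f"{year} {match.group(1)}", "%Y %b %d %H:%M:%S.%f")


def elapsed_between(lines, start_marker, end_marker):
    """Seconds between the first line containing start_marker and the
    first later line containing end_marker."""
    start = None
    for line in lines:
        if start is None and start_marker in line:
            start = parse_stamp(line)
        elif start is not None and end_marker in line:
            return (parse_stamp(line) - start).total_seconds()
    raise ValueError("markers not found in order")


registration_wait = elapsed_between(
    TRACE, "CLC comp register timer started", "avnd_evt_tmr_clc_comp_reg_evh"
)
print(f"amfnd waited {registration_wait:.6f} s for CPND to register")
```

With the two lines above this gives roughly 10.1 seconds, which matches the 10 second spacing between the "Started" and "Instantiation ... failed" messages in the syslog.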
Please investigate from the CPND point of view.
The same thing happened twice.
Thanks
-Nagu
---
** [tickets:#1733] Payload got rebooted when cpnd is killed on payload**
**Status:** unassigned
**Milestone:** 4.6.2
**Created:** Wed Apr 06, 2016 11:05 AM UTC by Madhurika Koppula
**Last Updated:** Mon Apr 11, 2016 07:29 AM UTC
**Owner:** nobody
**Attachments:**
- [cpsv.tgz](https://sourceforge.net/p/opensaf/tickets/1733/attachment/cpsv.tgz) (15.0 MB; application/octet-stream)
Setup:
Changeset- 7436
Version - opensaf 5.0
4 nodes configured with single PBE
Issue observed (it occurs randomly):
1) When CPND is killed on the payload, the component restart of CPND fails
because the component registration timer expires.
2) The node then went for a reboot. A test application was running at the time.
Below is the syslog from PL-4:
Apr 6 10:52:00 OEL_M-SLOT-4 osafamfnd[3015]: NO
'safSu=PL-4,safSg=NoRed,safApp=OpenSAF' component restart probation timer
started (timeout: 60000000000 ns)
Apr 6 10:52:00 OEL_M-SLOT-4 osafamfnd[3015]: NO Restarting a component of
'safSu=PL-4,safSg=NoRed,safApp=OpenSAF' (comp restart count: 1)
Apr 6 10:52:00 OEL_M-SLOT-4 osafamfnd[3015]: NO
'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' faulted due to 'avaDown' :
Recovery is 'componentRestart'
Apr 6 10:52:00 OEL_M-SLOT-4 osafckptnd[6263]: Started
Apr 6 10:52:10 OEL_M-SLOT-4 osafamfnd[3015]: NO Instantiation of
'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' failed
Apr 6 10:52:10 OEL_M-SLOT-4 osafamfnd[3015]: NO Reason: component registration
timer expired
Apr 6 10:52:10 OEL_M-SLOT-4 osafckptnd[6294]: Started
Apr 6 10:52:20 OEL_M-SLOT-4 osafamfnd[3015]: NO Instantiation of
'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' failed
Apr 6 10:52:20 OEL_M-SLOT-4 osafamfnd[3015]: NO Reason: component registration
timer expired
Apr 6 10:52:20 OEL_M-SLOT-4 osafamfnd[3015]: WA
'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' Presence State RESTARTING
=> INSTANTIATION_FAILED
Apr 6 10:52:20 OEL_M-SLOT-4 osafamfnd[3015]: NO Component Failover trigerred
for 'safSu=PL-4,safSg=NoRed,safApp=OpenSAF': Failed component:
'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF'
Apr 6 10:52:20 OEL_M-SLOT-4 osafamfnd[3015]: ER
'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF'got Inst failed
Apr 6 10:52:20 OEL_M-SLOT-4 osafamfnd[3015]: Rebooting OpenSAF NodeId = 132111
EE Name = , Reason: NCS component Instantiation failed, OwnNodeId = 132111,
SupervisionTime = 60
Apr 6 10:52:20 OEL_M-SLOT-4 opensaf_reboot: Rebooting local node; timeout=60
Apr 6 10:52:46 OEL_M-SLOT-4 kernel: imklog 5.8.10, log source = /proc/kmsg
started.
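The PL-4 syslog above shows two instantiation attempts before the reboot. As a rough sketch, the following pairs each `osafckptnd ... Started` message with the next `Instantiation of ... failed` message and reports how long each attempt lasted. The log lines are copied from this ticket with wrapped lines rejoined; the function name `failed_attempts` is hypothetical, and the year 2016 is assumed from the ticket date.

```python
import re
from datetime import datetime

# Relevant PL-4 syslog lines from the ticket (line wraps rejoined).
SYSLOG = [
    "Apr 6 10:52:00 OEL_M-SLOT-4 osafckptnd[6263]: Started",
    "Apr 6 10:52:10 OEL_M-SLOT-4 osafamfnd[3015]: NO Instantiation of "
    "'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' failed",
    "Apr 6 10:52:10 OEL_M-SLOT-4 osafckptnd[6294]: Started",
    "Apr 6 10:52:20 OEL_M-SLOT-4 osafamfnd[3015]: NO Instantiation of "
    "'safComp=CPND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' failed",
]

# "<timestamp> <host> <process>[<pid>]: <message>"
LINE = re.compile(r"^(\w{3}\s+\d+\s+[\d:]+)\s+\S+\s+(\w+)\[(\d+)\]:\s+(.*)$")


def failed_attempts(lines, year=2016):
    """Return [(cpnd_pid, seconds_until_failure), ...] for each CPND start
    that is followed by an 'Instantiation of ... failed' message."""
    attempts = []
    pending = None  # (pid, start time) of the most recent CPND start
    for line in lines:
        match = LINE.match(line)
        if match is None:
            continue
        stamp, process, pid, message = match.groups()
        when = datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")
        if process == "osafckptnd" and message == "Started":
            pending = (int(pid), when)
        elif (process == "osafamfnd" and pending is not None
              and "Instantiation of" in message and message.endswith("failed")):
            attempts.append((pending[0], (when - pending[1]).total_seconds()))
            pending = None
    return attempts


attempts = failed_attempts(SYSLOG)
for pid, seconds in attempts:
    print(f"osafckptnd[{pid}] started, instantiation failed after {seconds:.0f} s")
```

Both attempts (PIDs 6263 and 6294) fail after 10 seconds at syslog resolution, so the new CPND process does start each time but never registers with AMF within the timer period.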
3) Below is the syslog from the ACTIVE controller:
Apr 6 10:51:59 OEL_M-SLOT-1 osafimmd[6916]: WA No coordinator IMMND known
(case B) - ignoring sync request
Apr 6 10:51:59 OEL_M-SLOT-1 osafimmd[6916]: NO Node 2040f request sync
sync-pid:2980 epoch:0
Apr 6 10:52:24 OEL_M-SLOT-1 kernel: TIPC: Resetting link
<1.1.1:eth3-1.1.4:eth3>, peer not responding
Apr 6 10:52:24 OEL_M-SLOT-1 kernel: TIPC: Lost link <1.1.1:eth3-1.1.4:eth3> on
network plane A
Apr 6 10:52:24 OEL_M-SLOT-1 kernel: TIPC: Lost contact with <1.1.4>
Apr 6 10:52:24 OEL_M-SLOT-1 osafamfd[7003]: NO Node 'PL-4' left the cluster
Apr 6 10:52:24 OEL_M-SLOT-1 osafclmd[6988]: NO Node 132111 went down. Not
sending track callback for agents on that node
Apr 6 10:52:24 OEL_M-SLOT-1 osafclmd[6988]: NO Node 132111 went down. Not
sending track callback for agents on that node
Apr 6 10:52:24 OEL_M-SLOT-1 osafimmnd[3728]: NO Global discard node received
for nodeId:2040f pid:2980
Apr 6 10:52:24 OEL_M-SLOT-1 osafimmnd[3728]: NO Implementer connected: 1539
(MsgQueueService132111) <12283, 2010f>