> What do you want more? I can easily reproduce the problem. I just
> reboot the active controller and this crash happens on the standby.
>
Hans,
I'm unable to reproduce it in my setup. The macro
m_NCS_PSSTS_GET_CLIENTS is in use for along time.
Not sure, if your's is occuring in a different/special-case flow.
So, If we could generate the core when the scandir fails, we can see the
"buff" contents, as in ===> m_NCS_PSSTS_GET_CLIENTS(inst->pssts_api,
ts_hdl, retval, profile_next, &buff), [in pss_reformat(), in
pss_refmt.c]
So, probably we can try to generate the core at (in pss_reformat(), in
pss_rfmt.c) and examine the contents of 'buff'
---------------------------------------
if (table_cnt < 0)
{
m_LOG_PSS_HEADLINE(NCSFL_SEV_ERROR,
PSS_HDLN_SCAN_DIR_FAILED);
final_retval = NCSCC_RC_FAILURE;
goto clean_stale_ps_n_uba; }
---------------------------------------
In the mean time, I shall try other options to see if ur problem is
reproducable at all.
> But there must an error in the pss_reformat function. The loops are
> not terminated correctly or something. Here is my PSSV content:
>
> > SC_2_2# find /var/opt/opensaf/pssv_store/2/current/1/
> > /var/opt/opensaf/pssv_store/2/current/1/
> > /var/opt/opensaf/pssv_store/2/current/1/AVD
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/13
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/15
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/17
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/24
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/26
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/25
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/12
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/11
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/10
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/22
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/23
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/28
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/8
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/6
> > /var/opt/opensaf/pssv_store/2/current/1/AVD/18
> > /var/opt/opensaf/pssv_store/2/current/1/AVM
> > /var/opt/opensaf/pssv_store/2/current/1/AVM/39
>
> Why does it do a scandir on
> '/var/opt/opensaf/pssv_store/2/current/1/1'?
>
> Attached is the PSSV console log.
>
> Thanks,
> Hans
>
>
> Mathivanan Np-G19859 wrote:
> >
> >> The standby PSS reports an error to AMF when it is about
> the become
> >> active during fail-over. I tracked it down to a failed scandir in
> >> psr_rfmt.c#1347. The path is non-existent in the file system.
> >
> > PSR reporting an error just seems to be fine since the data
> > "/var/opt/opensaf/pssv_store/2/current/1/1" is anyway non-existent.
> >
> > But the real problem would be that, in the normal flow,
> while reading
> > from the persistent store, this value
> > ===>"/var/opt/opensaf/pssv_store/2/current/1/1"
> > itself is unexpected/incorrect and should not occur.
> >
> > This could happen if either the persistent-store was tampered
> > with/corrupt(?) OR
> > If a decode failed!
> >
> > We can try generating a core of the 'ncs_psr' process the
> next time we
> > hit this issue.
> > That would help closing-in on to the problem.
> >
> > - Mathi.
> >
> >
> >> Should I open a ticket on this?
> >>
> >> Any ideas?
> >>
> >> Thanks,
> >> Hans
> >>
> >> syslog:
> >>
> >>> Feb 6 15:39:15 SC_2_1 ncs_scap: NCS_AvSv: Card going for reboot
> >>> -safComp=CompT_PSS,safSu=SuT_NCS_CNTLR,safNode=SC_2_1
> >> faulted due to 1
> >>> -rcvr=9
> >> DTS log:
> >>
> >>> NOTICE : 0x0002010f 2625322986 49 1 PSS 06Feb2008_15.39.15.107
> >>> HEADLINE: PSS PWE found: data: 1 NOTICE : 0x0002010f
> >> 2625322986 49 1
> >>> PSS 06Feb2008_15.39.15.107 HEADLINE: PSS PCN found: String: AVM
> >>> NOTICE : 0x0002010f 2625322986 49 1 PSS 06Feb2008_15.39.15.121
> >>> HEADLINE: PSS PCN found: String: AVD NOTICE : 0x0002010f
> >> 2625322986
> >>> 49 1 PSS 06Feb2008_15.39.15.133 HEADLINE: PSS PWE found: data: 1
> >>> NOTICE : 0x0002010f 2625322986 49 1 PSS 06Feb2008_15.39.15.133
> >>> HEADLINE: PSS PCN found: String: 1 ERROR : 0x0002010f
> >> 2625322986 49
> >>> 1 PSS 06Feb2008_15.39.15.133 HEADLINE : Scanning the
> directory for
> >>> Table IDs failed
> >> SC_2_1# cat NID_PSSV.log
> >> <snip>
> >> pwe_id: 1
> >> pcn_cnt: 1
> >> str_len: 2
> >> PCN_PATH: /var/opt/opensaf/pssv_store/2/current/1/1
> >> table_cnt: -1
> >>
> >> SC_2_1# find /var/opt/opensaf/pssv_store/
> >> /var/opt/opensaf/pssv_store/
> >> /var/opt/opensaf/pssv_store/2
> >> /var/opt/opensaf/pssv_store/2/current
> >> /var/opt/opensaf/pssv_store/2/current/1
> >> /var/opt/opensaf/pssv_store/2/current/1/AVM
> >> /var/opt/opensaf/pssv_store/2/current/1/AVM/39
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/13
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/15
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/17
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/24
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/26
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/25
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/12
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/11
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/10
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/22
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/23
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/28
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/8
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/6
> >> /var/opt/opensaf/pssv_store/2/current/1/AVD/18
> >> /var/opt/opensaf/pssv_store/2/current/_ISU
> >> /var/opt/opensaf/pssv_store/2/current/_ISU/1
> >> /var/opt/opensaf/pssv_store/2/current/_ISU/1/AVD
> >> /var/opt/opensaf/pssv_store/2/current/_ISU/1/AVD/13
> >> /var/opt/opensaf/pssv_store/2/current/_ISU/1/AVD/15
> >> /var/opt/opensaf/pssv_store/2/current/_ISU/1/AVD/17
> >> /var/opt/opensaf/pssv_store/2/current/_ISU/1/AVD/24
> >> /var/opt/opensaf/pssv_store/2/current/_ISU/1/AVD/26
> >> /var/opt/opensaf/pssv_store/2/current/_ISU/1/AVD/25
> >> /var/opt/opensaf/pssv_store/2/current/_ISU/1/AVM
> >> /var/opt/opensaf/pssv_store/2/current/_ISU/1/AVM/39
> >> _______________________________________________
> >> Users mailing list
> >> [email protected]
> >> http://list.opensaf.org/maillist/listinfo/users
> >>
> >
> >
> ______________________________________________________________________
> > This email has been scanned by the MessageLabs Email
> Security System.
> > For more information please visit http://www.messagelabs.com/email
> >
> ______________________________________________________________________
> >
>
>
______________________________________________________________________
This email has been scanned by the MessageLabs Email Security System.
For more information please visit http://www.messagelabs.com/email
______________________________________________________________________
_______________________________________________
Users mailing list
[email protected]
http://list.opensaf.org/maillist/listinfo/users