Ok, precis sådant som jag bett om tidigare :-)

Svårt att reproducera, måste köras en massa ggr => trace är omöjligt.

Jag tror din ide om att aborta vid AMFND för att få coredump är det enda
kostsiktigt möjliga. 

Om trace gick att routa till en RAM intern cyclisk buffer som också
gick att hitta i en coredump vore ännu bättre. 

/AndersBj

-----Original Message-----
From: Hans Feldt [mailto:[email protected]] 
Sent: den 13 september 2013 13:13
To: Suryanarayana Garlapati
Cc: [email protected]
Subject: Re: [devel] [PATCH 1 of 1] amfd: exit at MDS quiesced event in wrong 
state [#516]

amfd does not know shutdown is in progress.

After a suggestion by Mathi I propose "duplicate ACTIVE detected, exiting" 
instead of the "Controller split brain detected, exiting"

/Hans

On 09/13/2013 12:52 PM, Suryanarayana Garlapati wrote:
> In the case where opensaf is shutting down and took more time to 
> shutdown Something like:
>
> +        LOG_CR("While OpenSAF Shutting down, exiting");
>
> and other case where there is tipc link flap:
>
> +        LOG_CR("Controller split brain detected, exiting");
>
>
> On Thursday 05 September 2013 03:09 PM, Hans Feldt wrote:
>> Any suggestion?
>> /Hans
>>
>>> -----Original Message-----
>>> From: Suryanarayana Garlapati 
>>> [mailto:[email protected]]
>>> Sent: den 5 september 2013 11:06
>>> To: praveen malviya
>>> Cc: Hans Feldt; [email protected]
>>> Subject: Re: [devel] [PATCH 1 of 1] amfd: exit at MDS quiesced event 
>>> in wrong state [#516]
>>>
>>> Change the following log accordingly:
>>>
>>> +        LOG_CR("Controller split brain detected, exiting");
>>>
>>> This is always not true. The above is true in Tipc link flap.
>>>
>>>
>>> On Thursday 05 September 2013 02:34 PM, praveen malviya wrote:
>>>> Ack,
>>>>
>>>> Thanks,
>>>> Praveen
>>>> On 16-Aug-13 7:03 PM, Hans Feldt wrote:
>>>>>     osaf/services/saf/avsv/avd/avd_role.cc |  9 +++++++++
>>>>>     1 files changed, 9 insertions(+), 0 deletions(-)
>>>>>
>>>>>
>>>>> MDS can force an active vdest into quiesced state (see docs). 
>>>>> Reasons for this happening is unclear. The logic 
>>>>> avd_mds_qsd_role_evh() can only handle this event in context of a 
>>>>> controller switch-over. Otherwise it could e.g. hang in using IMM which 
>>>>> eventually times out and calls abort() generating a core dump.
>>>>>
>>>>> Instead exit the amfd process when this event happens in non 
>>>>> controller switch-over state. amfnd will failfast reboot the node when it 
>>>>> detects this.
>>>>>
>>>>> diff --git a/osaf/services/saf/avsv/avd/avd_role.cc 
>>>>> b/osaf/services/saf/avsv/avd/avd_role.cc
>>>>> --- a/osaf/services/saf/avsv/avd/avd_role.cc
>>>>> +++ b/osaf/services/saf/avsv/avd/avd_role.cc
>>>>> @@ -569,6 +569,15 @@ void avd_mds_qsd_role_evh(AVD_CL_CB *cb,
>>>>>
>>>>>         TRACE_ENTER();
>>>>>
>>>>> +    /* Only accept this event in controller switch-over state, in other
>>>>> +     * states it is invalid and indicates severe cluster problems.
>>>>> +     */
>>>>> +    if (cb->swap_switch == SA_FALSE) {
>>>>> +        LOG_NO("%s: MDS unexpectedly changed role to QUIESCED", 
>>>>> __FUNCTION__);
>>>>> +        LOG_CR("Controller split brain detected, exiting");
>>>>> +        _exit(EXIT_FAILURE); // should never get here...
>>>>> +    }
>>>>> +
>>>>>         /* Give up IMM OI implementer role */
>>>>>         if ((rc = immutil_saImmOiImplementerClear(cb->immOiHandle)) != 
>>>>> SA_AIS_OK) {
>>>>>             LOG_ER("FAILOVER Active --> Quiesced FAILED, 
>>>>> ImplementerClear failed %u", rc);
>>>> -------------------------------------------------------------------
>>>> ----------- Learn the latest--Visual Studio 2012, SharePoint 2013, 
>>>> SQL 2012, more!
>>>> Discover the easy way to master current and previous Microsoft 
>>>> technologies and advance your career. Get an incredible 1,500+ 
>>>> hours of step-by-step tutorial videos with LearnDevNow. Subscribe today 
>>>> and save!
>>>> http://pubads.g.doubleclick.net/gampad/clk?id=58041391&iu=/4140/ost
>>>> g.clktrk _______________________________________________
>>>> Opensaf-devel mailing list
>>>> [email protected]
>>>> https://lists.sourceforge.net/lists/listinfo/opensaf-devel
>
>
>

------------------------------------------------------------------------------
How ServiceNow helps IT people transform IT departments:
1. Consolidate legacy IT systems to a single system of record for IT 2. 
Standardize and globalize service processes across IT 3. Implement zero-touch 
automation to replace manual, redundant tasks 
http://pubads.g.doubleclick.net/gampad/clk?id=51271111&iu=/4140/ostg.clktrk
_______________________________________________
Opensaf-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-devel

------------------------------------------------------------------------------
How ServiceNow helps IT people transform IT departments:
1. Consolidate legacy IT systems to a single system of record for IT
2. Standardize and globalize service processes across IT
3. Implement zero-touch automation to replace manual, redundant tasks
http://pubads.g.doubleclick.net/gampad/clk?id=51271111&iu=/4140/ostg.clktrk
_______________________________________________
Opensaf-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-devel

Reply via email to