Re: [Linux-HA] failcount for master/slave resource

Andrew Beekhof Mon, 21 Apr 2008 05:05:08 -0700

On Mon, Apr 21, 2008 at 11:07 AM, Junko IKEDA <[EMAIL PROTECTED]> wrote:
> Hi,
>
>  I have one master/slave resource.
>  (Heartbeat 2.2.0 + Pacemaker 0.6.2)
>
>  Master/Slave Set: ms-sf
>  stateful-1:0 (ocf::heartbeat:Stateful):Master node-b
>  stateful-1:1 (ocf::heartbeat:Stateful):Started node-a
>
>  If stateful-1:0 fails, crm_mon would show like this;
>
>  Master/Slave Set: ms-sf
>  stateful-1:0 (ocf::heartbeat:Stateful):Stopped
>  stateful-1:1 (ocf::heartbeat:Stateful):Master node-a
>
>  Failed actions:
>     stateful-1:0_demote_0 (node=node-b, call=7, rc=7): complete
>
>  I tried to clear the failcount of stateful-1:0 with crm_failcount.


That doesn't remove the failed operation though... only the counter
which tracks how many times the resource failed.

Perhaps try crm_resource -C

>
>  # crm_failcount -r stateful-1:0 -U node-b -D
>
>  After that, crm_mon says,
>
>  Master/Slave Set: ms-sf
>  stateful-1:0 (ocf::heartbeat:Stateful):Master node-b
>  stateful-1:1 (ocf::heartbeat:Stateful):Slave node-a
>
>  Failed actions:
>     stateful-1:0_demote_0 (node=node-b, call=7, rc=7): complete
>
>  This looks nice.
>  But if stateful-1:0 fails again, something is wrong.
>
>  Master/Slave Set: ms-sf
>  stateful-1:0 (ocf::heartbeat:Stateful):Master node-b FAILED
>  stateful-1:1 (ocf::heartbeat:Stateful):Slave node-a
>
>  Failed actions:
>     stateful-1:0_demote_0 (node=node-b, call=7, rc=7): complete
>     stateful-1:0_monitor_10000 (node=node-b, call=11, rc=7): complete
>
>  Is it expected?
>  How can I rescue stateful-1:0?
>
>  Best Regards,
>  Junko Ikeda
>
>  NTT DATA INTELLILINK CORPORATION
>
>
> _______________________________________________
>  Linux-HA mailing list
>  [email protected]
>  http://lists.linux-ha.org/mailman/listinfo/linux-ha
>  See also: http://linux-ha.org/ReportingProblems
>
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] failcount for master/slave resource

Reply via email to