Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2017-12-15 Thread Jan Pokorný
On 20/05/16 17:04 +0100, Adam Spiers wrote: > Klaus Wenninger wrote: >> On 05/20/2016 08:39 AM, Ulrich Windl wrote: >>> I think RAs should not rely on "stop" being called multiple times >>> for a resource to be stopped. > > Well, this would be a major architectural change. Currently if > stop fa

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-25 Thread Adam Spiers
Ken Gaillot wrote: > On 06/24/2016 05:41 AM, Adam Spiers wrote: > > Andrew Beekhof wrote: > >> On Fri, Jun 24, 2016 at 1:01 AM, Adam Spiers wrote: > >>> Andrew Beekhof wrote: > > Earlier in this thread I proposed > > the idea of a tiny temporary file in /run which tracks the last known

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-24 Thread Ken Gaillot
On 06/24/2016 05:41 AM, Adam Spiers wrote: > Andrew Beekhof wrote: >> On Fri, Jun 24, 2016 at 1:01 AM, Adam Spiers wrote: >>> Andrew Beekhof wrote: >> > Well, if you're OK with bending the rules like this then that's good > enough for me to say we should at least try it :) I st

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-24 Thread Adam Spiers
Andrew Beekhof wrote: > On Fri, Jun 24, 2016 at 1:01 AM, Adam Spiers wrote: > > Andrew Beekhof wrote: > > >> > Well, if you're OK with bending the rules like this then that's good > >> > enough for me to say we should at least try it :) > >> > >> I still say you shouldn't only do it on error. >

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-23 Thread Andrew Beekhof
On Fri, Jun 24, 2016 at 1:01 AM, Adam Spiers wrote: > Andrew Beekhof wrote: >> > Well, if you're OK with bending the rules like this then that's good >> > enough for me to say we should at least try it :) >> >> I still say you shouldn't only do it on error. > > When else should it be done? I wa

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-23 Thread Andrew Beekhof
On Fri, Jun 24, 2016 at 1:26 AM, Adam Spiers wrote: > Adam Spiers wrote: >> As per the FIXME, one remaining problem is dealing with this kind of >> scenario: >> >> - Cloud operator notices SMART warnings on the compute node >> which is not yet causing hard failures but signifies that the >>

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-23 Thread Adam Spiers
Adam Spiers wrote: > As per the FIXME, one remaining problem is dealing with this kind of > scenario: > > - Cloud operator notices SMART warnings on the compute node > which is not yet causing hard failures but signifies that the > hard disk might die soon. > > - Operator manually ru

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-23 Thread Adam Spiers
Andrew Beekhof wrote: > On Wed, Jun 15, 2016 at 10:42 PM, Adam Spiers wrote: > > Andrew Beekhof wrote: > >> On Mon, Jun 13, 2016 at 9:34 PM, Adam Spiers wrote: > >> > Andrew Beekhof wrote: > >> >> On Wed, Jun 8, 2016 at 6:23 PM, Adam Spiers wrote: > >> >> > Andrew Beekhof wrote: > >> >> >> O

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-15 Thread Andrew Beekhof
On Wed, Jun 15, 2016 at 10:42 PM, Adam Spiers wrote: > Andrew Beekhof wrote: >> On Mon, Jun 13, 2016 at 9:34 PM, Adam Spiers wrote: >> > Andrew Beekhof wrote: >> >> On Wed, Jun 8, 2016 at 6:23 PM, Adam Spiers wrote: >> >> > Andrew Beekhof wrote: >> >> >> On Wed, Jun 8, 2016 at 12:11 AM, Adam

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-15 Thread Adam Spiers
Andrew Beekhof wrote: > On Mon, Jun 13, 2016 at 9:34 PM, Adam Spiers wrote: > > Andrew Beekhof wrote: > >> On Wed, Jun 8, 2016 at 6:23 PM, Adam Spiers wrote: > >> > Andrew Beekhof wrote: > >> >> On Wed, Jun 8, 2016 at 12:11 AM, Adam Spiers wrote: > >> >> > We would also need to ensure that se

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-14 Thread Andrew Beekhof
On Mon, Jun 13, 2016 at 9:34 PM, Adam Spiers wrote: > Andrew Beekhof wrote: >> On Wed, Jun 8, 2016 at 6:23 PM, Adam Spiers wrote: >> > Andrew Beekhof wrote: >> >> On Wed, Jun 8, 2016 at 12:11 AM, Adam Spiers wrote: >> >> > Ken Gaillot wrote: >> >> >> On 06/06/2016 05:45 PM, Adam Spiers wrote:

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-13 Thread Adam Spiers
Andrew Beekhof wrote: > On Wed, Jun 8, 2016 at 6:23 PM, Adam Spiers wrote: > > Andrew Beekhof wrote: > >> On Wed, Jun 8, 2016 at 12:11 AM, Adam Spiers wrote: > >> > Ken Gaillot wrote: > >> >> On 06/06/2016 05:45 PM, Adam Spiers wrote: > >> >> > Maybe your point was that if the expected start n

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-09 Thread Andrew Beekhof
On Wed, Jun 8, 2016 at 6:23 PM, Adam Spiers wrote: > Andrew Beekhof wrote: >> On Wed, Jun 8, 2016 at 12:11 AM, Adam Spiers wrote: >> > Ken Gaillot wrote: >> >> On 06/06/2016 05:45 PM, Adam Spiers wrote: >> >> > Adam Spiers wrote: >> >> >> Andrew Beekhof wrote: >> >> >>> On Tue, Jun 7, 2016 at

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-08 Thread Adam Spiers
Andrew Beekhof wrote: > On Wed, Jun 8, 2016 at 12:11 AM, Adam Spiers wrote: > > Ken Gaillot wrote: > >> On 06/06/2016 05:45 PM, Adam Spiers wrote: > >> > Adam Spiers wrote: > >> >> Andrew Beekhof wrote: > >> >>> On Tue, Jun 7, 2016 at 8:29 AM, Adam Spiers wrote: > >> Ken Gaillot wrote:

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-07 Thread Andrew Beekhof
On Wed, Jun 8, 2016 at 10:29 AM, Andrew Beekhof wrote: > On Wed, Jun 8, 2016 at 12:11 AM, Adam Spiers wrote: >> Ken Gaillot wrote: >>> On 06/06/2016 05:45 PM, Adam Spiers wrote: >>> > Adam Spiers wrote: >>> >> Andrew Beekhof wrote: >>> >>> On Tue, Jun 7, 2016 at 8:29 AM, Adam Spiers wrote: >>

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-07 Thread Andrew Beekhof
On Wed, Jun 8, 2016 at 12:11 AM, Adam Spiers wrote: > Ken Gaillot wrote: >> On 06/06/2016 05:45 PM, Adam Spiers wrote: >> > Adam Spiers wrote: >> >> Andrew Beekhof wrote: >> >>> On Tue, Jun 7, 2016 at 8:29 AM, Adam Spiers wrote: >> Ken Gaillot wrote: >> > My main question is how usef

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-07 Thread Adam Spiers
Ken Gaillot wrote: > On 06/06/2016 05:45 PM, Adam Spiers wrote: > > Adam Spiers wrote: > >> Andrew Beekhof wrote: > >>> On Tue, Jun 7, 2016 at 8:29 AM, Adam Spiers wrote: > Ken Gaillot wrote: > > My main question is how useful would it actually be in the proposed use > > cases. Co

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Vladislav Bogdanov
07.06.2016 02:20, Ken Gaillot wrote: On 06/06/2016 03:30 PM, Vladislav Bogdanov wrote: 06.06.2016 22:43, Ken Gaillot wrote: On 06/06/2016 12:25 PM, Vladislav Bogdanov wrote: 06.06.2016 19:39, Ken Gaillot wrote: On 06/05/2016 07:27 PM, Andrew Beekhof wrote: On Sat, Jun 4, 2016 at 12:16 AM, Ke

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Andrew Beekhof
On Tue, Jun 7, 2016 at 9:07 AM, Ken Gaillot wrote: > On 06/06/2016 05:45 PM, Adam Spiers wrote: >> Adam Spiers wrote: >>> Andrew Beekhof wrote: On Tue, Jun 7, 2016 at 8:29 AM, Adam Spiers wrote: > Ken Gaillot wrote: >> My main question is how useful would it actually be in the pro

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Andrew Beekhof
On Tue, Jun 7, 2016 at 8:45 AM, Adam Spiers wrote: > Adam Spiers wrote: >> Andrew Beekhof wrote: >> > On Tue, Jun 7, 2016 at 8:29 AM, Adam Spiers wrote: >> > > Ken Gaillot wrote: >> > >> My main question is how useful would it actually be in the proposed use >> > >> cases. Considering the poss

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Ken Gaillot
On 06/06/2016 03:30 PM, Vladislav Bogdanov wrote: > 06.06.2016 22:43, Ken Gaillot wrote: >> On 06/06/2016 12:25 PM, Vladislav Bogdanov wrote: >>> 06.06.2016 19:39, Ken Gaillot wrote: On 06/05/2016 07:27 PM, Andrew Beekhof wrote: > On Sat, Jun 4, 2016 at 12:16 AM, Ken Gaillot > wrote:

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Ken Gaillot
On 06/06/2016 05:45 PM, Adam Spiers wrote: > Adam Spiers wrote: >> Andrew Beekhof wrote: >>> On Tue, Jun 7, 2016 at 8:29 AM, Adam Spiers wrote: Ken Gaillot wrote: > My main question is how useful would it actually be in the proposed use > cases. Considering the possibility that the

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Adam Spiers
Adam Spiers wrote: > Andrew Beekhof wrote: > > On Tue, Jun 7, 2016 at 8:29 AM, Adam Spiers wrote: > > > Ken Gaillot wrote: > > >> My main question is how useful would it actually be in the proposed use > > >> cases. Considering the possibility that the expected start might never > > >> happen (

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Adam Spiers
Andrew Beekhof wrote: > On Tue, Jun 7, 2016 at 8:29 AM, Adam Spiers wrote: > > Ken Gaillot wrote: > >> On 06/02/2016 08:01 PM, Andrew Beekhof wrote: > >> > On Fri, May 20, 2016 at 1:53 AM, Ken Gaillot wrote: > >> >> A recent thread discussed a proposed new feature, a new environment > >> >> var

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Andrew Beekhof
On Tue, Jun 7, 2016 at 8:29 AM, Adam Spiers wrote: > Ken Gaillot wrote: >> On 06/02/2016 08:01 PM, Andrew Beekhof wrote: >> > On Fri, May 20, 2016 at 1:53 AM, Ken Gaillot wrote: >> >> A recent thread discussed a proposed new feature, a new environment >> >> variable that would be passed to resou

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Adam Spiers
Ken Gaillot wrote: > On 06/02/2016 08:01 PM, Andrew Beekhof wrote: > > On Fri, May 20, 2016 at 1:53 AM, Ken Gaillot wrote: > >> A recent thread discussed a proposed new feature, a new environment > >> variable that would be passed to resource agents, indicating whether a > >> stop action was part

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Vladislav Bogdanov
06.06.2016 22:43, Ken Gaillot wrote: On 06/06/2016 12:25 PM, Vladislav Bogdanov wrote: 06.06.2016 19:39, Ken Gaillot wrote: On 06/05/2016 07:27 PM, Andrew Beekhof wrote: On Sat, Jun 4, 2016 at 12:16 AM, Ken Gaillot wrote: On 06/02/2016 08:01 PM, Andrew Beekhof wrote: On Fri, May 20, 2016 at

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Ken Gaillot
On 06/06/2016 12:25 PM, Vladislav Bogdanov wrote: > 06.06.2016 19:39, Ken Gaillot wrote: >> On 06/05/2016 07:27 PM, Andrew Beekhof wrote: >>> On Sat, Jun 4, 2016 at 12:16 AM, Ken Gaillot >>> wrote: On 06/02/2016 08:01 PM, Andrew Beekhof wrote: > On Fri, May 20, 2016 at 1:53 AM, Ken Gaillo

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Vladislav Bogdanov
06.06.2016 19:39, Ken Gaillot wrote: On 06/05/2016 07:27 PM, Andrew Beekhof wrote: On Sat, Jun 4, 2016 at 12:16 AM, Ken Gaillot wrote: On 06/02/2016 08:01 PM, Andrew Beekhof wrote: On Fri, May 20, 2016 at 1:53 AM, Ken Gaillot wrote: A recent thread discussed a proposed new feature, a new en

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-06 Thread Ken Gaillot
On 06/05/2016 07:27 PM, Andrew Beekhof wrote: > On Sat, Jun 4, 2016 at 12:16 AM, Ken Gaillot wrote: >> On 06/02/2016 08:01 PM, Andrew Beekhof wrote: >>> On Fri, May 20, 2016 at 1:53 AM, Ken Gaillot wrote: A recent thread discussed a proposed new feature, a new environment variable that

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-05 Thread Andrew Beekhof
On Sat, Jun 4, 2016 at 12:16 AM, Ken Gaillot wrote: > On 06/02/2016 08:01 PM, Andrew Beekhof wrote: >> On Fri, May 20, 2016 at 1:53 AM, Ken Gaillot wrote: >>> A recent thread discussed a proposed new feature, a new environment >>> variable that would be passed to resource agents, indicating wheth

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-03 Thread Ken Gaillot
On 06/02/2016 08:01 PM, Andrew Beekhof wrote: > On Fri, May 20, 2016 at 1:53 AM, Ken Gaillot wrote: >> A recent thread discussed a proposed new feature, a new environment >> variable that would be passed to resource agents, indicating whether a >> stop action was part of a recovery. >> >> Since th

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-06-02 Thread Andrew Beekhof
On Fri, May 20, 2016 at 1:53 AM, Ken Gaillot wrote: > A recent thread discussed a proposed new feature, a new environment > variable that would be passed to resource agents, indicating whether a > stop action was part of a recovery. > > Since that thread was long and covered a lot of topics, I'm s

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-05-20 Thread Adam Spiers
Ken Gaillot wrote: > A recent thread discussed a proposed new feature, a new environment > variable that would be passed to resource agents, indicating whether a > stop action was part of a recovery. > > Since that thread was long and covered a lot of topics, I'm starting a > new one to focus on

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-05-19 Thread Jehan-Guillaume de Rorthais
Le Thu, 19 May 2016 13:15:20 -0500, Ken Gaillot a écrit : > On 05/19/2016 11:43 AM, Jehan-Guillaume de Rorthais wrote: >> Le Thu, 19 May 2016 10:53:31 -0500, >> Ken Gaillot a écrit : >> >>> A recent thread discussed a proposed new feature, a new environment >>> variable that would be passed to

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-05-19 Thread Ken Gaillot
On 05/19/2016 11:43 AM, Jehan-Guillaume de Rorthais wrote: > Le Thu, 19 May 2016 10:53:31 -0500, > Ken Gaillot a écrit : > >> A recent thread discussed a proposed new feature, a new environment >> variable that would be passed to resource agents, indicating whether a >> stop action was part of a

Re: [ClusterLabs] Informing RAs about recovery: failed resource recovery, or any start-stop cycle?

2016-05-19 Thread Jehan-Guillaume de Rorthais
Le Thu, 19 May 2016 10:53:31 -0500, Ken Gaillot a écrit : > A recent thread discussed a proposed new feature, a new environment > variable that would be passed to resource agents, indicating whether a > stop action was part of a recovery. > > Since that thread was long and covered a lot of topic