Re: [openstack-dev] [Fuel] HA cluster disk monitoring, failover and recovery

2015-11-17 Thread Bogdan Dobrelya
On 17.11.2015 15:28, Kyrylo Galanov wrote: > Hi Team, Hello > > I have been testing fail-over after free disk space is less than 512 mb. > (https://review.openstack.org/#/c/240951/) > Affected node is stopped correctly and services migrate to a healthy node. > > However, after free disk space

Re: [openstack-dev] [Fuel] HA cluster disk monitoring, failover and recovery

2015-11-17 Thread Alex Schultz
On Tue, Nov 17, 2015 at 9:01 AM, Vladimir Kuklin wrote: > Folks > > Is not it possible for an OCF script to clear this attribute after a > sufficient period of successful monitoring of node health? It could be a > better approach in this case then restarting the node. > So

Re: [openstack-dev] [Fuel] HA cluster disk monitoring, failover and recovery

2015-11-17 Thread Alex Schultz
Hey Kyrylo, On Tue, Nov 17, 2015 at 8:28 AM, Kyrylo Galanov wrote: > Hi Team, > > I have been testing fail-over after free disk space is less than 512 mb. > (https://review.openstack.org/#/c/240951/) > Affected node is stopped correctly and services migrate to a healthy

[openstack-dev] [Fuel] HA cluster disk monitoring, failover and recovery

2015-11-17 Thread Kyrylo Galanov
Hi Team, I have been testing fail-over after free disk space is less than 512 mb. ( https://review.openstack.org/#/c/240951/) Affected node is stopped correctly and services migrate to a healthy node. However, after free disk space is more than 512 mb again the node does not recover it's state

Re: [openstack-dev] [Fuel] HA cluster disk monitoring, failover and recovery

2015-11-17 Thread Vladimir Kuklin
Bogdan I think we should firstly check whether attribute deletion leads to node starting its services or not. From what I read in the official Pacemaker documentation, it should work out of the box without the need to restart the node. And by the way the quote above mentions 'use ONE of the

Re: [openstack-dev] [Fuel] HA cluster disk monitoring, failover and recovery

2015-11-17 Thread Alex Schultz
On Tue, Nov 17, 2015 at 11:12 AM, Vladimir Kuklin wrote: > Bogdan > > I think we should firstly check whether attribute deletion leads to node > starting its services or not. From what I read in the official Pacemaker > documentation, it should work out of the box without

Re: [openstack-dev] [Fuel] HA cluster disk monitoring, failover and recovery

2015-11-17 Thread Andrew Beekhof
> On 18 Nov 2015, at 4:52 AM, Alex Schultz wrote: > > On Tue, Nov 17, 2015 at 11:12 AM, Vladimir Kuklin > wrote: >> Bogdan >> >> I think we should firstly check whether attribute deletion leads to node >> starting its services or not. From what I