Re: [Pacemaker] Order of resources in a group and crm_diff

2014-02-18 Thread Vladislav Bogdanov
29.01.2014 08:44, Andrew Beekhof wrote: ... Thats a known deficiency in the v1 diff format (and why we need costly digests to detect ordering changes). Happily .12 will have a new and improve diff format that will handle this correctly. Does your recent cib-performance rewrite address

Re: [Pacemaker] Manual resource reload

2014-02-18 Thread Vladislav Bogdanov
17.02.2014 04:11, Andrew Beekhof wrote: On 11 Feb 2014, at 2:49 am, Vladislav Bogdanov bub...@hoster-ok.com wrote: Hi, cannot find anywhere (am I blind?), is it possible to manually inject 'reload' op for a given resource? Background for this is if some configuration files are edited,

Re: [Pacemaker] What is the reason which the node in which failure has not occurred carries out lost?

2014-02-18 Thread Vladislav Bogdanov
18.02.2014 03:49, Andrew Beekhof wrote: On 31 Jan 2014, at 6:20 pm, yusuke iida yusk.i...@gmail.com wrote: Hi, all I measure the performance of Pacemaker in the following combinations. Pacemaker-1.1.11.rc1 libqb-0.16.0 corosync-2.3.2 All nodes are KVM virtual machines. stopped the

Re: [Pacemaker] Order of resources in a group and crm_diff

2014-02-18 Thread Andrew Beekhof
On 18 Feb 2014, at 7:25 pm, Vladislav Bogdanov bub...@hoster-ok.com wrote: 29.01.2014 08:44, Andrew Beekhof wrote: ... Thats a known deficiency in the v1 diff format (and why we need costly digests to detect ordering changes). Happily .12 will have a new and improve diff format that will

Re: [Pacemaker] What is the reason which the node in which failure has not occurred carries out lost?

2014-02-18 Thread Andrew Beekhof
On 18 Feb 2014, at 7:40 pm, Vladislav Bogdanov bub...@hoster-ok.com wrote: 18.02.2014 03:49, Andrew Beekhof wrote: On 31 Jan 2014, at 6:20 pm, yusuke iida yusk.i...@gmail.com wrote: Hi, all I measure the performance of Pacemaker in the following combinations. Pacemaker-1.1.11.rc1

Re: [Pacemaker] What is the reason which the node in which failure has not occurred carries out lost?

2014-02-18 Thread Andrew Beekhof
On 18 Feb 2014, at 8:18 pm, Andrew Beekhof and...@beekhof.net wrote: On 18 Feb 2014, at 7:40 pm, Vladislav Bogdanov bub...@hoster-ok.com wrote: 18.02.2014 03:49, Andrew Beekhof wrote: On 31 Jan 2014, at 6:20 pm, yusuke iida yusk.i...@gmail.com wrote: Hi, all I measure the

Re: [Pacemaker] node1 fencing itself after node2 being fenced

2014-02-18 Thread Asgaroth
The 3rd node should (and needs to be) fenced at this point to allow the cluster to continue. Is this not happening? The fencing operation appears to complete successfully, here is the sequence: [1] All 3 nodes running properly [2] On node 3 I run echo c /proc/sysrq-trigger which hangs

Re: [Pacemaker] hangs pending

2014-02-18 Thread Andrey Groshev
Hi, ALL and Andrew! Today is a good day - I killed a lot, and a lot of shooting at me. In general - I am happy (almost like an elephant) :) Except resources on the node are important to me eight processes: corosync,pacemakerd,cib,stonithd,lrmd,attrd,pengine,crmd. I killed them with different

Re: [Pacemaker] What is the reason which the node in which failure has not occurred carries out lost?

2014-02-18 Thread yusuke iida
Hi, Andrew and Digimer Thank you for the comment. I solved with reference to other mailing list about this problem. https://bugzilla.redhat.com/show_bug.cgi?id=880035 It seems that the kernel of my environment was old when said from the conclusion. It updated to the newest kernel now.

Re: [Pacemaker] node1 fencing itself after node2 being fenced

2014-02-18 Thread Nikita Staroverov
18.02.2014 14:12, Asgaroth пишет: The 3rd node should (and needs to be) fenced at this point to allow the cluster to continue. Is this not happening? The fencing operation appears to complete successfully, here is the sequence: [1] All 3 nodes running properly [2] On node 3 I run echo c

Re: [Pacemaker] hangs pending

2014-02-18 Thread Andrey Groshev
Hi, ALL and Andrew! Today is a good day - I killed a lot, and a lot of shooting at me. In general - I am happy (almost like an elephant)   :) Except resources on the node are important to me eight processes: corosync,pacemakerd,cib,stonithd,lrmd,attrd,pengine,crmd. I killed them with different

Re: [Pacemaker] Possible error in RA invocation

2014-02-18 Thread David Vossel
- Original Message - From: Santiago Pérez santiago.pe...@entertainment-solutions.eu To: pacemaker@oss.clusterlabs.org Sent: Thursday, January 30, 2014 1:50:41 PM Subject: [Pacemaker] Possible error in RA invocation Hi everyone, I am running a two-node cluster which hosts two

Re: [Pacemaker] [Gluster-users] Pacemaker and GlusterFS

2014-02-18 Thread David Vossel
- Original Message - From: Jefferson Carlos Machado lista.li...@results.com.br To: The Pacemaker cluster resource manager pacemaker@oss.clusterlabs.org Sent: Tuesday, February 11, 2014 7:03:50 AM Subject: Re: [Pacemaker] [Gluster-users] Pacemaker and GlusterFS Hi Vossel, I

Re: [Pacemaker] [Problem] Fail-over is delayed.(State transition is not calculated.)

2014-02-18 Thread David Vossel
- Original Message - From: renayama19661...@ybb.ne.jp To: PaceMaker-ML pacemaker@oss.clusterlabs.org Sent: Monday, February 17, 2014 7:06:53 PM Subject: [Pacemaker] [Problem] Fail-over is delayed.(State transition is not calculated.) Hi All, I confirmed movement at the time

Re: [Pacemaker] node1 fencing itself after node2 being fenced

2014-02-18 Thread Asgaroth
i sometimes have the same situation. sleep ~30 seconds between startup cman and clvmd helps a lot. Thanks for the tip, I just tried this (added sleep 30 in the start section of case statement in cman script, but this did not resolve the issue for me), for some reason clvmd just refuses to

Re: [Pacemaker] node1 fencing itself after node2 being fenced

2014-02-18 Thread Vladislav Bogdanov
18.02.2014 19:49, Asgaroth wrote: i sometimes have the same situation. sleep ~30 seconds between startup cman and clvmd helps a lot. Thanks for the tip, I just tried this (added sleep 30 in the start section of case statement in cman script, but this did not resolve the issue for me), for

Re: [Pacemaker] node1 fencing itself after node2 being fenced

2014-02-18 Thread David Vossel
- Original Message - From: Vladislav Bogdanov bub...@hoster-ok.com To: pacemaker@oss.clusterlabs.org Sent: Tuesday, February 18, 2014 1:02:09 PM Subject: Re: [Pacemaker] node1 fencing itself after node2 being fenced 18.02.2014 19:49, Asgaroth wrote: i sometimes have the same

Re: [Pacemaker] node1 fencing itself after node2 being fenced

2014-02-18 Thread Vladislav Bogdanov
18.02.2014 23:01, David Vossel wrote: - Original Message - From: Vladislav Bogdanov bub...@hoster-ok.com To: pacemaker@oss.clusterlabs.org Sent: Tuesday, February 18, 2014 1:02:09 PM Subject: Re: [Pacemaker] node1 fencing itself after node2 being fenced 18.02.2014 19:49,

Re: [Pacemaker] node1 fencing itself after node2 being fenced

2014-02-18 Thread Asgaroth
Just a guess. Do you have startup fencing enabled in dlm-controld (I actually do not remember if it is applicable to cman's version, but it exists in dlm-4) or cman? If yes, then that may play its evil game, because imho it is not intended to use with pacemaker which has its own startup

Re: [Pacemaker] About the difference in handling of sequential.

2014-02-18 Thread Kristoffer Grönlund
Hi everyone, On Mon, 17 Feb 2014 10:54:29 +0900 (JST) renayama19661...@ybb.ne.jp wrote: Hi Andrew, I found your correction. https://github.com/beekhof/pacemaker/commit/37ff51a0edba208e6240e812936717fffc941a41 Many Thanks! Hideo Yamauchi. --- On Wed, 2014/2/12,

Re: [Pacemaker] [Problem] Fail-over is delayed.(State transition is not calculated.)

2014-02-18 Thread renayama19661014
Hi David, Thank you for comments. You have resource-stickiness=INFINITY, this is what is preventing the failover from occurring. Set resource-stickiness=1 or 0 and the failover should occur. However, the resource moves by a calculation of the next state transition. By a calculation of

Re: [Pacemaker] About the difference in handling of sequential.

2014-02-18 Thread Andrew Beekhof
On 19 Feb 2014, at 10:48 am, Kristoffer Grönlund kgronl...@suse.com wrote: Hi everyone, On Mon, 17 Feb 2014 10:54:29 +0900 (JST) renayama19661...@ybb.ne.jp wrote: Hi Andrew, I found your correction.

Re: [Pacemaker] About the difference in handling of sequential.

2014-02-18 Thread Kristoffer Grönlund
On Wed, 19 Feb 2014 11:57:29 +1100 Andrew Beekhof and...@beekhof.net wrote: It appears Yan did this on purpose. The reason would likely be that this set is for use in a location constraint (not ordering or colocation). And in particular, it is creating a fake set for resources using the same

Re: [Pacemaker] hangs pending

2014-02-18 Thread Andrew Beekhof
On 18 Feb 2014, at 11:05 pm, Andrey Groshev gre...@yandex.ru wrote: Hi, ALL and Andrew! Today is a good day - I killed a lot, and a lot of shooting at me. In general - I am happy (almost like an elephant) :) Except resources on the node are important to me eight processes:

Re: [Pacemaker] hangs pending

2014-02-18 Thread Andrew Beekhof
On 18 Feb 2014, at 9:29 pm, Andrey Groshev gre...@yandex.ru wrote: Hi, ALL and Andrew! Today is a good day - I killed a lot, and a lot of shooting at me. In general - I am happy (almost like an elephant) :) Except resources on the node are important to me eight processes:

Re: [Pacemaker] node1 fencing itself after node2 being fenced

2014-02-18 Thread Andrew Beekhof
On 19 Feb 2014, at 7:16 am, Vladislav Bogdanov bub...@hoster-ok.com wrote: 18.02.2014 23:01, David Vossel wrote: - Original Message - From: Vladislav Bogdanov bub...@hoster-ok.com To: pacemaker@oss.clusterlabs.org Sent: Tuesday, February 18, 2014 1:02:09 PM Subject: Re:

Re: [Pacemaker] node1 fencing itself after node2 being fenced

2014-02-18 Thread Andrew Beekhof
On 18 Feb 2014, at 9:12 pm, Asgaroth li...@blueface.com wrote: The 3rd node should (and needs to be) fenced at this point to allow the cluster to continue. Is this not happening? The fencing operation appears to complete successfully, here is the sequence: [1] All 3 nodes running

Re: [Pacemaker] [Problem] Fail-over is delayed.(State transition is not calculated.)

2014-02-18 Thread renayama19661014
Hi Andrew, I'll follow up on the bug. Thanks! Hideo Yamauch. --- On Wed, 2014/2/19, Andrew Beekhof and...@beekhof.net wrote: I'll follow up on the bug. On 19 Feb 2014, at 10:55 am, renayama19661...@ybb.ne.jp wrote: Hi David, Thank you for comments. You have

Re: [Pacemaker] [Patch]Information of Connectivity is lost is not displayed

2014-02-18 Thread renayama19661014
Hi Andrew, Thank you for comments. So I'm confused as to what the problem is. What are you expecting crm_mon to show? I wish it is displayed as follows. * Node srv01: + default_ping_set : 0 : Connectivity is lost Best Regards, Hideo Yamauchi. --- On Wed,

Re: [Pacemaker] hangs pending

2014-02-18 Thread Andrew Beekhof
On 19 Feb 2014, at 4:00 pm, Andrey Groshev gre...@yandex.ru wrote: 19.02.2014, 06:48, Andrew Beekhof and...@beekhof.net: On 18 Feb 2014, at 11:05 pm, Andrey Groshev gre...@yandex.ru wrote: Hi, ALL and Andrew! Today is a good day - I killed a lot, and a lot of shooting at me. In

Re: [Pacemaker] hangs pending

2014-02-18 Thread Andrey Groshev
19.02.2014, 09:08, Andrew Beekhof and...@beekhof.net: On 19 Feb 2014, at 4:00 pm, Andrey Groshev gre...@yandex.ru wrote:  19.02.2014, 06:48, Andrew Beekhof and...@beekhof.net:  On 18 Feb 2014, at 11:05 pm, Andrey Groshev gre...@yandex.ru wrote:   Hi, ALL and Andrew!   Today is a good day

Re: [Pacemaker] hangs pending

2014-02-18 Thread Andrew Beekhof
On 19 Feb 2014, at 4:18 pm, Andrey Groshev gre...@yandex.ru wrote: 19.02.2014, 09:08, Andrew Beekhof and...@beekhof.net: On 19 Feb 2014, at 4:00 pm, Andrey Groshev gre...@yandex.ru wrote: 19.02.2014, 06:48, Andrew Beekhof and...@beekhof.net: On 18 Feb 2014, at 11:05 pm, Andrey

Re: [Pacemaker] [Patch]Information of Connectivity is lost is not displayed

2014-02-18 Thread Andrew Beekhof
On 19 Feb 2014, at 2:55 pm, renayama19661...@ybb.ne.jp wrote: Hi Andrew, Thank you for comments. So I'm confused as to what the problem is. What are you expecting crm_mon to show? I wish it is displayed as follows. * Node srv01: + default_ping_set : 0

Re: [Pacemaker] [Patch]Information of Connectivity is lost is not displayed

2014-02-18 Thread renayama19661014
Hi Andrew, I wish it is displayed as follows. * Node srv01:     + default_ping_set                  : 0             : Connectivity is lost Ah!   https://github.com/beekhof/pacemaker/commit/5d51930 It was displayed definitely. Many Thanks! Hideo Yamauchi. --- On Wed, 2014/2/19,

Re: [Pacemaker] hangs pending

2014-02-18 Thread Andrey Groshev
19.02.2014, 09:49, Andrew Beekhof and...@beekhof.net: On 19 Feb 2014, at 4:18 pm, Andrey Groshev gre...@yandex.ru wrote:  19.02.2014, 09:08, Andrew Beekhof and...@beekhof.net:  On 19 Feb 2014, at 4:00 pm, Andrey Groshev gre...@yandex.ru wrote:   19.02.2014, 06:48, Andrew Beekhof