Re: [Pacemaker] failcount,start/stop-failure in crm_mon

2013-06-06 Thread Vadym Chepkov
On Jun 6, 2013, at 10:29 AM, Wolfgang Routschka wrote: > Hi, > > one question today about deleting start/stop error in crm_mon. > > How can I delete failure/errors in crm_mon without having to restart/refresh > resources? crm resource cleanup some-resource
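The reply above boils down to a single command. As a minimal sketch, it could be wrapped in a small shell helper; the function name `cleanup_resource` and the `DRY_RUN` variable are hypothetical and not part of crmsh, and only `crm resource cleanup` comes from the message itself. By default the helper only prints the command, so it is safe to try on a machine without a cluster.

```shell
#!/bin/sh
# Hypothetical wrapper around the command quoted in the reply.
# DRY_RUN defaults to 1: the command is printed, not executed.
# Set DRY_RUN=0 on a real cluster node to actually run it.
cleanup_resource() {
    rsc="$1"
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "crm resource cleanup $rsc"
    else
        crm resource cleanup "$rsc"
    fi
}

cleanup_resource some-resource
```

Once the cleanup has run on a real cluster, the failure entries for that resource should no longer show up in crm_mon.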

Re: [Pacemaker] stonith and pacemaker with Centos 6

2012-07-12 Thread Vadym Chepkov
On Sep 20, 2011, at 1:59 PM, Charles Richard wrote: > Hi, on my quest to trying to understand STONITH more and to get it working > with CentOS 6, i installed the fencing_agents on the os and also installed > OpenIPMI. If i understood a little, the fence_ipmilan fencing agent is the > one I'd

Re: [Pacemaker] Service automatically restart after IP moving

2012-07-12 Thread Vadym Chepkov
On Jul 12, 2012, at 3:18 AM, CHAMLEY Stephane wrote: > Ah, thank you! :) > > But it is not working ... :( I think it's a bug. I saw the same behavior in 1.0.12 as well. I ended up defining independent resources, not a clone. What's interesting, though: if you stop the ipv_test resource, the monclo

[Pacemaker] Pacemaker 1.1.7 order constraint syntax

2012-07-19 Thread Vadym Chepkov
Hi, When Pacemaker 1.1.7 was announced, a new feature was mentioned: The ability to specify that A starts after ( B or C or D ) I wasn't able to find an example of how to express it in crm shell, either in man crm or in Pacemaker Explained. In fact, http://www.clusterlabs.org/doc/en-US/Pacemaker/1.

Re: [Pacemaker] Pacemaker 1.1.7 order constraint syntax

2012-07-19 Thread Vadym Chepkov
On Jul 19, 2012, at 6:55 AM, Phillip Frost wrote: > On Jul 19, 2012, at 5:47 AM, Vadym Chepkov wrote: > >> Hi, >> >> When Pacemaker 1.1.7 was announced, a new feature was mentioned: >> >> The ability to specify that A starts after ( B or C or D ) >>

Re: [Pacemaker] Pacemaker 1.1.7 order constraint syntax

2012-07-19 Thread Vadym Chepkov
On Jul 19, 2012, at 8:57 AM, Rasto Levrinc wrote: > On Thu, Jul 19, 2012 at 2:38 PM, Andreas Kurz wrote: >> On 07/19/2012 11:47 AM, Vadym Chepkov wrote: >>> Hi, >>> >>> When Pacemaker 1.1.7 was announced, a new feature was mentioned: >>> >>>

Re: [Pacemaker] Pacemaker 1.1.7 order constraint syntax

2012-07-19 Thread Vadym Chepkov
On Jul 19, 2012, at 8:16 AM, Phillip Frost wrote: > On Jul 19, 2012, at 7:44 AM, Vadym Chepkov wrote: > >> Not there either. > > Maybe I'm not understanding your question. Isn't this what you are seeking? > >> Ordered resource sets have an >> ext

Re: [Pacemaker] None of the standard agents in ocf:heartbeat are working in centos 6

2012-07-24 Thread Vadym Chepkov
On Jul 24, 2012, at 12:25 AM, Vladislav Bogdanov wrote: > 24.07.2012 04:50, Andrew Beekhof wrote: >> On Tue, Jul 24, 2012 at 5:38 AM, David Barchas wrote: >>> >>> On Monday, July 23, 2012 at 7:48 AM, David Barchas wrote: >>> >>> >>> Date: Mon, 23 Jul 2012 14:15:27 +0300 >>> From: Vladislav Bo

Re: [Pacemaker] about iTCO_wdt watchdog

2012-09-09 Thread Vadym Chepkov
On Aug 2, 2012, at 4:06 AM, Mia Lueng wrote: > you misunderstand me. I just simulate a system crash to test if the > watchdog can reboot the system . > All kernel wdt modules still rely on a functioning kernel. But you crashed the kernel at this point, so no one will reboot your system. What I thin

Re: [Pacemaker] rhel6/cman+pacemaker - how to use clvm?

2013-04-08 Thread Vadym Chepkov
On Apr 8, 2013, at 6:52 AM, David Coulson wrote: > > On 4/8/13 6:42 AM, Yuriy Demchenko wrote: >> >> The purpose of my cluster is to provide HA VM and routing/gateway (thus RHCS >> isn't an option for me - no IPaddr2 and Route resources). >> But I cannot find any documentation how to use cLVM

Re: [Pacemaker] pcs equivalent of crm configure erase

2013-04-17 Thread Vadym Chepkov
On Apr 17, 2013, at 11:57 AM, T. wrote: > Hi, > >> b) If I can't do it with pcs, is there a reliable >> and secure way to do it with pacemaker low level tools? > why not just install crmsh from a different repository? > > This is what I have done on CentOS 6.4. My sentiments exactly. A

Re: [Pacemaker] pcs equivalent of crm configure erase

2013-04-17 Thread Vadym Chepkov
On Apr 17, 2013, at 8:04 PM, Chris Feist wrote: > On 04/17/13 11:13, Vadym Chepkov wrote: >> >> On Apr 17, 2013, at 11:57 AM, T. wrote: >> >>> Hi, >>> >>>> b) If I can't do it with pcs, is there a reliable >>>> and

Re: [Pacemaker] Multiple attribute match

2011-02-28 Thread Vadym Chepkov
On Feb 28, 2011, at 3:52 AM, Dejan Muhamedagic wrote: > Hi, > > On Sun, Feb 27, 2011 at 05:18:22PM -0500, Vadym Chepkov wrote: >> Hi >> >> In pacemaker-1.0.10 I discovered a following issue: >> >> since "start" operation is not exactly rev

Re: [Pacemaker] disable failover

2011-02-28 Thread Vadym Chepkov
On Feb 28, 2011, at 7:38 PM, Bob Schatz wrote: > Alan, > > Have you tried this command? > > # crm configure property maintenance-mode=true > > I have used this when I don't want Pacemaker to react to a monitor failing > such > as when I run gdb on a process. > I usually use crm configure

[Pacemaker] agents and git

2011-03-26 Thread Vadym Chepkov
Hi, Since resource-agents repository was migrated to git, instructions at http://www.clusterlabs.org/wiki/Install#Resource_Agents are outdated. I was not able to find any references at http://linux-ha.org/wiki/Resource_Agents either. Could somebody with the knowledge of the process publish the i

Re: [Pacemaker] How to send email-notification on failure of resource in cluster frame work

2011-03-28 Thread Vadym Chepkov
On Mar 24, 2011, at 12:46 AM, Rakesh K wrote: > Hi ALL > Is there any way to send Email notifications when a resource is failure in the > cluster frame work. > > while i was going through the Pacemaker-explained document provided in the > website www.clusterlabs.org > > There was no content in

[Pacemaker] pacemaker and snmptt

2011-03-28 Thread Vadym Chepkov
Hi, I have been trying to have snmptt (http://www.snmptt.org/) catch pacemaker's traps, but haven't been successful so far. The snmpttconvertmib utility doesn't process PCMK-MIB.txt; it complains that the file doesn't have any TRAP-TYPE / NOTIFICATION-TYPE lines. I am catching "unknown" traps: Mon Mar 28 21:06:

Re: [Pacemaker] How to send email-notification on failure of resource in cluster frame work

2011-03-29 Thread Vadym Chepkov
On Mar 29, 2011 6:12 AM, "Michael Schwartzkopff" wrote: > > > On Tue, Mar 29, 2011 at 3:29 AM, Vadym Chepkov wrote: > > > On Mar 24, 2011, at 12:46 AM, Rakesh K wrote: > > >> Hi ALL > > >> Is there any way to send Email notifications when a

Re: [Pacemaker] How to send email-notification on failure of resource in cluster frame work

2011-03-30 Thread Vadym Chepkov
On Mar 29, 2011, at 11:34 PM, Michael Schwartzkopff wrote: >> On Mar 29, 2011 6:12 AM, "Michael Schwartzkopff" >> >> wrote: >>>> On Tue, Mar 29, 2011 at 3:29 AM, Vadym Chepkov >> >> wrote: >>>>> On Mar 24, 2011, at 12:46 AM,

Re: [Pacemaker] pacemaker and snmptt

2011-04-01 Thread Vadym Chepkov
On Apr 1, 2011, at 4:04 AM, Andrew Beekhof wrote: > On Tue, Mar 29, 2011 at 5:10 AM, Vadym Chepkov wrote: >> Hi, >> >> I have been trying to have snmptt (http://www.snmptt.org/) catch pacemaker's >> traps, but haven't been successful so far. >>

Re: [Pacemaker] ERROR: te_connect_stonith: Sign-in failed: triggered a retry

2011-04-17 Thread Vadym Chepkov
On Apr 16, 2011, at 5:33 PM, Aleksey Mykhailov wrote: > Hello! > > I've installed pacemaker with heartbeat following instruction on this page > http://www.clusterlabs.org/rpm-next/ (Installation - EPEL) > I have CentOS 5.6 32 bit > > rpm -qi pacemaker|grep Version > Version : 1.1.5 > > rpm

Re: [Pacemaker] [pacemaker]Notification alerts when fail-over take place from one node to other node in cluster.

2011-04-21 Thread Vadym Chepkov
On Apr 21, 2011, at 9:00 AM, rakesh k wrote: > Hello Every body > > We had configured HA/pacemaker,even we can migrate the resources between the > nodes in a cluster frame work, when there is a failure occurs for certain > process, > > My concerns here at any given point of time in cluster f

Re: [Pacemaker] [pacemaker]Notification alerts when fail-over take place from one node to other node in cluster.

2011-04-25 Thread Vadym Chepkov
ideal, but again, since this functionality doesn't come with pacemaker yet, you would have to invent your own wheel ;) Cheers, Vadym On Apr 25, 2011 1:15 AM, "Rakesh K" wrote: > Vadym Chepkov writes: > > >> You can colocate your resource with a MailTo pseudo resource

Re: [Pacemaker] [pacemaker]Notification alerts when fail-over take place from one node to other node in cluster.

2011-04-26 Thread Vadym Chepkov
On Apr 26, 2011, at 7:00 AM, Rakesh K wrote: > > Hi Vadym Chepkov > > Thanks for giving the reply. As mentioned i am trying to configure MailTo RA > with the Heartbeat from the command line > > i used the following configuration to configure it on the Heartbeat

[Pacemaker] [PATCH] Low: minor corrections in the spec file

2011-04-30 Thread Vadym Chepkov
# HG changeset patch # User Vadym Chepkov # Date 1304167609 14400 # Branch stable-1.0 # Node ID a051be4bc03ea0daaf9a9beaf51298c52cc3f3b7 # Parent 1554a83db0d3c3e546cfd3aaff6af1184f79ee87 Low: minor corrections in the spec file diff --git a/pacemaker.spec.in b/pacemaker.spec.in --- a

Re: [Pacemaker] Pacemaker in RHEL6.

2011-08-10 Thread Vadym Chepkov
On Aug 10, 2011, at 11:43 AM, Marco van Putten wrote: > On 08/10/2011 04:31 PM, Andreas Kurz wrote: >> On 2011-08-10 14:13, Marco van Putten wrote: >>> Hi, >>> >>> Is it possible to get the pacemaker rpm's available for RHEL6 on the >>> Clusterlabs repository (like for RHEL5)? >>> >>> I know th

Re: [Pacemaker] group depending on clones restarting unnescessary

2011-08-26 Thread Vadym Chepkov
On Aug 26, 2011, at 2:24 PM, Michael Schwartzkopff wrote: > Hi, > > I set up HA NFS Server according to the HOWTO from linbit. Basically it is a > clone of the NFS server and a clone of the root filesystem. A group of the > Filesystem, the exportfs and the ip address depends on a DRBD and the

Re: [Pacemaker] resource moving unnecessarily due to ping race condition

2011-09-10 Thread Vadym Chepkov
On Sep 8, 2011, at 3:40 PM, Florian Haas wrote: >>> On 09/08/11 20:59, Brad Johnson wrote: We have a 2 node cluster with a single resource. The resource must run on only a single node at one time. Using the pacemaker:ocf:ping RA we are pinging a WAN gateway and a LAN host on each n

Re: [Pacemaker] Pacemaker and DRBD - Will Not Promote

2011-10-28 Thread Vadym Chepkov
From DRBD's point of view, it's running, just not being used. You need a promote constraint and a resource depending on Master, IMHO. Cheers, Vadym ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pace

[Pacemaker] duality and equality

2010-04-10 Thread Vadym Chepkov
nfigure property default-action-timeout=120 makes it happy. And this makes me wonder: are these parameters really the same, or do they have different meanings? Thank you. Sincerely yours, Vadym Chepkov

Re: [Pacemaker] OpenAIS priorities

2010-04-29 Thread Vadym Chepkov
http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Pacemaker_Explained/node-score-equal.html On Apr 29, 2010, at 10:20 AM, Dan Frincu wrote: > Greetings all, > > In the case of two servers in a cluster with OpenAIS, take the following > example: > > location Failover_Alert_1 Failover_Aler

[Pacemaker] pacemaker and gnbd

2010-05-01 Thread Vadym Chepkov
thing else needs to be done? Thank you, Vadym Chepkov

Re: [Pacemaker] pacemaker and gnbd

2010-05-03 Thread Vadym Chepkov
On May 3, 2010, at 2:23 AM, Andrew Beekhof wrote: > > > I doubt openais conflicts with corosync, unless you have a very old > version of cman. > The repos include openais 1.0.x which is built against corosync. > Unless I am doing something terribly wrong, this is not the case. Redhat 5.5 (the

Re: [Pacemaker] pacemaker and gnbd

2010-05-03 Thread Vadym Chepkov
enais. Hmm, this is what I asked at the very beginning: On Sat, May 1, 2010 at 3:30 PM, Vadym Chepkov wrote: > Hi, > > I found out I can't use gnbd if I use pacemaker rpm from clusterlabs > repository, because gnbd depends on cman which requires openais which > conflicts with

Re: [Pacemaker] pacemaker and gnbd

2010-05-03 Thread Vadym Chepkov
On May 3, 2010, at 5:39 PM, Andrew Beekhof wrote: > > perhaps try the srpm from F-12 Would be nice, but the last one was in F-9, it seems: http://koji.fedoraproject.org/koji/packageinfo?packageID=182 Vadym

Re: [Pacemaker] pacemaker and gnbd

2010-05-03 Thread Vadym Chepkov
On May 3, 2010, at 6:03 PM, Vadym Chepkov wrote: > > On May 3, 2010, at 5:39 PM, Andrew Beekhof wrote: > >> >> perhaps try the srpm from F-12 > > Would be nice, but the last one was in F-9, it seems: > > http://koji.fedoraproject.org/koji/packageinfo?package

Re: [Pacemaker] pacemaker and gnbd

2010-05-04 Thread Vadym Chepkov
On Tue, May 4, 2010 at 3:41 AM, Andrew Beekhof wrote: > > Hmmm... I wonder if the RHEL5.5 kernel is new enough to run the dlm. > I suspect not. > > Why not try the RHEL6 beta? It comes with compatible versions of > everything (including pacemaker). > > http://ftp.redhat.com/redhat/rhel/beta/6/x8

[Pacemaker] corosync rpm in clusterlabs repo

2010-05-04 Thread Vadym Chepkov
Hi, I noticed my mock fails on the corosync libraries, and I found out the corosync rpms were replaced with packages with the same version number. This is what I have installed: $ rpm -qi corosync Name: corosync Relocations: (not relocatable) Version : 1.2.1

Re: [Pacemaker] clone ip definition and location stops my resources...

2010-05-10 Thread Vadym Chepkov
# crm ra meta ping name (string, [undef]): Attribute name The name of the attributes to set. This is the name to be used in the constraints. The default is "pingd", but you are checking against pinggw. I suggest you do not change the name, though, but adjust your location constraint to use pingd i
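The advice above (keep the default attribute name and make the constraint test `pingd`) can be sketched as a crm configuration fragment. This is only an illustration under assumptions: the resource names `p-ping`, `cl-ping` and `my-resource`, the file name `ping-example.crm`, the host address and the monitor interval are placeholders, not values from the thread. The script only writes the fragment to a file; nothing is applied to a cluster.

```shell
#!/bin/sh
# Write an illustrative crm configuration fragment to a file.
# All names and the address below are placeholders (assumptions).
cat > ping-example.crm <<'EOF'
primitive p-ping ocf:pacemaker:ping \
    params host_list="192.0.2.1" \
    op monitor interval="10s"
clone cl-ping p-ping
location loc-needs-ping my-resource \
    rule -inf: not_defined pingd or pingd lte 0
EOF
# To apply it on a real cluster (not run here):
#   crm configure load update ping-example.crm
```

Because the `name` parameter is left unset, the agent publishes its result under the default attribute `pingd`, which is the attribute the location rule checks. The monitor operation is included because the agent only refreshes the attribute when it is monitored.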

Re: [Pacemaker] clone ip definition and location stops my resources...

2010-05-11 Thread Vadym Chepkov
You forgot to turn on monitor operation for ping (actual job) On May 11, 2010, at 5:15 AM, Gianluca Cecchi wrote: > On Mon, May 10, 2010 at 4:39 PM, Vadym Chepkov wrote: > # crm ra meta ping > > name (string, [undef]): Attribute name > The name of the attributes to set. T

Re: [Pacemaker] clone ip definition and location stops my resources...

2010-05-11 Thread Vadym Chepkov
ngd or pingd lte 0 On May 11, 2010, at 7:06 AM, Gianluca Cecchi wrote: > On Tue, May 11, 2010 at 12:50 PM, Vadym Chepkov wrote: > You forgot to turn on monitor operation for ping (actual job) > > > > I saw from the > [r...@ha1 ~]# crm ra meta ping > command >

Re: [Pacemaker] clone ip definition and location stops my resources...

2010-05-11 Thread Vadym Chepkov
wrote: > On Tue, May 11, 2010 at 12:50 PM, Vadym Chepkov wrote: > You forgot to turn on monitor operation for ping (actual job) > > > > I saw from the > [r...@ha1 ~]# crm ra meta ping > command > > Operations' defaults (advisory minimum): > >

Re: [Pacemaker] clone ip definition and location stops my resources...

2010-05-11 Thread Vadym Chepkov
property default-action-timeout="120s" On Tue, May 11, 2010 at 11:00 AM, Gianluca Cecchi wrote: > On Tue, May 11, 2010 at 1:13 PM, Vadym Chepkov wrote: > >> First of all, none of the monitor operation is on by default in pacemaker, >> this is something that you have t

Re: [Pacemaker] clone ip definition and location stops my resources...

2010-05-11 Thread Vadym Chepkov
There is no "default" unless it's set; that's why crm complains On Tue, May 11, 2010 at 12:41 PM, Gianluca Cecchi wrote: > On Tue, May 11, 2010 at 5:47 PM, Vadym Chepkov wrote: > >> pingd is a daemon which is running all the time and does its job >> you

Re: [Pacemaker] Pacemaker installation on CentOs 5.3

2010-05-11 Thread Vadym Chepkov
You didn't have to do 'yum makecache'. Some time ago Andrew accidentally replaced some rpms without bumping up the revision number. This made yum complain. 'yum clean all' should have cured all that. On Tue, May 11, 2010 at 2:09 PM, Simon Lavigne-Giroux wrote: > I found the solution to my probl

Re: [Pacemaker] two nodes fenced when drbd link fails

2010-05-13 Thread Vadym Chepkov
On May 13, 2010, at 1:37 PM, Ivan Coronado wrote: > Hello to everybody, > > I have a problem with the corosync.conf setup. I have a drbd service runing > on eth3, and a general network and the stonith device (idrac6) in the eth0. > If I unplug the eth3 to simulate a network failure two nodes

[Pacemaker] pengine self-maintenance

2010-05-15 Thread Vadym Chepkov
Hi, I noticed pengine (pacemaker-1.0.8-6.el5) creates quite a lot of files in /var/lib/pengine, especially when cluster-recheck-interval is set to enable failure-timeout checks. /var/lib/heartbeat/crm/ also seems to be growing unattended. Does pacemaker do any self-maintenance, or will it cause system t

Re: [Pacemaker] pengine self-maintenance

2010-05-17 Thread Vadym Chepkov
On May 17, 2010, at 2:52 AM, Andrew Beekhof wrote: > On Sun, May 16, 2010 at 1:09 AM, Vadym Chepkov wrote: >> Hi >> >> I noticed pengine (pacemaker-1.0.8-6.el5) creates quite a lot of files in >> /var/lib/pengine, >> especially when cluster-recheck-interval

Re: [Pacemaker] Detecting a lost network connection

2010-05-17 Thread Vadym Chepkov
On May 17, 2010, at 11:56 AM, Simon Lavigne-Giroux wrote: > Hi, > > I have 2 servers running Pacemaker. When the router fails, both nodes become > primary. Is it possible for Pacemaker on the secondary server to detect that > the network connection is not available and not become primary. >

Re: [Pacemaker] IP address does not failover on a new test cluster

2010-05-17 Thread Vadym Chepkov
On May 17, 2010, at 5:40 PM, Ruiyuan Jiang wrote: > Hi, Gianluca > > I modified my configuration and deleted “crm configure property > no-quorum-policy=ignore” as you suggested but I have the same problem that > the IP address does not fail. Thanks. > > [r...@usnbrl52 log]# crm configure sh

Re: [Pacemaker] IP address does not failover on a new test cluster

2010-05-18 Thread Vadym Chepkov
On Tue, May 18, 2010 at 2:22 PM, Ruiyuan Jiang wrote: > Hi, Vadym > > > > I modified the configuration per your suggestion. Here is the current > configuration of the cluster: > > > > [r...@usnbrl52 ~]# crm configure show > > node usnbrl52 > > node usnbrl53 > > primitive ClusterIP ocf:heartbeat:I

Re: [Pacemaker] IP address does not failover on a new test cluster

2010-05-18 Thread Vadym Chepkov
On Tue, May 18, 2010 at 3:58 PM, Ruiyuan Jiang wrote: > Thanks, Vadym > > > > This time it failed over to another node. For two nodes cluster, does the > cluster have to set to “no-quorum-policy=ignore” to failover or work > correctly? > > I can't say it better myself: http://www.clusterlabs.org

[Pacemaker] crm complains about resource id

2010-05-19 Thread Vadym Chepkov
Hi, It seems I found a bug in crm # rpm -q pacemaker pacemaker-1.0.8-6.el5 # crm configure rsc_defaults failure-timeout="10min" # crm configure show | tail -2 rsc_defaults $id="rsc-options" \ failure-timeout="10min" # crm configure edit rsc-options :%s/10/20/ ZZ ERROR: element meta_at

Re: [Pacemaker] pengine self-maintenance

2010-05-19 Thread Vadym Chepkov
On May 17, 2010, at 11:38 AM, Dejan Muhamedagic wrote: > > You don't want to set it that low. PE input files are part of > your cluster history. Set it to a few thousand. > What could be the drawbacks of having it too low? How are these files being used? And shouldn't some reasonable defau

Re: [Pacemaker] pengine self-maintenance

2010-05-19 Thread Vadym Chepkov
On Wed, May 19, 2010 at 1:26 PM, Dejan Muhamedagic wrote: > > And shouldn't some reasonable default be in place? I just > > happened to notice 90% inode utilization on my /var, some could > > be not so lucky. > > > Yes, that could be a problem. Perhaps that default could be > changed to say 100

[Pacemaker] manage/unmanage and stop/start logic

2010-05-19 Thread Vadym Chepkov
There is a flaw in the start/stop and manage/unmanage logic in crm, in my opinion. For example, I unmanaged some resource to do maintenance, then I issued crm resource manage again. At this point crm will add meta is-managed="true" to the resource. Later on I need to upgrade pacemaker soft

Re: [Pacemaker] manage/unmanage and stop/start logic

2010-05-20 Thread Vadym Chepkov
On Thu, May 20, 2010 at 5:05 AM, Dejan Muhamedagic wrote: > Too late for that, we shouldn't change semantics. I did think > about it at the time and say "resource manage rsc" seemed > unequivocal. BTW, there's a way to remove a meta attribute: > > crm resource meta <rsc> delete <attr> > > Thanks, > > Dejan >

Re: [Pacemaker] SNMP/SMTP alerts on move or STONITH?

2010-05-24 Thread Vadym Chepkov
On May 24, 2010, at 10:05 AM, Simpson, John R wrote: > Greetings all, > > First, my compliments to the Pacemaker and Corosync developers. I’ve > been trying out Pacemaker for the past few months, and (especially from the > command line) I’ve found building and managing Pacemaker-based c

[Pacemaker] power failure handling

2010-05-26 Thread Vadym Chepkov
Hi, What would be the proper way to shut down members of a two-node cluster in case of a power outage? I assume as soon as I issue 'crm node standby node-1 reboot' resources will start to fail over to the second node and, first of all, there is no reason for that, and, second of all, consecutive 'c

Re: [Pacemaker] power failure handling

2010-05-27 Thread Vadym Chepkov
On May 27, 2010, at 7:21 AM, Andrew Beekhof wrote: > On Wed, May 26, 2010 at 9:07 PM, Vadym Chepkov wrote: >> Hi, >> >> What would be the proper way to shut down members of a two-node cluster in case >> of a power outage? >> I assume as soon as I issue 'crm

Re: [Pacemaker] corosync/openais fails to start

2010-05-27 Thread Vadym Chepkov
On May 27, 2010, at 11:40 AM, Diego Remolina wrote: > Is there any workaround for this? Perhaps a slightly older version of the > rpms? If so where do I find those? chkconfig corosync off chkconfig heartbeat on Unfortunately, that's what I had to do on PPC64 RHEL5 > > I cannot get the opens

[Pacemaker] master/slave or unique clones

2010-05-28 Thread Vadym Chepkov
ation says I shouldn't do it. As a workaround I could probably create a separate "master" instance and clones with clone-max=node-1 and create -INFINITY collocation constraint between them, but it's more of a hack it seems. What would Andrew do ? :) Thank you, Vadym C

Re: [Pacemaker] master/slave or unique clones

2010-05-28 Thread Vadym Chepkov
On May 28, 2010, at 8:12 AM, Florian Haas wrote: > On 2010-05-28 14:01, Vadym Chepkov wrote: >> Hi, >> >> I want to convert our home-made application to be managed by pacemaker >> cluster. >> The way it works now: application starts, discovers all IPs confi

Re: [Pacemaker] master/slave or unique clones

2010-05-28 Thread Vadym Chepkov
On May 28, 2010, at 8:27 AM, Florian Haas wrote: >> >> Imperative word was "started". You think I still should go multi-state RA >> for this application? > > If the application which that RA applies to distinguishes between roles > equivalent to a Master and a Slave, and you want the RA to mana

Re: [Pacemaker] master/slave or unique clones

2010-05-28 Thread Vadym Chepkov
On May 28, 2010, at 11:17 AM, Florian Haas wrote: > On 05/28/2010 02:37 PM, Vadym Chepkov wrote: >> >> On May 28, 2010, at 8:27 AM, Florian Haas wrote: >>>> >>>> Imperative word was "started". You think I still should go multi-state RA >>

[Pacemaker] Stateful RA

2010-06-01 Thread Vadym Chepkov
Hi, I was looking into Stateful resource agent (branch: stable-1.0) stateful_start() { stateful_check_state master if [ $? = 0 ]; then # CRM Error - Should never happen return $OCF_RUNNING_MASTER fi ... Why does it return $OCF_RUNNING_MASTER when master is not r

Re: [Pacemaker] Stateful RA

2010-06-02 Thread Vadym Chepkov
On Jun 2, 2010, at 3:08 AM, Andrew Beekhof wrote: > On Wed, Jun 2, 2010 at 8:55 AM, Vadym Chepkov wrote: >> Hi, >> >> I was looking into Stateful resource agent (branch: stable-1.0) >> >> stateful_start() { >>stateful_check_state master >

[Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-03 Thread Vadym Chepkov
ing? Thank you, Vadym Chepkov

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-05 Thread Vadym Chepkov
On Jun 3, 2010, at 9:16 AM, Vadym Chepkov wrote: > Hi, > > Not sure what I am doing wrong > > primitive dummy1 ocf:pacemaker:Dummy > primitive dummy2 ocf:pacemaker:Dummy > primitive dummy3 ocf:pacemaker:Dummy > > only two nodes alive in 3 node cluster, so I

Re: [Pacemaker] i stop mysql service but the crm status is still runing

2010-06-06 Thread Vadym Chepkov
On Jun 6, 2010, at 9:15 PM, ch huang wrote: > mysql is running ,and crm status output is > > > Last updated: Sat Jun 5 09:48:58 2010 > Stack: openais > Current DC: PRIM - partition with quorum > Version: 1.0.8-9881a7350d6182bae9e8e557cf20a3cc5dac3ee7 > 2 Nodes configured, 2 expecte

Re: [Pacemaker] i stop mysql service but the crm status is still runing

2010-06-06 Thread Vadym Chepkov
On Jun 6, 2010, at 9:23 PM, ch huang wrote: > i notice my drbd status is abnormal,it seems they can not find each other,and > i try to restart drbd ,but still can not find each other Well, this has nothing to do with Pacemaker, you would need to issue crm resource stop drbd_mysql and then fi

Re: [Pacemaker] i stop mysql service but the crm status is still runing

2010-06-06 Thread Vadym Chepkov
to restart it on this host,but i want mysql start on another backup host No, you are not. This is your config, right? > > primitive mysqld lsb:mysqld No monitor operation is defined. > > On Mon, Jun 7, 2010 at 9:30 AM, Vadym Chepkov wrote: > > On Jun 6, 2010, at 9:15
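The point of the reply above is that Pacemaker only notices a stopped service if the resource has a monitor operation. A minimal sketch of what the quoted primitive might look like with one added follows; the interval and timeout values and the file name `mysqld-monitor.crm` are assumptions chosen for illustration, not values from the thread. The script only writes the fragment to a file and does not touch a cluster.

```shell
#!/bin/sh
# Write an illustrative primitive definition with a recurring monitor.
# interval/timeout are assumed values; tune them for the real service.
cat > mysqld-monitor.crm <<'EOF'
primitive mysqld lsb:mysqld \
    op monitor interval="30s" timeout="60s"
EOF
# To apply it on a real cluster (not run here):
#   crm configure load update mysqld-monitor.crm
```

With a recurring monitor in place, stopping mysqld by hand would be reported as a failure in crm status, and the cluster could then recover the resource according to its configuration.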

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-07 Thread Vadym Chepkov
On Jun 7, 2010, at 5:05 AM, Dejan Muhamedagic wrote: > Hi, > > On Sat, Jun 05, 2010 at 10:38:17AM -0400, Vadym Chepkov wrote: >> >> On Jun 3, 2010, at 9:16 AM, Vadym Chepkov wrote: >> >>> Hi, >>> >>> Not sure what I am doing wrong >>

[Pacemaker] How to replace an agent

2010-06-10 Thread Vadym Chepkov
ay to handle such modification? Thank you, Vadym Chepkov

Re: [Pacemaker] How to replace an agent

2010-06-11 Thread Vadym Chepkov
On Jun 10, 2010, at 9:03 AM, Dejan Muhamedagic wrote: > Hi, > > On Thu, Jun 10, 2010 at 08:46:22AM -0400, Vadym Chepkov wrote: >> Hi, >> >> I stumbled upon interesting feature or a bug, not sure how to classify it. >> >> I needed to add a resource to a

Re: [Pacemaker] crm node delete

2010-06-11 Thread Vadym Chepkov
On Jun 11, 2010, at 10:45 AM, Maros Timko wrote: > Hi all, > > using heartbeat stack. I have a system with one node offline: > > Last updated: Fri Jun 11 13:52:40 2010 > Stack: Heartbeat > Current DC: vsp7.example.com (ba6d6332-71dd-465b-a030-227bcd31a25f) - > partition with quorum

Re: [Pacemaker] How to replace an agent

2010-06-11 Thread Vadym Chepkov
On Jun 11, 2010, at 10:09 AM, Dejan Muhamedagic wrote: >> >> I found what I did wrong. When I loaded new resource definition >> I also removed meta target-role="Stopped". >> pacemaker decided it needs to restart the resource because the >> has changed. But it also using params definitions of the

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-14 Thread Vadym Chepkov
On Jun 7, 2010, at 8:04 AM, Vadym Chepkov wrote: > > I filed bug 2435, glad to hear "it's not me" > Andrew closed this bug (http://developerbugs.linux-foundation.org/show_bug.cgi?id=2435) as resolved, but I respectfully disagree. I will try to explain a problem a

Re: [Pacemaker] VirtualDomain/DRBD live migration with pacemaker...

2010-06-14 Thread Vadym Chepkov
On Mon, Jun 14, 2010 at 4:37 PM, Erich Weiler wrote: > Hi All, > > We have this interesting problem I was hoping someone could shed some light > on.  Basically, we have 2 servers acting as a pacemaker cluster for DRBD and > VirtualDomain (KVM) resources under CentOS 5.5. > > As it is set up, if on

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Vadym Chepkov
On Jun 15, 2010, at 4:57 AM, Andrew Beekhof wrote: > On Tue, Jun 15, 2010 at 10:23 AM, Andreas Kurz > wrote: >> On Tuesday 15 June 2010 08:40:58 Andrew Beekhof wrote: >>> On Mon, Jun 14, 2010 at 4:22 PM, Vadym Chepkov wrote: >>>> On Jun 7, 2010, at 8:04 AM, V

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Vadym Chepkov
> On Mon, Jun 14, 2010 at 4:22 PM, Vadym Chepkov wrote: >>>>> On Jun 7, 2010, at 8:04 AM, Vadym Chepkov wrote: >>>>>> I filed bug 2435, glad to hear "it's not me" >>>>> >>>>> Andrew closed this bug >>>>> (

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Vadym Chepkov
On Jun 15, 2010, at 8:11 AM, Gianluca Cecchi wrote: > On Tue, Jun 15, 2010 at 1:50 PM, Andrew Beekhof wrote: > [snip] > > Score = -inf, plus the patch, plus sequential = true (or unset). > Not sure how that looks in shell syntax though. > > > Which patch? http://hg.clusterlabs.org/pacemaker/

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Vadym Chepkov
On Jun 15, 2010, at 7:50 AM, Andrew Beekhof wrote: > On Tue, Jun 15, 2010 at 1:38 PM, Vadym Chepkov wrote: >> >> On Jun 15, 2010, at 4:57 AM, Andrew Beekhof wrote: >> >>> On Tue, Jun 15, 2010 at 10:23 AM, Andreas Kurz >>> wrote: >>>> On

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Vadym Chepkov
On Tue, Jun 15, 2010 at 9:14 AM, Andrew Beekhof wrote: > On Tue, Jun 15, 2010 at 2:57 PM, Vadym Chepkov wrote: >> >> On Jun 15, 2010, at 7:50 AM, Andrew Beekhof wrote: >> >>> On Tue, Jun 15, 2010 at 1:38 PM, Vadym Chepkov wrote: >>>> >>>

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Vadym Chepkov
On Jun 15, 2010, at 9:26 AM, Vadym Chepkov wrote: >>> >>> what about this part? what do I need to do to prevent them from running on >>> different nodes for sure? >> >> You can't have it both ways. >> Either they have to run on the same node

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Vadym Chepkov
On Jun 15, 2010, at 3:36 PM, Dejan Muhamedagic wrote: > Hi, > > On Tue, Jun 15, 2010 at 08:45:37AM -0400, Vadym Chepkov wrote: >> >> On Jun 15, 2010, at 6:14 AM, Dejan Muhamedagic wrote: >> >>> Hi, >>> >>> On Tue, Jun 15, 2010 at 10:57

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Vadym Chepkov
On Jun 15, 2010, at 3:55 PM, Dejan Muhamedagic wrote: > On Tue, Jun 15, 2010 at 03:41:17PM -0400, Vadym Chepkov wrote: >> >> On Jun 15, 2010, at 3:36 PM, Dejan Muhamedagic wrote: >> >>> Hi, >>> >>> On Tue, Jun 15, 2010 at 08:45:37AM -0400, Vadym

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Vadym Chepkov
On Jun 15, 2010, at 5:26 PM, Dejan Muhamedagic wrote: > On Tue, Jun 15, 2010 at 04:44:31PM -0400, Vadym Chepkov wrote: >> >> On Jun 15, 2010, at 3:55 PM, Dejan Muhamedagic wrote: >> >>> On Tue, Jun 15, 2010 at 03:41:17PM -0400, Vadym Chepkov wrote: >>>>

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-16 Thread Vadym Chepkov
On Jun 15, 2010, at 3:52 PM, Dejan Muhamedagic wrote: > On Tue, Jun 15, 2010 at 12:53:07PM -0400, Vadym Chepkov wrote: >> >> On Jun 15, 2010, at 9:26 AM, Vadym Chepkov wrote: >>>>> >>>>> what about this part? what do I need to do to prevent them fr

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-16 Thread Vadym Chepkov
On Jun 16, 2010, at 2:55 AM, Andrew Beekhof wrote: > On Tue, Jun 15, 2010 at 9:41 PM, Dejan Muhamedagic > wrote: > >> colocation not-together -inf: d1 d2 d3 > > I think there is a problem with this syntax, particularly for +inf. > > Consider: > colocation together1 inf: d1 d2 > > This mean

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-17 Thread Vadym Chepkov
On Jun 17, 2010, at 7:15 AM, Dejan Muhamedagic wrote: > On Wed, Jun 16, 2010 at 08:54:37AM -0400, Vadym Chepkov wrote: >> >> On Jun 15, 2010, at 3:52 PM, Dejan Muhamedagic wrote: >> >>> On Tue, Jun 15, 2010 at 12:53:07PM -0400, Vadym Chepkov wrote: >>>>

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-17 Thread Vadym Chepkov
Andrew, I took the latest sources from the repository and got myself pacemaker Version: 1.0.9-6bf91e9195fe7649e174af0ba2c67dbd902d4a2b Just as a reminder of how this whole story began: I want to be able to define resources that should never run on the same node. Here is the config primitive d1 o

Re: [Pacemaker] Drbd/Nfs MS don't failover on slave node

2010-07-07 Thread Vadym Chepkov
On Jul 7, 2010, at 11:41 AM, Guillaume Chanaud wrote: > I already tried the > > no_quorum-policy="ignore" > which has no effect > > I just retried when reading your mail and the resources don't migrate. > In fact even if I boot only one server, this one won't promote the resources > until the se

[Pacemaker] crm configure update and properties

2010-07-09 Thread Vadym Chepkov
and loaded it (or so I thought) with # crm configure load update pdu.crm To my surprise this removed all the other properties my cluster had :( Is this expected behavior? pacemaker-1.0.9.1 Thank you, Vadym Chepkov

Re: [Pacemaker] error installing CentOS clvm after using clusterlabs repository

2010-08-04 Thread Vadym Chepkov
On Wed, Aug 4, 2010 at 10:49 AM, Michael Fung wrote: > > On 2010/8/4 09:06 PM, Andrew Beekhof wrote: > >> You can either use cluster.conf for configuring corosync/cman or I can >> send you the corosync.conf snippet. >> > > Yes, please send me the corosync.conf snippet. > If it's not much trouble,

[Pacemaker] pacemaker in rhel6

2010-08-07 Thread Vadym Chepkov
Hi, It seems pacemaker is broken in rhel6-beta2 # rpm -q pacemaker pacemaker-1.1.2-5.el6.x86_64 # crm configure load replace crm.cfg crm_standby not available, check your installation And it's true, crm_standby is not part of the RPM Thanks, Vadym

Re: [Pacemaker] pacemaker in rhel6

2010-08-09 Thread Vadym Chepkov
On Aug 9, 2010, at 3:59 AM, Andrew Beekhof wrote: > yes, it's fixed in -6 > I wasn't able to find an "updates" repository for rhel6. Is it available? > On Sat, Aug 7, 2010 at 8:03 PM, Vadym Chepkov wrote: >> Hi, >> >> It seems pacemaker is broke

Re: [Pacemaker] migration-threshold and failure-timeout

2010-09-21 Thread Vadym Chepkov
On Tue, Sep 21, 2010 at 9:14 AM, Dan Frincu wrote: > Hi, > > This => > http://www.clusterlabs.org/doc/en-US/Pacemaker/1.0/html/Pacemaker_Explained/s-failure-migration.html > explains it pretty well. Notice the INFINITY score and what sets it. > > However I don't know of any automatic method to cle

Re: [Pacemaker] starting a xen-domU depending on available hardware-resources using SysInfo-RA

2010-09-30 Thread Vadym Chepkov
On Sep 30, 2010, at 2:35 AM, Sascha Reimann wrote: > Hi Dejan, > > it's working fine with the amount of free ram as the score and a bigger > default-resource-stickiness: > > primitive v01 ocf:heartbeat:Xen \ > params xmfile="/etc/xen/conf.d/v01.cfg" \ > op monitor interval="30s" ti
