[Pacemaker] Ordering set of resources, problem in ordering chain of resources

2011-04-19 Thread rakesh k
Hi All I had configured Heartbeeat, pacemkaer on my two VM's Cluster with two nodes, two nodes are running *cent-os* as operation systems. Cluster configured with 8 resources and defined order, taking reference pacemaker explained please find the order and co-location constraints, taken from

Re: [Pacemaker] Ordering set of resources, problem in ordering chain of resources

2011-04-19 Thread Andrew Beekhof
What version of pacemaker? On Tue, Apr 19, 2011 at 9:10 AM, rakesh k rakirocker4...@gmail.com wrote: Hi All I had configured Heartbeeat, pacemkaer on my two VM's Cluster with two nodes, two nodes are running cent-os as operation systems. Cluster configured with 8 resources and defined

Re: [Pacemaker] Heartbeat over Disk or non IP possible?

2011-04-19 Thread Ulf
Hi, So it seems not to be possible to do a heartbeat over disk. Is it planned to introduce such a feature? Cheers, Ulf On Sat, Apr 16, 2011 at 12:23 PM, Ulf m...@gmx.net wrote: Hi, is there a way to implement a heartbeat over disk? Or any other non IP medium? I think the SFEX

[Pacemaker] mysql m/s failover: 'Could not find first log file name in binary log index file'

2011-04-19 Thread Raoul Bhatia [IPAX]
hi, i'm starting a new thread to address a specific Could not find first log file name in binary log index file error upon failover. background: i currently have a two node mysql m/s setup. expire_logs_days (was) set to 7 days last failover happend 7 days ago (therefore, binlogs have been

Re: [Pacemaker] mysql m/s failover: 'Could not find first log file name in binary log index file'

2011-04-19 Thread Raoul Bhatia [IPAX]
On 04/19/2011 10:01 AM, Raoul Bhatia [IPAX] wrote: the failover worked and wdb02 is up and running. upon rejoin, wdb01 wanted to start syncing from mysql-bin.15, position 24386 (as saved in the cib). this fails with error Last_IO_Errno: 1236 and the message: Last_IO_Error: Got fatal

Re: [Pacemaker] mysql m/s failover: 'Could not find first log file name in binary log index file'

2011-04-19 Thread Raoul Bhatia [IPAX]
On 04/19/2011 10:20 AM, Marek Marczykowski wrote: On 04/19/11 10:01, Raoul Bhatia [IPAX] wrote: what i can currently think of: 1. run a cronjob which periodically analyzes the binlogs and will update the node's log-file and log-pos attributes if there are empty binlogs; (that's the best

Re: [Pacemaker] Ordering set of resources, problem in ordering chain of resources

2011-04-19 Thread Rakesh K
Andrew Beekhof andrew@... writes: Hi Andrew thanks for giving reply. The version of pacemaker i am using is pacemaker-1.0.9.1 Regards Rakesh ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org

Re: [Pacemaker] mysql m/s failover: 'Could not find first log file name in binary log index file'

2011-04-19 Thread Raoul Bhatia [IPAX]
On 04/19/2011 10:38 AM, Marek Marczykowski wrote: On 04/19/11 10:29, Raoul Bhatia [IPAX] wrote: On 04/19/2011 10:20 AM, Marek Marczykowski wrote: On 04/19/11 10:01, Raoul Bhatia [IPAX] wrote: what i can currently think of: 1. run a cronjob which periodically analyzes the binlogs and will

Re: [Pacemaker] Heartbeat over Disk or non IP possible?

2011-04-19 Thread Andrew Beekhof
On Tue, Apr 19, 2011 at 9:31 AM, Ulf m...@gmx.net wrote: Hi, So it seems not to be possible to do a heartbeat over disk. Is it planned to introduce such a feature? It would be a feature of the underlying communications layer. So you'd have to ask the heartbeat or corosync maintainers - but

Re: [Pacemaker] Question of the syslog output in pacemaker-1.1

2011-04-19 Thread Andrew Beekhof
On Tue, Apr 19, 2011 at 9:25 AM, Yuusuke IIDA iiday...@intellilink.co.jp wrote: Hi, Andrew I use corosync-1.3.0 and Pacemaker-1.1.5. The log outputs it via rsyslog. I changed syslog_facility of corosync.conf to local1 and was going to let a designated file output the log of the cluster.

Re: [Pacemaker] Pacemaker / Postfix startup problem...

2011-04-19 Thread Raoul Bhatia [IPAX]
adam, any news on this? if this is not working for you, i've got another idea. but please report the current status first... thanks, raoul On 04/14/2011 08:33 PM, Raoul Bhatia [IPAX] wrote: hi adam, On 14.04.2011 18:10, Adam Reiss wrote: Hi Raoul, We're trying to setup a HA SMTP Relay, so

Re: [Pacemaker] A question and demand to a resource placement strategy function

2011-04-19 Thread Andrew Beekhof
Yan is our utilization expert, lets see if he can provide some direction here :-) -- Andrew 2011/4/18 Yuusuke IIDA iiday...@intellilink.co.jp: Hi, Andrew I want to disperse using a resource placement strategy function of Pacemaker-1.1 in the fail-over point of the resource in N to N

Re: [Pacemaker] how to get pacemaker:ping recheck before promoting drbd resources on a node

2011-04-19 Thread Andrew Beekhof
On Mon, Apr 18, 2011 at 8:57 PM, Jelle de Jong jelledej...@powercraft.nl wrote: Hello everybody, I need to be able to bring down my network interface (network failure test) and few seconds later bring it up again. Without my drbd cluster going nuts and creating split brains. I was advised

Re: [Pacemaker] how to get pacemaker:ping recheck before promoting drbd resources on a node

2011-04-19 Thread Jelle de Jong
On 19-04-11 11:31, Andrew Beekhof wrote: It the underlying messaging/membership layer goes into spasms - there's not much ping can do to help you. What version of corosync have you got? Some versions have been better than others. corosync 1.2.1-4 pacemaker 1.0.9.1+hg15626-1

[Pacemaker] SBD kills both nodes in a two node cluster.

2011-04-19 Thread Ulf
I' ve two nodes with shared storage and multipathing. But the SBD device doesn't work as expected. My idea was that in case of a split brain: One node kills the other node and one will survive. But in my case I get a double kill, both nodes will be killed at the same time. I simulated the split

Re: [Pacemaker] Unable to stop Multi state resource

2011-04-19 Thread Rakesh K
Rakesh K rakirocker4236@... writes: Hi Andrew FSR is a File system replication script which adheres to ocf cluster frame work, the script is similar to Mysql ocf script, which is a multi state resource, where in master ssh server would be running and in slave there are rsync scripts which

Re: [Pacemaker] Pacemaker / Postfix startup problem...

2011-04-19 Thread Adam Reiss
I'll get a chance to work on it today. I'll let you know what happens. :) Thanks!! -Original Message- From: Raoul Bhatia [IPAX] [mailto:r.bha...@ipax.at] Sent: Tuesday, April 19, 2011 5:15 AM To: The Pacemaker cluster resource manager Cc: Adam Reiss Subject: Re: [Pacemaker]

Re: [Pacemaker] A question and demand to a resource placement strategy function

2011-04-19 Thread Yan Gao
On 04/18/11 18:17, Yuusuke IIDA wrote: * When it is not dispersed well When I produced trouble in a resource in order of next, I am partial, and the resource is placed in one node. main_rsc3 - main_rsc2 - main_rsc1 Online: [srv-b1 srv-b2 srv-a1] Full list of resources: main_rsc1

Re: [Pacemaker] how to get pacemaker:ping recheck before promoting drbd resources on a node

2011-04-19 Thread Andrew Beekhof
On Tue, Apr 19, 2011 at 11:54 AM, Jelle de Jong jelledej...@powercraft.nl wrote: On 19-04-11 11:31, Andrew Beekhof wrote: It the underlying messaging/membership layer goes into spasms - there's not much ping can do to help you. What version of corosync have you got?  Some versions have been

[Pacemaker] crm : unknown expected votes

2011-04-19 Thread hari.n.tatituri
Hi, I created a 2 node cluster created using pacemaker on Fedora 14(2.6.35.6-45.fc14.x86_64) I have two errors that I am not able to resolve. Can someone help me resolve these errors. 1 ) It always shows unknown expected votes when I see 'crm status'. 2 ) In the logfile it

[Pacemaker] Resource Agents 1.0.4: HA LVM Patch

2011-04-19 Thread Ulf
Hi, I attached a patch to enhance the LVM agent with the capability to set a tag on the VG (set_hosttag = true) in conjunction with a volume_list filter this can prevent to activate a VG on multiple host. Unfortunately active VGs will stay active in case of unclean operation. The tag is always

Re: [Pacemaker] mysql m/s failover: 'Could not find first log file name in binary log index file'

2011-04-19 Thread Raoul Bhatia [IPAX]
On 04/19/2011 10:38 AM, Marek Marczykowski wrote: in your opintion, is it possible to fix this via the ocf ra or does it have to be a separate cronjob? I haven't idea how to do it in ra. There is no easy way to look what binlogs are on the other node. Maybe some tricks storing that info on

[Pacemaker] Resources won't start

2011-04-19 Thread Phil Hunt
Hi I've been having alot of problems figuring out a problem. In the enclosed config for a 2 node cluster, letting 2 RHEL5 boxes work as a cluster with a shared iSCSI disk stored on a Windows Storage Server box, the resources will not start. I have iscsid running, no iscsi. I was modifying

Re: [Pacemaker] Resources won't start

2011-04-19 Thread mark - pacemaker list
Hi Phil, On Tue, Apr 19, 2011 at 3:36 PM, Phil Hunt phil.h...@orionhealth.com wrote: Hi I have iscsid running, no iscsi. Good. You don't want the system to auto-connect the iSCSI disks on boot, pacemaker will do that for you. Here is the crm status: Last updated: Tue Apr

Re: [Pacemaker] Resources won't start

2011-04-19 Thread Phil Hunt
Did it start? No, here is the output, all resources kind of went away. Thats what I've been fighting all day.. Last updated: Tue Apr 19 13:52:18 2011 Stack: openais Current DC: CentClus2 - partition with quorum Version: 1.0.10-da7075976b5ff0bee71074385f8fd02f296ec8a3 2 Nodes

Re: [Pacemaker] Question of the syslog output in pacemaker-1.1

2011-04-19 Thread Yuusuke IIDA
Hi, Andrew (2011/04/19 18:13), Andrew Beekhof wrote: On Tue, Apr 19, 2011 at 9:25 AM, Yuusuke IIDA iiday...@intellilink.co.jp wrote: Hi, Andrew I use corosync-1.3.0 and Pacemaker-1.1.5. The log outputs it via rsyslog. I changed syslog_facility of corosync.conf to local1 and was going to