[Pacemaker] Pacemaker GUI won't start if it can't determine the format of its window icon

2009-05-18 Thread Florian Haas
Hello, seen on SLE 11 HAE, Pacemaker 1.0.3, Pacemaker GUI 1.4: # crm_gui /usr/bin/crm_gui:5785: GtkWarning: Cannot open pixbuf loader module file '/etc/gtk-2.0/gdk-pixbuf.loaders': No such file or directory win_widget.set_icon_from_file(/usr/share/heartbeat-gui/ha.png) Traceback (most recent

Re: [Pacemaker] Pacemaker GUI won't start if it can't determine the format of its window icon

2009-05-18 Thread Florian Haas
On 2009-05-18 15:44, Lars Marowsky-Bree wrote: On 2009-05-18T14:30:30, Florian Haas flor...@linbit.com wrote: For the record: fixed by /usr/bin/gdk-pixbuf-query-loaders > /etc/gtk-2.0/gdk-pixbuf.loaders. Maybe this is missing from the SLE 11 gtk2 RPM post-install script? Can you file a bug

Re: [Pacemaker] Pacemaker on OpenAIS, RRP, and link failure

2009-05-25 Thread Florian Haas
On 2009-05-25 17:45, Andrew Beekhof wrote: SUSE is currently recommending NIC bonding. We've not been able to get satisfactory behavior from clusters using RRP. I've repeatedly told customers that NIC bonding is not a valid substitute for redundant Heartbeat links, I will stubbornly insist it

Re: [Pacemaker] Pacemaker on OpenAIS, RRP, and link failure

2009-05-26 Thread Florian Haas
Steve, On 2009-05-25 19:56, Juha Heinanen wrote: Steven Dake writes: The only options I see is to periodically try the failed ring for liveness. The problem with this approach is it is hard to implement. try all the time also after failure like was done before failure. Complete Totem

Re: [Pacemaker] lvm2-clvm RPMs in opensuse.org package repo?

2009-05-26 Thread Florian Haas
On 2009-05-11 10:56, Andrew Beekhof wrote: On Mon, May 11, 2009 at 10:45 AM, Florian Haas flor...@linbit.com wrote: Andrew, On 2009-05-08 15:33, Andrew Beekhof wrote: we hope to have the whole stack up there. Excellent. OK if I nag you again in a couple of weeks if it doesn't show up

Re: [Pacemaker] lvm2-clvm RPMs in opensuse.org package repo?

2009-05-27 Thread Florian Haas
On 2009-05-27 15:13, Andrew Beekhof wrote: thanks :-) they're building now. most distro/version combinations can't build it though as libdlm needs a very recent kernel (and everything else depends on that) When you say most combinations can't build it, could you provide a list where this

Re: [Pacemaker] lvm2-clvm RPMs in opensuse.org package repo?

2009-05-27 Thread Florian Haas
Hmmm. Debian lenny is on 2.6.26, and it already has clvm, albeit including a dependency on cman which is now probably obsolete. Rumor has it that Martin is working on fixing that. :) Cheers, Florian On 2009-05-27 15:26, Andrew Beekhof wrote: openSUSE 11.0 SLE11 thats about it until the

[Pacemaker] OCF_RESKEY_CRM_meta* envars

2009-05-28 Thread Florian Haas
Andrew, would you mind pointing me to wherever in the code the OCF_RESKEY_CRM_meta* variables are set and passed into the RA environment? I'd like to understand where and how this happens, have been unable to find documentation, and a simple recursive grep in the hg checkout wasn't too

Re: [Pacemaker] dopd on openais

2009-05-28 Thread Florian Haas
Raoul, No such thing currently exists. We're currently figuring out how to best do this with OpenAIS. Stay tuned. Cheers, Florian On 05/28/2009 08:40 PM, Raoul Bhatia [IPAX] wrote: hi, how do i configure dopd for openais? is this possible or is there another way to handle drbd devices?

[Pacemaker] pingd comments and metadata

2009-06-04 Thread Florian Haas
Andrew, Dejan, Dominik, I am by no means a pingd expert, but the current incarnation in stable-1.0 seems to have some outdated and misleading comments and meta data. Examples: parameter name=host_list unique=0 longdesc lang=en The list of ping nodes to count. Defaults to all configured ping

Re: [Pacemaker] kernel.core_uses_pid and ulimit -c

2009-06-04 Thread Florian Haas
On 06/04/2009 08:42 AM, Andrew Beekhof wrote: On Thu, Jun 4, 2009 at 8:40 AM, Florian Haas flor...@linbit.com wrote: Andrew, Dejan et al., The TODO page at http://clusterlabs.org/wiki/TODO states that Pacemaker now automagically sets the kernel.core_uses_pid sysctl to ease debugging

[Pacemaker] ocf_log debug ... doesn't?

2009-06-17 Thread Florian Haas
Hello everyone, correct me if I'm wrong here, but: if I configure - Pacemaker on OpenAIS with logging { to_syslog: yes }, and my syslog configured to capture the DEBUG priority into an output file, OR - Pacemaker on Heartbeat with use_logd no and no debugfile set, and my syslog configured to

Re: [Pacemaker] ocf_log debug ... doesn't?

2009-06-17 Thread Florian Haas
Dejan, On 2009-06-17 16:46, Dejan Muhamedagic wrote: Am I overlooking something blatantly obvious? Did you try logger -p trala.debug ... ? Does that work? If so, then ocf_log debug should work too. ocf_log is reduced to either logger(1) or ha_logger(1) call (see .ocf-shellfuncs and

Re: [Pacemaker] ocf_log debug ... doesn't?

2009-06-18 Thread Florian Haas
Dejan, On 2009-06-17 21:13, Dejan Muhamedagic wrote: Where exactly does it set HA_LOGFACILITY? It should be done by aisexec, i.e. some ais plugin. Oh, Andrew set it to HA_logfacility :( And that in December 2007. OK, I'll add the other one now. Thanks, I saw the commit. You may want to

[Pacemaker] What's going to happen with the heartbeat RAs?

2009-06-25 Thread Florian Haas
Hello everyone, This is something that's been on my mind for a while, and I'm still looking for a definitive answer. :) Just what exactly is the current plan for the recent changes to the RAs provided by Heartbeat (i.e. the ones that install into /usr/lib/ocf/resource.d/heartbeat)? I understand

Re: [Pacemaker] DRBD User's Guide 1.2.0rc1

2009-06-29 Thread Florian Haas
On 2009-06-29 15:10, Dejan Muhamedagic wrote: Hi, On Fri, Jun 26, 2009 at 10:32:39AM +0200, Florian Haas wrote: Hello everyone, A new version of the DRBD User's Guide is up at http://www.drbd.org/users-guide. As this version has some major additions to the previous release, I've decided

[Pacemaker] Resource agents: parameter type enforcement and normalization

2009-07-03 Thread Florian Haas
Hello, continuing a discussion I started, briefly, with Andrew and Dejan yesterday. As I hear from Andrew, when we define parameters in resource agents and describe them in the RA metadata information, then the CRM will in fact do nothing to actually enforce parameter types in the CIB. Instead,

[Pacemaker] Unexpected resource restarts after putting a node in standby mode

2009-07-06 Thread Florian Haas
Hello everyone, probably a bad time to ask this as Andrew is out on vacation, but maybe Dejan or Dominik can help shed some light on this one. I'm testing my iSCSITarget and iSCSILogicalUnit agents in a 2-node Pacemaker 1.0.4 cluster. If you don't feel like grokking the full config that

[Pacemaker] [DRBD-user] DRBD User's Guide 1.2.0rc2

2009-07-06 Thread Florian Haas
Hello everyone, I had originally planned to release the 1.2.0 version of the DRBD User's Guide today, but a few illustrations are still missing -- so instead I'm doing another release candidate, with the 1.2.0 release to follow in a few days' time. For your reference, here is what has changed

Re: [Pacemaker] Unexpected resource restarts after putting a node in standby mode

2009-07-06 Thread Florian Haas
Dejan, All those actions are fine, except for those restarts of the rg_iscsivg02 resource group on alice. What am I doing wrong? Not sure if there's anything wrong with the configuration. I would assume there must be a way to avoid these. I suspect that this has again to do with

Re: [Pacemaker] Unexpected resource restarts after putting a node in standby mode

2009-07-10 Thread Florian Haas
On 07/10/2009 01:33 PM, daniel peess wrote: hi florian, On Mon, Jul 06, 2009 at 08:00:38AM +0200, Florian Haas wrote: order o_drbd_before_iscsivg01 inf: ms_drbd_iscsivg01:promote rg_iscsivg01:start order o_drbd_before_iscsivg02 inf: ms_drbd_iscsivg02:promote rg_iscsivg02:start before

Re: [Pacemaker] Unexpected resource restarts after putting a node in standby mode

2009-07-10 Thread Florian Haas
On 07/10/2009 02:26 PM, Lars Marowsky-Bree wrote: On 2009-07-06T08:00:38, Florian Haas florian.h...@linbit.com wrote: ms ms_drbd_iscsivg01 res_drbd_iscsivg01 \ meta clone-max=2 clone-node-max=1 master-max=1 master-node-max=1 target-role=Started notify=true ms ms_drbd_iscsivg02

Re: [Pacemaker] stonith suice / chaining stonith agents

2009-07-23 Thread Florian Haas
http://clusterlabs.org/wiki/TODO * Implement cascading STONITH (If method A fails, try B, etc) Scheduled for 1.2, it seems. Unless Andrew has changed his mind. :) Florian On 2009-07-23 17:19, Bernd Schubert wrote: Hello, for suicides I would prefer to have a stonith agent that resets the

[Pacemaker] Announcing www.planet-ha.org

2009-08-18 Thread Florian Haas
Hello everyone, www.planet-ha.org has just been launched. Please take a look at the initial announcement at http://www.planet-ha.org/#Introducing+Planet+HA%21 Needless to say, blog feed submissions are more than welcome! So are suggestions regarding design, usability, and any other comments you

Re: [Pacemaker] Linux Plumbers Conference mini-conf on clustering?

2009-08-19 Thread Florian Haas
Lars, any update on whatever became of this proposal? Phil and myself are attending LPC next month -- who else is planning to attend? Cheers, Florian On 2009-04-14 00:53, Lars Marowsky-Bree wrote: Hi all, what do you think of a half-day / day long miniconference on clustering along LPC?

[Pacemaker] pacemaker-mgmt packages in CentOS 5 repo (again)

2009-08-19 Thread Florian Haas
Hello, post-1.0.5 and following the cluster-glue/agents split, the pacemaker-mgmt package in the CentOS 5 repository at OSBS appears to be broken again -- at least it breaks yum update: pacemaker-mgmt-1.99.2-1.2.x86_64 from server_ha-clustering has depsolving problems -- Missing Dependency:

Re: [Pacemaker] pacemaker-mgmt packages in CentOS 5 repo (again)

2009-08-19 Thread Florian Haas
.x86_64 conflicts with file from package libpacemaker3-1.0.4-23.1.x86_64 Missing Conflicts in pacemaker-libs? Cheers, Florian On 2009-08-19 11:23, Florian Haas wrote: Hello, post-1.0.5 and following the cluster-glue/agents split, the pacemaker-mgmt package in the CentOS 5 repository at OSBS

Re: [Pacemaker] pacemaker-mgmt packages in CentOS 5 repo (again)

2009-08-19 Thread Florian Haas
And one more: file /etc/ha.d/shellfuncs from install of heartbeat-3.0.0-32.1.x86_64 conflicts with file from package resource-agents-1.0-29.1.x86_64 Sorry for not rolling all of this into one email as I should have. Cheers, Florian

[Pacemaker] migrate_from/migrate_to for Stateful RAs

2009-09-01 Thread Florian Haas
Andrew, does the CRM/PE allow for resource migration with migrate_to/migrate_from for Master/Slave RAs? This would imply that at some point (during migration) multiple instances would be in the Master role, and master-max would be ignored (or implicitly incremented by 1) but while *not*

Re: [Pacemaker] migrate_from/migrate_to for Stateful RAs

2009-09-01 Thread Florian Haas
On 09/01/2009 08:30 AM, Andrew Beekhof wrote: only one layer of the stack is ever able to migrate. a resource can only migrate if its location and ordering prerequisites are satisfied on the source and target nodes. clearly this can't be true for drbd _and_ something running on top of it

Re: [Pacemaker] Drbd primary never on SyncTarget

2009-09-03 Thread Florian Haas
On 2009-09-03 14:31, Johan Verrept wrote: Hello, I have been trying to keep my cluster from making the synctarget the drbd primary, but I cannot get it to work. See the end of the email for versions I use. I am using the heartbeat::drbd OCF RA. That's the problem. Use DRBD 8.3.2, and

Re: [Pacemaker] pacemaker 1.0.5 packages for debian?

2009-09-03 Thread Florian Haas
Martin has built these for sid; lenny builds should follow tomorrow. Cheers, Florian On 09/03/2009 08:05 PM, Dan Urist wrote: There aren't any here: http://people.debian.org/~madkiss/ha/dists/lenny/main/binary-amd64/ Are packages not available yet, or am I looking in the wrong place?

Re: [Pacemaker] Location rule troubles

2009-09-04 Thread Florian Haas
Johan, If you monitor the output of 'ptest -LsV 2>&1 | grep -i "master score"', you'll notice that the RA automatically sets a low master score for a node that is SyncTarget. It updates the master score in other clever ways, too. So if you have two nodes available and one is a (DRBD) SyncTarget, it

Re: [Pacemaker] Recovering from failed stonith

2009-09-04 Thread Florian Haas
If it's not of any concern to you that your hard failover is not automatic, why not go with the meatware plugin (aka human intervention) in the first place? Just my two cents. Cheers, Florian On 2009-09-04 11:16, Erik Hensema / HostingXS wrote: Dear list, I'm currently setting up a cluster

Re: [Pacemaker] Drbd primary never on SyncTarget

2009-09-08 Thread Florian Haas
On 2009-09-08 13:35, Dejan Muhamedagic wrote: Hi, On Thu, Sep 03, 2009 at 02:15:49PM +0200, Jov wrote: Hello, I have been trying to keep my cluster from making the synctarget the drbd primary, but I cannot get it to work. See the end of the email for versions I use. I am using the

[Pacemaker] crm shell suggestions (was Re: Location rule troubles)

2009-09-08 Thread Florian Haas
On 2009-09-04 11:39, Johan Verrept wrote: Hello Florian, thank you for your help, the ptest commands was very useful. Perhaps this could be mentioned in the Pacemaker wiki under a Troubleshooting section? I read all the docs and didn't see it mentioned anywhere (could have missed it).

Re: [Pacemaker] Node attributes

2009-09-08 Thread Florian Haas
On 2009-09-04 14:45, Johan Verrept wrote: Hello, I have been thinking about how the DRBD RA influences the master placement by manipulating a temporary attribute. How does that work exactly? Does the crm interpret certain attributes? Does it work on all numerical attributes? If so, does

Re: [Pacemaker] Drbd primary never on SyncTarget

2009-09-08 Thread Florian Haas
On 2009-09-08 15:53, Johan Verrept wrote: On Tue, 2009-09-08 at 15:04 +0200, Florian Haas wrote: The question is moot in this case, though, as I already mentioned: all of this is handled via master preferences which the ocf:linbit:drbd RA updates automatically. Yes. This works nicely

Re: [Pacemaker] [DRBD-user] DRBD is not syncing over my routed network.

2009-09-09 Thread Florian Haas
[CCing the Pacemaker list here, in case we have people interested in following this discussion there] On 09/08/2009 05:52 PM, gary.w...@opengi.co.uk wrote: Hi. I am having a problem whilst carrying out some preliminary testing of DRBD with OpenAIS. My test setup was all working fine. I

Re: [Pacemaker] Problem with gratuitous arps in IPaddr2

2009-09-23 Thread Florian Haas
Replacing send_arp with arping has been discussed on IRC before, a decision on that is still outstanding. Up to Andrew and/or Dejan, I would think. Regardless, to answer your earlier question of whether IPaddr2 is working correctly or not on CentOS 5, depends on how you look at it. :) It is

[Pacemaker] Debian packaging input welcome: repositories set up at hg.linbit.com

2009-10-01 Thread Florian Haas
Hello everyone, as you know, Martin Loschwitz (madkiss) has picked up the torch for building and maintaining Debian packages for Pacemaker, Heartbeat, Glue, and Agents. He's been doing most of the work in a private checkout thus far, with the Debian build system bits being kept on

Re: [Pacemaker] Possible error in iSCSITarget RA

2009-10-01 Thread Florian Haas
On 2009-10-01 16:53, Michael Schwartzkopff wrote: Am Donnerstag, 1. Oktober 2009 16:49:59 schrieb Michael Schwartzkopff: Hi, I am experimenting a little bit with the iSCSITarget RA, but without success. My resource is defined like: primitive resIET ocf:heartbeat:iSCSITarget \ params

Re: [Pacemaker] Possible error in iSCSITarget RA

2009-10-01 Thread Florian Haas
On 2009-10-01 17:03, Thomas Georgiou wrote: Hi, I too found problems with the iSCSITarget RA, so I modified it and it seems to be working fine now. Attached is the modified RA. You attached iSCSILogicalUnit, yet say you had issues with iSCSITarget. I'm confused. If you have a patch to either

Re: [Pacemaker] Possible error in iSCSITarget RA

2009-10-01 Thread Florian Haas
Repeat: the very purpose is to utilize both. In any Pacemaker managed iSCSI target setup. And combining them into one is probably a really really bad idea. And: they don't run multiple ietd instances. They don't start daemons at all. Cheers, Florian On 2009-10-01 17:12, Thomas Georgiou wrote:

Re: [Pacemaker] [Linux-ha-dev] Debian packaging input welcome: repositories set up at hg.linbit.com

2009-10-03 Thread Florian Haas
On 10/03/2009 10:48 AM, Simon Horman wrote: There is _no guarantee_ that the staging repositories will always be up to date with the latest and greatest found at hg.linux-ha.org or hg.clusterlabs.org. Those willing to contribute patches to Pacemaker, Agents, Glue or Heartbeat should _not_ pull

Re: [Pacemaker] Low cost stonith device

2009-10-05 Thread Florian Haas
On 2009-10-05 14:28, Dejan Muhamedagic wrote: Hi, On Mon, Oct 05, 2009 at 02:01:49PM +0200, Florian Haas wrote: On 2009-10-05 10:37, Johan Verrept wrote: Hello guys, I completed the RA and have attached it. As far as I can tell it is fully functional but I would appreciate

Re: [Pacemaker] why use ocf::linbit:drbd instead of ocf::heartbeat:drbd?

2009-10-12 Thread Florian Haas
On 2009-10-10 10:37, xin.li...@cs2c.com.cn wrote: Hi all: As far as I know, drbd (8.3.2 and above) in pacemaker has 2 ocf scripts, one is from linbit, the other one is from heartbeat. In Andrew's Cluster from Scratch - Fedora 11, Configure the Cluster for DRBD, he uses ocf::linbit:drbd

Re: [Pacemaker] why use ocf::linbit:drbd instead ofocf::heartbeat:drbd?

2009-10-12 Thread Florian Haas
On 2009-10-12 12:13, darren.mans...@opengi.co.uk wrote: On 2009-10-10 10:37, xin.li...@cs2c.com.cn wrote: Hi all: As far as I know, drbd (8.3.2 and above) in pacemaker has 2 ocf scripts, one is from linbit, the other one is from heartbeat. In Andrew's Cluster from Scratch - Fedora 11,

Re: [Pacemaker] why use ocf::linbit:drbd instead of ocf::heartbeat:drbd?

2009-10-13 Thread Florian Haas
On 2009-10-13 09:40, Johan Verrept wrote: On Mon, 2009-10-12 at 09:06 +0200, Andrew Beekhof wrote: On Mon, Oct 12, 2009 at 8:43 AM, Florian Haas florian.h...@linbit.com wrote: Andrew, Dejan: as we consider the ocf:linbit:drbd RA stable as of the DRBD 8.3.4 release, is it acceptable to remove

Re: [Pacemaker] Human confirmation of dead node?

2009-10-13 Thread Florian Haas
Please be introduced to the meatware stonith plugin. Cheers Florian On 2009-10-13 15:23, J Brack wrote: Hi, I'm currently using heartbeat. I heard that I'm meant to be using pacemaker. I will switch in a heartbeat (sorry) if I can get pacemaker to do what I need. I have a clustered nfs

[Pacemaker] DRBD User's Guide 1.3.0

2009-10-15 Thread Florian Haas
Hello everyone, I have released and uploaded the DRBD User's Guide 1.3.0. Thanks to everyone who provided feedback and corrected errors! A few notable changes since the 1.2.0 release 2 months ago: - Lots of stuff about Pacemaker integration. Note in particular the new sections about 4-way

Re: [Pacemaker] Documentation challenge

2009-10-19 Thread Florian Haas
On 2009-10-19 11:58, Andrew Beekhof wrote: While I may be good at writing cluster software, I'm only average at documenting it and suck horribly at making pretty web pages. Currently I use Pages.app (an Apple editor) for writing the docs because it looks reasonable and lets me focus on the

Re: [Pacemaker] ocf:linbit:drbd monitor failed with stacked resource

2009-10-19 Thread Florian Haas
Torsten, the DRBD OCF RA (in 8.3.4) returns not configured in the following circumstances: - invalid configuration file name; - missing drbd_resource parameter; - master-max meta attribute set to 2, but allow-two-primaries not set in drbd.conf; - notify meta attribute set to false or unset. As
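The four "not configured" conditions listed in the message above can be sketched as a small check. This is only an illustration, not the ocf:linbit:drbd agent's actual code: the function names and arguments are hypothetical, "invalid configuration file name" is reduced to a boolean supplied by the caller, and the notify test accepts only the literal string "true". The return value 6 is the standard OCF code OCF_ERR_CONFIGURED.

```python
# Hypothetical sketch of the four conditions named in the message;
# this is NOT taken from the ocf:linbit:drbd resource agent.
OCF_SUCCESS = 0
OCF_ERR_CONFIGURED = 6  # OCF return code meaning "not configured"


def drbd_config_problems(conf_file_valid, drbd_resource, master_max,
                         allow_two_primaries, notify):
    """Return a list of reasons the agent would report 'not configured'."""
    problems = []
    if not conf_file_valid:
        problems.append("invalid configuration file name")
    if not drbd_resource:
        problems.append("missing drbd_resource parameter")
    if master_max == 2 and not allow_two_primaries:
        problems.append("master-max=2 but allow-two-primaries not set")
    # Simplification (assumption): only the literal "true" counts as enabled.
    if str(notify).lower() != "true":
        problems.append("notify meta attribute false or unset")
    return problems


def exit_code(problems):
    """Map the list of problems to an OCF exit code."""
    return OCF_ERR_CONFIGURED if problems else OCF_SUCCESS


if __name__ == "__main__":
    # Example: dual-primary requested in the CIB but not allowed in drbd.conf.
    found = drbd_config_problems(True, "r0", 2, False, "true")
    print(found, exit_code(found))
```

Collecting every failed condition, rather than stopping at the first one, makes it easier to see at a glance which of the four causes apply to a given resource definition.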

Re: [Pacemaker] Problems with colocation/order with drbdtree-node-setup

2009-10-20 Thread Florian Haas
On 2009-10-20 14:36, Andrew Beekhof wrote: im i correct that i only have to load the drbd kernel-module on node 1 + 2; pacemaker should do the rest (configuration, promote/demote...) right? I think the RA does that too. But the RA also expects clone-max=2 and will return not configured if

[Pacemaker] Missing autoconf check for sensors.h?

2009-10-20 Thread Florian Haas
Andrew, building current tip against Heartbeat on CentOS 5, my build fails with: [...] gcc -g -O2 -I/usr/include/heartbeat -ggdb3 -O0 -fgnu89-inline -fstack-protector-all -Wall -Waggregate-return -Wbad-function-cast -Wcast-qual -Wcast-align -Wdeclaration-after-statement -Wendif-labels

Re: [Pacemaker] Missing autoconf check for sensors.h?

2009-10-21 Thread Florian Haas
On 10/21/2009 09:01 AM, Andrew Beekhof wrote: On Wed, Oct 21, 2009 at 8:53 AM, Florian Haas florian.h...@linbit.com wrote: OK, so building on a system where net-snmp-devel is not installed would circumvent this problem? yep, or if you installed the sensors devel package yeah, that I

[Pacemaker] Why are fatal warnings enabled by default?

2009-10-21 Thread Florian Haas
Andrew, Dejan, For pacemaker and agents, configure defaults to --enable-fatal-warnings. AFAIR, neither of these have ever built successfully with fatal warnings enabled. Is there a specific reason to keep the default as it is? Is this perhaps a deliberate entry barrier for packagers, so as to

Re: [Pacemaker] Why are fatal warnings enabled by default?

2009-10-21 Thread Florian Haas
On 2009-10-21 14:36, Dejan Muhamedagic wrote: The warnings being? In agents, a simple ./configure make leads to: [...] gmake[1]: Entering directory `/home/rpmbuild/hg/cluster-agents/heartbeat' if gcc -DHAVE_CONFIG_H -I. -I. -I../include -I../include -I../include -I../linux-ha

Re: [Pacemaker] Pacemaker cluster: OpenAis communication channels

2009-10-22 Thread Florian Haas
Steve, what has repeatedly come up is that RRP links don't auto-heal (see thread: http://oss.clusterlabs.org/pipermail/pacemaker/2009-May/001784.html), and that passive mode RRP seems to not work at all (see thread: https://lists.linux-foundation.org/pipermail/openais/2009-October/013095.html --

Re: [Pacemaker] problem setting up STONITH for DRAC

2010-01-11 Thread Florian Haas
On 2010-01-11 08:15, frank wrote: Hi Sander, I also have many troubles with drac stonith, and finally I had to change the source code to make drac5 stonith work; Did you send a patch? Where's the bugzilla entry for this? but with drac6 it failed again because output strings are different.

Re: [Pacemaker] OCF scripts for cups and courier-imap

2010-01-11 Thread Florian Haas
On 2010-01-09 10:00, cheibi welid wrote: Hi all, I am currently testing a HA cluster with DRBD (Pacemaker, Heartbeat, DRBD) and i need OCF scripts for cups and courier-imap. I want to know if those scripts already exist or should i develop them and in this case, is there some guides or

Re: [Pacemaker] [Linux-HA] Multiple Choice test for cluster knowledge

2010-01-13 Thread Florian Haas
On 2010-01-13 14:19, darren.mans...@opengi.co.uk wrote: Yes please in English for both! Have you (or anyone else) thought of doing a Linux-HA certification? Yes. We have. It's called DRBD Certified Engineer but actually covers not only DRBD, but Heartbeat, Corosync and Pacemaker also and will

Re: [Pacemaker] OCF scripts for cups and courier-imap

2010-01-13 Thread Florian Haas
On 01/12/2010 06:42 PM, cheibi welid wrote: On Mon, Jan 11, 2010 at 9:20 AM, Florian Haas florian.h...@linbit.com mailto:florian.h...@linbit.com wrote: On 2010-01-09 10:00, cheibi welid wrote: Hi all, I am currently testing a HA cluster with DRBD (Pacemaker

Re: [Pacemaker] Discussion about a Cluster knowledge test

2010-01-14 Thread Florian Haas
On 2010-01-14 10:06, Michael Schwartzkopff wrote: Hi, this post is a follow up to the remarks of LMB on the O'Reilly blog about the multiple choice test about cluster knowledge. I want to sum up my comments about testing knowledge here. I'd like to have a test about the knowledge of

Re: [Pacemaker] Discussion about a Cluster knowledge test

2010-01-14 Thread Florian Haas
Folks, since I don't want to be accused of spamming the list -- I replied off-list to those who asked me specifics about our certification offering. Anyone else, feel free to contact me off-list also, or use the contact form on our web site. Cheers, Florian

Re: [Pacemaker] FYI: Ubuntu looking for Pacemaker testers

2010-01-15 Thread Florian Haas
On 01/15/2010 06:23 PM, Andrew Beekhof wrote: Ubuntu is looking to switch its supported cluster stack to Corosync+Pacemaker and has put out a Call for testers. Just to clarify: that they are switching (from RHCS) is correct, but IIUC the supported cluster stack will be Pacemaker regardless

Re: [Pacemaker] Split Site 2-way clusters

2010-01-18 Thread Florian Haas
On 2010-01-18 11:41, Colin wrote: Hi All, we are currently looking at nearly the same issue, in fact I just wanted to start a similarly titled thread when I stumbled over these messages… The setup we are evaluating is actually a 2*N-node-cluster, i.e. two slightly separated sites with N

Re: [Pacemaker] Pre-Announce: End of 0.6 support is near

2010-01-18 Thread Florian Haas
On 2010-01-18 11:18, Andrew Beekhof wrote: Biggest caveat is the networking issue that makes pacemaker 1.0 wire-incompatible with pacemaker 0.6 (and heartbeat 2.1.x). So rolling upgrades are out and you'd need to look at one of the other upgrade strategies. Even though I've bugged you about

Re: [Pacemaker] Orphan resource problem

2010-01-22 Thread Florian Haas
crm resource cleanup orphan_resource_name? Cheers, Florian On 2010-01-22 15:02, David Henningsson wrote: Hi, I'm trying to setup a cluster with DRBD running master/slave (primary/secondary, I assume) and a kvm virtual machine. I have trouble getting them up and running, because it thinks

Re: [Pacemaker] Auto-restart service on IP shift?

2010-02-04 Thread Florian Haas
On 02/04/2010 06:27 PM, Andreas Mock wrote: What about using a cloned IP too? Then there'd be no need for rebinding. Hi Andrew, just read this and I wanted to know how this works. The same IP on more than 1 node? Which RA does support this? Best regards Andreas

[Pacemaker] [PATCH 1 of 4] Medium: build: service_crm.so is Corosync only, don't try to remove it if it wasn't built

2010-02-05 Thread Florian Haas
# HG changeset patch # User Florian Haas florian.h...@linbit.com # Date 1265372627 -3600 # Branch stable-1.0 # Node ID 512ca0b73892df791160264d73d96ee095f613ae # Parent 6f67420618b0eb52b3e61abc401e9a2cb5dd8dbd Medium: build: service_crm.so is Corosync only, don't try to remove it if it wasn't

[Pacemaker] [PATCH 4 of 4] Medium: build: add compatibility wrappers for conditional builds in legacy RPM versions

2010-02-05 Thread Florian Haas
# HG changeset patch # User Florian Haas florian.h...@linbit.com # Date 1265375521 -3600 # Branch stable-1.0 # Node ID 9f9d1d22197dcda845e5d624addfe5159605c14f # Parent 35eba1386aff888c38b181054c5afcb0f80a2910 Medium: build: add compatibility wrappers for conditional builds in legacy RPM

[Pacemaker] [PATCH 2 of 4] Medium: build: enable --without heartbeat and --without ais as advertised

2010-02-05 Thread Florian Haas
# HG changeset patch # User Florian Haas florian.h...@linbit.com # Date 1265375104 -3600 # Branch stable-1.0 # Node ID efc54adb199234e3e7fc8c9b5e2cf3b1924e0923 # Parent 512ca0b73892df791160264d73d96ee095f613ae Medium: build: enable --without heartbeat and --without ais as advertised The package

Re: [Pacemaker] drbd doesn't work on simple 2-node cluster.

2010-02-10 Thread Florian Haas
On 02/10/2010 01:02 PM, Andrew Beekhof wrote: target-role=Started --- this isn't going to be helping. Why not? I ask because I used that exact meta param today and it worked like a charm. Florian

[Pacemaker] [PATCH] High: build: fix autoconf after crm shell modularization

2010-02-15 Thread Florian Haas
# HG changeset patch # User Florian Haas florian.h...@linbit.com # Date 1266222386 -3600 # Branch stable-1.0 # Node ID 439fd4efe28a870acff433eda0cf702853bcbfae # Parent fdf7240ea4e1e2d742b722587bcfb90bf6cb4f4c High: build: fix autoconf after crm shell modularization Un-break autoconf after tools

Re: [Pacemaker] [PATCH] Medium: build: require Python 2.4 or later

2010-02-15 Thread Florian Haas
On 2010-02-15 10:17, Dejan Muhamedagic wrote: Applied too. Cheers, Dejan P.S. If necessary, we could add some compatibility layer for older pythons. Actually, in the very beginning, old popen was used, then replaced because of the ugly will be obsoleted warning messages :-/ Of course

Re: [Pacemaker] ocf_is_probe() problem on debian

2010-02-15 Thread Florian Haas
Tom, thanks: http://hg.linux-ha.org/agents/rev/44b1ba8c7804 See http://developerbugs.linux-foundation.org/show_bug.cgi?id=2284 for the discussion. Cheers, Florian On 2010-02-15 12:46, Tom Weber wrote: Hello, the setup: debian lenny based pacemaker cluster. these packages from

[Pacemaker] [PATCH 2 of 2] Low: build: fix Requires for libesmtp

2010-02-18 Thread Florian Haas
# HG changeset patch # User Florian Haas florian.h...@linbit.com # Date 1266490897 -3600 # Branch stable-1.0 # Node ID ebbd9841d03a7b5205a6de4904c9474f5b76c23f # Parent cc3c3e83e77fbde81186c0b9ca95b08402942092 Low: build: fix Requires for libesmtp Add a Requires tag for esmtp conditional builds

[Pacemaker] [PATCH 1 of 2] Low: build: add --without snmp flag to RPM spec

2010-02-18 Thread Florian Haas
# HG changeset patch # User Florian Haas florian.h...@linbit.com # Date 1266490897 -3600 # Branch stable-1.0 # Node ID cc3c3e83e77fbde81186c0b9ca95b08402942092 # Parent 9600297d047874796cbebf3fcfb679e2fd412b98 Low: build: add --without snmp flag to RPM spec SNMP traps only work from net-snmp 5.4

[Pacemaker] [PATCH 0 of 2] More RPM spec fixes

2010-02-18 Thread Florian Haas
Andrew, another couple of patches to the RPM spec: 1. Require net-snmp 5.4. As discussed on IRC, netsnmp_transport_open_client is not available in 5.3. This currently creates an awkward situation where crm_mon is being linked against the net-snmp libraries, but it can't actually send

Re: [Pacemaker] crm_mon with sending mails

2010-02-22 Thread Florian Haas
Check if you built with libesmtp-devel installed. Check if you have libesmtp installed. Then, run crm_mon -N - -H [smtp-server] -T [my_mailaddress] and trigger a resource transition. If it's not sending out an email, it will print an error message detailing the reason. Hope this helps.

Re: [Pacemaker] HB 3.0.2 + Pacemaker 1.0.7-1 NOT working :(

2010-02-22 Thread Florian Haas
man ha.cf Note that ucast directives which go to the local machine are effectively ignored. This allows the ha.cf directives on all machines to be identical. Your ha.cf _must_ be identical across nodes. You got your ucast directives wrong. You need two of them. Hope this helps, Florian

Re: [Pacemaker] Updated VMwareVM resource agent

2010-03-01 Thread Florian Haas
On 02/28/2010 10:36 PM, Cristian Mammoli - Apra Sistemi wrote: In 2008 I posted on the linux-ha ML a resource agent script for vmware server 2 virtual machines. It has been slightly modified and shipped with heartbeat and (now) pacemaker. Actually the script is broken: the script is named

Re: [Pacemaker] DRBD Management Console 0.6.0

2010-03-01 Thread Florian Haas
On 2010-02-28 21:09, Cristian Mammoli - Apra Sistemi wrote: Cristian Mammoli - Apra Sistemi wrote: Cause I am the one who wrote it in the first place and the version I use has some minor improvement and fixes. And the version shipped with pacemaker is different from the one I posted 2

Re: [Pacemaker] Updated VMwareVM resource agent

2010-03-01 Thread Florian Haas
On 2010-03-01 09:56, Cristian Mammoli - Apra Sistemi wrote: Florian Haas wrote: There are a few other upstream changes you may have missed. Could you please rebase your patch on current upstream and send a hg export patch? Or, alternatively, could you diff your new version against the one you

Re: [Pacemaker] Updated VMwareVM resource agent

2010-03-01 Thread Florian Haas
Cristian, I am continuing this thread in linux-ha-dev; it really doesn't belong on the Pacemaker list. Cheers, Florian

Re: [Pacemaker] Updated VMwareVM resource agent

2010-03-01 Thread Florian Haas
On 2010-03-01 13:02, Dejan Muhamedagic wrote: Hi, On Sun, Feb 28, 2010 at 10:36:32PM +0100, Cristian Mammoli - Apra Sistemi wrote: In 2008 I posted on the linux-ha ML a resource agent script for vmware server 2 virtual machines. It has been slightly modified and shipped with heartbeat and

Re: [Pacemaker] Dropping HeartBeat Stack?

2010-03-03 Thread Florian Haas
On 03/03/2010 09:24 AM, Andrew Beekhof wrote: 2- Andrew, when shall we see Pacemaker in RHEL instead of Red Hat Cluster Suite? Red Hat has a very strict policy about discussing what may or may not be part of current and/or future Red Hat products. Having said that, I got approval to

Re: [Pacemaker] DRBD and fencing

2010-03-09 Thread Florian Haas
On 03/09/2010 06:07 AM, Martin Aspeli wrote: Hi folks, Let's say I have a two-node cluster with DRBD and OCFS2, with a database server that's supposed to be active on one node at a time, using the OCFS2 partition for its data store. *cringe* Which database is this? Florian

Re: [Pacemaker] [PATCH] SNMP for net-snmp5.3

2010-03-10 Thread Florian Haas
Andrew, your thoughts on this patch? FWIW I'd strongly second the idea of supporting Net-SNMP 5.3; otherwise Pacemaker SNMP notifications won't work on RHEL 5. Cheers, Florian On 03/10/2010 08:47 AM, Simon Horman wrote: On Wed, Mar 10, 2010 at 02:57:12PM +0900, sato yuki wrote: Hello all.

Re: [Pacemaker] Still get unknown expected votes on debian lenny

2010-03-10 Thread Florian Haas
On 03/10/2010 01:40 PM, Michael Schwartzkopff wrote: Hi, I just installed the latest pacemaker/corosync from madkiss' repository on my lenny and still get the unknown expected votes error on my cluster. Any idea how to resolve? Sure. Build from a clone of

Re: [Pacemaker] [PATCH] Medium: build: require Net-SNMP 5.3 or later

2010-03-19 Thread Florian Haas
On 03/18/2010 10:02 AM, Andrew Beekhof wrote: On Wed, Mar 17, 2010 at 11:02 AM, Dejan Muhamedagic deja...@fastmail.fm wrote: Hi, On Wed, Mar 17, 2010 at 09:17:38AM +0100, Florian Haas wrote: Andrew, now that Pacemaker has been on a bi-monthly release schedule for a while, is there any

Re: [Pacemaker] node states

2010-03-19 Thread Florian Haas
On 03/17/2010 09:30 PM, Andrew Beekhof wrote: On Wed, Mar 17, 2010 at 7:53 PM, Matthew Palmer mpal...@hezmatt.org wrote: On Wed, Mar 17, 2010 at 07:16:16AM -0500, Schaefer, Diane E wrote: We were wondering what the node state of UNCLEAN, with the three variations of online, offline and

Re: [Pacemaker] [PATCH] Medium: build: require Net-SNMP 5.3 or later

2010-03-19 Thread Florian Haas
On 03/19/2010 03:39 PM, Andrew Beekhof wrote: Who said you should build RC _packages_? Tag an RC, upload a tarball, announce on mailing list, done. How is that extra work? No wait, Pacemaker builds directly from a Mercurial tarball. So scratch the upload part. What does the tag achieve

Re: [Pacemaker] Pacemaker in VMware guests

2010-03-23 Thread Florian Haas
On 03/23/2010 10:00 PM, Andrew Beekhof wrote: On Tue, Mar 23, 2010 at 9:47 PM, Matthias Schlarb mschl...@vmware.com wrote: Hi, I'm aware of the external/vmware plugin and want to ask if someone did already some tests with it and would share the results. I was using it a while ago, but it

Re: [Pacemaker] Someone using ibmrsa-telnet external stonith plugin?

2010-03-26 Thread Florian Haas
On 03/25/2010 10:28 PM, Andreas Mock wrote: -Original Message- From: Florian Haas florian.h...@linbit.com Sent: 25.03.2010 16:23:59 To: The Pacemaker cluster resource manager pacemaker@oss.clusterlabs.org Subject: Re: [Pacemaker] Someone using ibmrsa-telnet external stonith

Re: [Pacemaker] Question about Dual Primary DRBD + OCFS2

2010-03-26 Thread Florian Haas
On 2010-03-26 12:54, r...@free.fr wrote: Oh, debian i686. Figures. You'll need to rebuild the packages yourself, the ones from Madkiss' repo don't seem to work for i686. Hi Andrew, thanks for your answer. Which packages I need to rebuild ? All of them ? I need to start from scratch

Re: [Pacemaker] CIB write-to-disk bug?

2010-04-01 Thread Florian Haas
On 2010-04-01 16:27, Alan Robertson wrote: None of them verified. All the nodes in the cluster failed the test at the same time - and now I have no official CIBs on disk - on any cluster nodes... I sent Andrew all the CIBs, and all the core files, and basically everything under

Re: [Pacemaker] Searching for a viable Debian solution

2010-04-25 Thread Florian Haas
Paul, I am copying your message over to the Debian HA maintainers' mailing list. Chances are that one of those guys can share some valuable insight. Debian maintainers, when you respond would you mind copying the Pacemaker list? Cheers, Florian On 04/24/2010 06:01 AM, Paul Gear wrote: Hi
