Re: [Linux-ha-dev] interface monitoring

2018-08-01 Thread Alan Robertson
sts.linux-ha.org/mailman/listinfo/linux-ha-dev > Home Page: http://linux-ha.org/ -- Alan Robertson al...@unix.sh ___ Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev Home Page: http://linux-ha.org/

[Linux-ha-dev] Looking for a few good queries ;-)

2015-01-08 Thread Alan Robertson
probably ought to be paths... At least that's what comes to my mind... The reasons why I think this is valuable is for the GUI. This is exactly what we need for displaying a group of related servers/services in the GUI... -- Alan Robertson al...@unix.sh

[Linux-ha-dev] Encryption progress...

2014-12-08 Thread Alan Robertson
packets to encourage the protocol to recover from lost packets. This code turned out to be simpler than I thought it would be. That's a rarity, for sure! -- Alan Robertson al...@unix.sh ___ Linux-HA-Dev: Linux-HA-Dev

Re: [Linux-ha-dev] RFC: pidfile handling; current worst case: stop failure and node level fencing

2014-10-24 Thread Alan Robertson
On 10/24/2014 03:32 AM, Lars Marowsky-Bree wrote: On 2014-10-23T20:36:38, Lars Ellenberg lars.ellenb...@linbit.com wrote: If we want to require presence of start-stop-daemon, we could make all this somebody elses problem. I need find some time to browse through the code to see if it can be

Re: [Linux-ha-dev] RFC: pidfile handling; current worst case: stop failure and node level fencing

2014-10-22 Thread Alan Robertson
On 10/22/2014 03:33 AM, Dejan Muhamedagic wrote: Hi Alan, On Mon, Oct 20, 2014 at 02:52:13PM -0600, Alan Robertson wrote: For the Assimilation code I use the full pathname of the binary from /proc to tell if it's one of mine. That's not perfect if you're using an interpreted language

Re: [Linux-ha-dev] RFC: pidfile handling; current worst case: stop failure and node level fencing

2014-10-22 Thread Alan Robertson
On 10/22/2014 07:09 AM, Dejan Muhamedagic wrote: On Wed, Oct 22, 2014 at 06:50:37AM -0600, Alan Robertson wrote: On 10/22/2014 03:33 AM, Dejan Muhamedagic wrote: Hi Alan, On Mon, Oct 20, 2014 at 02:52:13PM -0600, Alan Robertson wrote: For the Assimilation code I use the full pathname

Re: [Linux-ha-dev] RFC: pidfile handling; current worst case: stop failure and node level fencing

2014-10-22 Thread Alan Robertson
On 10/22/2014 07:11 AM, Tim Small wrote: On 22/10/14 13:50, Alan Robertson wrote: Does anyone know which OSes have either or both of those /proc names? Once again, can I recommend taking a look at the start-stop-daemon source (see earlier posting), which does this stuff, and includes checks

Re: [Linux-ha-dev] RFC: pidfile handling; current worst case: stop failure and node level fencing

2014-10-21 Thread Alan Robertson
On 10/21/2014 2:29 AM, Lars Ellenberg wrote: On Mon, Oct 20, 2014 at 11:21:36PM +0200, Lars Ellenberg wrote: On Mon, Oct 20, 2014 at 03:04:31PM -0600, Alan Robertson wrote: On 10/20/2014 02:52 PM, Alan Robertson wrote: For the Assimilation code I use the full pathname of the binary from /proc

Re: [Linux-ha-dev] RFC: pidfile handling; current worst case: stop failure and node level fencing

2014-10-20 Thread Alan Robertson
For the Assimilation code I use the full pathname of the binary from /proc to tell if it's one of mine. That's not perfect if you're using an interpreted language. It works quite well for compiled languages. On 10/20/2014 01:17 PM, Lars Ellenberg wrote: Recent discussions with Dejan made me

Re: [Linux-ha-dev] RFC: pidfile handling; current worst case: stop failure and node level fencing

2014-10-20 Thread Alan Robertson
On 10/20/2014 03:21 PM, Lars Ellenberg wrote: On Mon, Oct 20, 2014 at 03:04:31PM -0600, Alan Robertson wrote: On 10/20/2014 02:52 PM, Alan Robertson wrote: For the Assimilation code I use the full pathname of the binary from /proc to tell if it's one of mine. That's not perfect if you're

[Linux-ha-dev] Prototype REST code

2013-11-25 Thread Alan Robertson
that comes to mind is to allow a client to subscribe to changes to things. But before that it would be cool to have any kind of a primitive piece of code that used this interface. -- Alan Robertson al...@unix.sh - @OSSAlanR Openness is the foundation and preservative of friendship... Let

Re: [Linux-ha-dev] [Linux-HA] Announcing release 0.1.0 of the Assimilation Monitoring Project!

2013-05-02 Thread Alan Robertson
I ran across it while doing some research. As you said, it's a different focus - but related. Thanks for the heads-up! On 04/24/2013 04:09 AM, Lars Ellenberg wrote: On Fri, Apr 19, 2013 at 03:01:51PM -0600, Alan Robertson wrote: Hi all, Hi Alan! Good to see progress on this Project. Did

[Linux-ha-dev] Announcing release 0.1.0 of the Assimilation Monitoring Project!

2013-04-19 Thread Alan Robertson
TechTarget Interview: http://bit.ly/17M6DK2 Join the mailing list: http://lists.community.tummy.com/cgi-bin/mailman/listinfo/assimilation Join the mailing list, download the code, try it out, and send your comments and questions to the list! Thanks and have a great weekend! -- Alan

Re: [Linux-ha-dev] Slight bending of OCF specs: Re: Issues found in Apache resource agent

2012-09-12 Thread Alan Robertson
On 09/12/2012 05:14 AM, Lars Marowsky-Bree wrote: On 2012-09-11T15:04:55, Alan Robertson al...@unix.sh wrote: Depends. Pacemaker may still care about the status of these agents. If it can't start or stop them, what can it do with them? The status from these agents may feed into operations

Re: [Linux-ha-dev] Slight bending of OCF specs: Re: Issues found in Apache resource agent

2012-09-11 Thread Alan Robertson
On 09/08/2012 02:53 PM, Lars Marowsky-Bree wrote: On 2012-09-07T13:46:27, Alan Robertson al...@unix.sh wrote: Well, I presume that one would not tell pacemaker about such agents, as they would not be useful to pacemaker. From the point of view of the crm command, you wouldn't consider them

Re: [Linux-ha-dev] Slight bending of OCF specs: Re: Issues found in Apache resource agent

2012-09-07 Thread Alan Robertson
wouldn't consider them as valid resource agents to put in a configuration for pacemaker. People would instead use the nginx or apache agents that _do_ know how to start and stop things. -- Alan Robertson al...@unix.sh - @OSSAlanR Openness is the foundation and preservative of friendship

[Linux-ha-dev] Slight bending of OCF specs: Re: Issues found in Apache resource agent

2012-09-04 Thread Alan Robertson
create some new metadata to document it, and also not put start and stop into the actions in the operations section. Or just the latter. What do you think? On 08/29/2012 05:31 AM, Dejan Muhamedagic wrote: Hi Alan, On Mon, Aug 27, 2012 at 10:51:15AM -0600, Alan Robertson wrote: Hi, I

[Linux-ha-dev] Issues found in Apache resource agent

2012-08-27 Thread Alan Robertson
to interpret it as the next lower supported value for OCF_CHECK_LEVEL. In effect, there are no invalid OCF_CHECK_LEVEL values. The Apache agent declared all values but one to be errors. This is not the correct behavior. -- Alan Robertson al...@unix.sh - @OSSAlanR Openness is the foundation

Re: [Linux-ha-dev] LRM bug

2012-08-07 Thread Alan Robertson
On 08/07/2012 08:18 AM, Dejan Muhamedagic wrote: Hi Alan, On Mon, Jul 30, 2012 at 10:14:27AM -0600, Alan Robertson wrote: The LRM treats operation timeouts as ERROR:s - not just failed operations that give warnings. This violates the meaning of ERROR: messages in the code. We reserved

Re: [Linux-ha-dev] Probable sub-optimal behavior in ucast

2012-07-31 Thread Alan Robertson
an idea what could cause IPC to behave this way I'd be happy to know what it was... -- Alan Robertson al...@unix.sh ___ Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev Home Page

Re: [Linux-ha-dev] Probable sub-optimal behavior in ucast

2012-07-31 Thread Alan Robertson
On 07/31/2012 06:08 AM, Alan Robertson wrote: I wasn't sure, actually - because of the troubles mentioned above. I'll check back in and let you know... Only two of the read processes are accumulating any CPU - it's the last one on each interface. You hit it spot on. Thanks Lars

[Linux-ha-dev] LRM bug

2012-07-30 Thread Alan Robertson
, you can't tell customers If you ever have an ERROR: message, the HA software has failed. This ought to just be a warning, like any other failed action... -- Alan Robertson al...@unix.sh - @OSSAlanR Openness is the foundation and preservative of friendship... Let me claim from you at all

[Linux-ha-dev] Probable sub-optimal behavior in ucast

2012-07-30 Thread Alan Robertson
in work on the master control process. And I don't have to uncompress them to throw them away - I can just look at the source IP address... What do you think? -- Alan Robertson al...@unix.sh - @OSSAlanR Openness is the foundation and preservative of friendship... Let me claim from

Re: [Linux-ha-dev] STONITH agent for SoftLayer API

2012-06-07 Thread Alan Robertson
Red Hat invented their own API then disabled the working API in their version of the code. Of course, they don't have as many agents, and they're not as well tested and the API is a bit odd. But what do I know, since I invented the Linux-HA API. On 6/7/2012 3:50 PM, Brad Jones wrote: Can

Re: [Linux-ha-dev] [RFC] IPaddr2: Proposal patch to support the dual stack of IPv4 and IPv6.

2012-06-04 Thread Alan Robertson
On 06/04/2012 12:32 AM, Keisuke MORI wrote: Hi Alan, Thank you for your comments. It's an interesting idea, but I don't think we need to care about IPv4 link-local addresses because users can configure using the same manner as a regular IP address. (and it's used very rarely) In the case

[Linux-ha-dev] The role of dependencies in managing computer systems

2012-06-04 Thread Alan Robertson
Hi, I recently wrote a blog post on the importance of dependencies in managing computer systems. Of course, a small number of dependencies are modelled very nicely by things like Pacemaker, but the picture is much bigger than those dependencies. Read more here:

[Linux-ha-dev] USENIX Configuration Management Summit

2012-06-04 Thread Alan Robertson
Hi, For those of you interested in managing servers beyond high-availability (which is likely many of you), you might be interested in next week's USENIX Configuration Management Summit. It will be held in Boston, and will feature a series of interesting talks, including one by me on the

Re: [Linux-ha-dev] [RFC] IPaddr2: Proposal patch to support the dual stack of IPv4 and IPv6.

2012-05-31 Thread Alan Robertson
It's straightforward to determine if an IP address is link-local or not - for an already configured address. 3: eth1: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qdisc pfifo_fast state UP qlen 1000 link/ether 94:db:c9:3f:7c:20 brd ff:ff:ff:ff:ff:ff inet 10.10.10.30/24 brd 10.10.10.255 *scope

Re: [Linux-ha-dev] [Patch] The patch which revises memory leak.

2012-05-08 Thread Alan Robertson
FYI: there is code in the heartbeat communication layer which is quite happy to simulate lost packets. I made it difficult to turn on accidentally. Read the code for details if you're interested. On 04/30/2012 10:21 PM, renayama19661...@ybb.ne.jp wrote: Hi Lars, We confirmed that this

Re: [Linux-ha-dev] [Patch] The patch which revises memory leak.

2012-05-02 Thread Alan Robertson
This is very interesting. My apologies for missing this memory leak :-(. The code logs memory usage periodically exactly to help notice such a thing. In my new open source project [http://assimmon.org], I am death on memory leaks. But I can assure you that back when that code was written,

Re: [Linux-ha-dev] Core Dump When Sending to Other Node That's Resetting

2012-04-18 Thread Alan Robertson
Have you tried running this under valgrind? On 04/13/2012 05:22 PM, Nguyen Dinh Phong wrote: Hi, I wrote a wrapper using hbclient api for an application that manages the redundancy of our system. The application uses the wrapper to send/receive messages (string) between the primary and

Re: [Linux-ha-dev] Another problem with the heartbeat init script...

2012-04-01 Thread Alan Robertson
On 03/30/2012 04:58 PM, Lars Ellenberg wrote: On Mon, Mar 26, 2012 at 02:28:42PM -0600, Alan Robertson wrote: Earlier I mentioned the problem with portreserve (which was apparently ignored?) No. But I cached a cold. Sorry to hear that :-(. Hope you're feeling better now. You have my full

Re: [Linux-ha-dev] Patch: pgsql streaming replication

2012-03-26 Thread Alan Robertson
On 03/20/2012 03:40 AM, Lars Marowsky-Bree wrote: On 2012-03-19T16:29:23, Soffen, Matthewmsof...@iso-ne.com wrote: I believe that the reason for not using #bash is that it is it NOT part of the default install on non Linux systems. That is what package dependencies are for. Matt's point is

Re: [Linux-ha-dev] Occasionally heartbeat doesn't start...

2012-03-12 Thread Alan Robertson
it - no harm comes from installing the file and attempting to run portrelease. But for those that provide it, it is a help. On 03/12/2012 05:43 AM, Lars Ellenberg wrote: On Fri, Mar 09, 2012 at 11:52:56AM -0700, Alan Robertson wrote: Hi, I've been investigating an HA configuration

[Linux-ha-dev] Occasionally heartbeat doesn't start...

2012-03-09 Thread Alan Robertson
Hi, I've been investigating an HA configuration for a customer. One time in testing heartbeat didn't start, because rpcbind had stolen its reserved port. Restarting rpcbind made it choose a different random port. This is definitely an interesting problem - even if it doesn't happen very

[Linux-ha-dev] Assimilation Monitoring Project on Twitter

2011-12-24 Thread Alan Robertson
Hi, I'm giving a little more detailed status on the Assimilation project's development progress on Twitter - on @OSSAlanR. -- Alan Robertsonal...@unix.sh - @OSSAlanR Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised

Re: [Linux-ha-dev] Additional changes made via DHCPD review process

2011-12-06 Thread Alan Robertson
I agree about avoiding the feature to sync config files. My typical recommendation is to use drbdlinks and put it on replicated or shared storage. In fact, I do that at home, and are doing it for a current customer. By the way, Sean has recently revised drbdlinks to support the OCF API.

Re: [Linux-ha-dev] Stonith turns node names to lowercase

2011-10-20 Thread Alan Robertson
On 10/19/2011 04:11 AM, Lars Marowsky-Bree wrote: On 2011-10-18T12:40:40, Florian Haasflor...@hastexo.com wrote: g_strdown(nodecopy); Is there a reason for this ? I suppose Dejan will accept a patch making this configurable. Please, no. We fence by hostname; hostnames are case

Re: [Linux-ha-dev] attrd and repeated changes

2011-10-20 Thread Alan Robertson
On 10/20/2011 03:41 AM, Philipp Marek wrote: Hello, when constantly sending new data via attrd the changes are never used. Example: while sleep 1 do attrd_updater -l reboot -d 5 -n rep_chg -U try$SECONDS cibadmin -Ql | grep rep_chg done This always returns the same

Re: [Linux-ha-dev] endian oddity in cluster glue

2011-09-06 Thread Alan Robertson
On 09/05/2011 09:02 AM, Pádraig Brady wrote: I was looking at extricating some logic from cluster-glue-libs and noticed strangeness wrt the endianess checking. CONFIG_BIG_ENDIAN is defined on my x86 machine, which is due to configure.ac referencing a non existent byteorder_test.c To fix this

Re: [Linux-ha-dev] Announcing - the Assimilation monitoring system - a sub-project of Linux-HA

2011-08-19 Thread Alan Robertson
On 08/17/2011 09:15 PM, Digimer wrote: Linux is fairly described as an ecosystem. Differing branches and methods of solving a given problem are tried, and the one with the most backing and merit wins. It's part of what makes open-source what it is. So, from my point of view, best of luck to

Re: [Linux-ha-dev] Announcing - the Assimilation monitoring system - a sub-project of Linux-HA

2011-08-17 Thread Alan Robertson
On 08/16/2011 05:08 PM, Angus Salkeld wrote: On Fri, Aug 12, 2011 at 03:11:36PM -0600, Alan Robertson wrote: Hi, Back last November or so, I started work on a new monitoring project - for monitoring servers and services. It's aims are: - Scalable virtually without limit - tens

Re: [Linux-ha-dev] Uniquness OCF Parameters

2011-06-20 Thread Alan Robertson
On 06/17/2011 02:43 AM, Lars Ellenberg wrote: On Thu, Jun 16, 2011 at 03:52:37PM -0600, Alan Robertson wrote: On 06/16/2011 02:51 AM, Lars Ellenberg wrote: On Thu, Jun 16, 2011 at 09:48:20AM +0200, Florian Haas wrote: On 2011-06-16 09:03, Lars Ellenberg wrote: With the current unique=true

Re: [Linux-ha-dev] Uniquness OCF Parameters

2011-06-16 Thread Alan Robertson
On 06/16/2011 02:51 AM, Lars Ellenberg wrote: On Thu, Jun 16, 2011 at 09:48:20AM +0200, Florian Haas wrote: On 2011-06-16 09:03, Lars Ellenberg wrote: With the current unique=true/false, you cannot express that. Thanks. You learn something every day. :) Sorry that I left off the As you are

Re: [Linux-ha-dev] Uniquness OCF Parameters

2011-06-15 Thread Alan Robertson
On 06/14/2011 06:03 AM, Florian Haas wrote: On 2011-06-14 13:08, Dejan Muhamedagic wrote: Hi Alan, On Mon, Jun 13, 2011 at 10:32:02AM -0600, Alan Robertson wrote: On 06/13/2011 04:12 AM, Simon Talbot wrote: A couple of observations (I am sure there are more) on the uniqueness flag for OCF

Re: [Linux-ha-dev] Uniquness OCF Parameters

2011-06-15 Thread Alan Robertson
On 06/14/2011 07:21 AM, Dejan Muhamedagic wrote: Hi Florian, On Tue, Jun 14, 2011 at 02:03:19PM +0200, Florian Haas wrote: On 2011-06-14 13:08, Dejan Muhamedagic wrote: Hi Alan, On Mon, Jun 13, 2011 at 10:32:02AM -0600, Alan Robertson wrote: On 06/13/2011 04:12 AM, Simon Talbot wrote

Re: [Linux-ha-dev] Uniquness OCF Parameters

2011-06-13 Thread Alan Robertson
On 06/13/2011 04:12 AM, Simon Talbot wrote: A couple of observations (I am sure there are more) on the uniqueness flag for OCF script parameters: Would it be wise for the for the index parameter of the SFEX ocf script to have its unique flag set to 1 so that the crm tool (and others) would

Re: [Linux-ha-dev] Problem WARN: Gmain_timeout_dispatch Again

2011-05-16 Thread Alan Robertson
appear to be having. -- Alan Robertson al...@unix.sh On 05/12/2011 12:54 PM, gilmarli...@agrovale.com.br wrote: Hello! I'm using heartbeat version 3.0.3-2 on debian squeeze with dedicated gigabit ethernet interface for the heartbeat. But even this generates the following

Re: [Linux-ha-dev] Problem WARN: Gmain_timeout_dispatch Again

2011-05-13 Thread Alan Robertson
- which you don't appear to be having. -- Alan Robertson al...@unix.sh On 05/12/2011 12:54 PM, gilmarli...@agrovale.com.br wrote: Hello! I'm using heartbeat version 3.0.3-2 on debian squeeze with dedicated gigabit ethernet interface for the heartbeat. But even this generates

Re: [Linux-ha-dev] Problem WARN: Gmain_timeout_dispatch Again

2011-05-13 Thread Alan Robertson
). In the mean time, as long as those timings remain close to the expectations (60 vs 50ms) I'd ignore them. Those messages are meant to debug real-time problems - which you don't appear to be having. -- Alan Robertson al...@unix.sh On 05/12/2011 12:54 PM, gilmarli...@agrovale.com.br wrote: Hello

Re: [Linux-ha-dev] nginx resource agent

2011-01-01 Thread Alan Robertson
On 12/14/2010 02:42 AM, Dejan Muhamedagic wrote: Hi Alan, On Mon, Dec 06, 2010 at 10:41:48AM -0700, Alan Robertson wrote: Hi, Attached is a resource agent for the nginx web server/proxy package. http://en.wikipedia.org/wiki/Nginx http://nginx.org/ I'd like for it to be added

[Linux-ha-dev] nginx resource agent

2010-12-06 Thread Alan Robertson
and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce #!/bin/sh # # High-Availability nginx OCF resource agent # # nginx # # Description: starts/stops nginx servers. # # Author: Alan Robertson # Dejan

Re: [Linux-ha-dev] nginx resource agent

2010-12-06 Thread Alan Robertson
Let the the list know how it works out. It has some improvements over how the original Apache resource agent works. On 12/06/2010 10:59 AM, Raoul Bhatia [IPAX] wrote: On 12/06/2010 06:41 PM, Alan Robertson wrote: Hi, Attached is a resource agent for the nginx web server/proxy package

Re: [Linux-ha-dev] OCF RA dev guide: final heads up

2010-12-06 Thread Alan Robertson
On 12/06/2010 09:35 AM, Dejan Muhamedagic wrote: On a different matter: Perhaps it would be good to add a section about ocf-tester. Or would you consider that out of scope? Let me second that request. If you don't know about ocf-tester, then you don't really know much about building OCF RAs

Re: [Linux-ha-dev] Thinking about a new communications plugin

2010-11-28 Thread Alan Robertson
On 11/27/2010 04:19 PM, Lars Ellenberg wrote: On Sun, Nov 28, 2010 at 12:03:23AM +0100, Lars Ellenberg wrote: But until then, you could probably already have implemented your original proposal in the cumulative man hours spent writing and reading this thread, and I'm sure I will get used. So

[Linux-ha-dev] Thinking about a new communications plugin

2010-11-22 Thread Alan Robertson
Hi, I've been thinking about a new unicast communications plugin that would work slightly differently from the current ucast plugin. It would take a filename giving the hostnames or ipv4 or ipv6 unicast addresses that one wants to send heartbeats to. When heartbeat receives a SIGHUP, this

Re: [Linux-ha-dev] a scalable membership and LRM proxy proposal

2010-11-18 Thread Alan Robertson
On 11/18/2010 02:47 AM, Andrew Beekhof wrote: Its not that I'm against your proposal, I just don't know of enough resources to build, test and stabilize a new communication protocol. In that context, an off-the-shelf component that gives us a couple of magnitudes worth of additional scaling

Re: [Linux-ha-dev] a scalable membership and LRM proxy proposal

2010-11-16 Thread Alan Robertson
Thanks for this information. This is _SOOO_ much better than trying to dig it all out of the web site. On 11/16/2010 03:04 AM, Andrew Beekhof wrote: Alan Robertson wrote: I was hoping for something a little more lightweight - although I clearly understand the benefits of it already exists

Re: [Linux-ha-dev] a scalable membership and LRM proxy proposal

2010-11-15 Thread Alan Robertson
of I'm alive messages per second, that sounds wasteful at the least... But, I haven't read the specs - so maybe this is all happily taken care of. I'll go read some specs. Thanks for the info! -- Alan Robertson al...@unix.sh On 11/11/2010 05:28 AM, Andrew Beekhof wrote: Some

Re: [Linux-ha-dev] a scalable membership and LRM proxy proposal

2010-11-15 Thread Alan Robertson
the attraction. Not sure it's what I want, but since I don't know yet quite what I want - that would be hard to say :-). -- Alan Robertson al...@unix.sh On 11/11/2010 05:28 AM, Andrew Beekhof wrote: Some of your thinking mirrors our own. What we're moving towards is indeed two

[Linux-ha-dev] a scalable membership and LRM proxy proposal

2010-11-04 Thread Alan Robertson
I've been thinking about the idea of very highly scalable membership, and also about the LRM proxy function which is currently being performed by Pacemaker. Towards this end I wrote up a high-level design (or architecture, or design philosophy or something) for such a scalable membership/LRM

[Linux-ha-dev] Linux-HA server(s) down for an hour on 23 March 2010

2010-03-22 Thread Alan Robertson
to provide and maintain servers for Linux-HA! Sorry for the inconvenience, -- Alan Robertson al...@unix.sh Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce

[Linux-ha-dev] Linux-HA Leadership Announcement

2008-06-24 Thread Alan Robertson
, and so on. Expect to hear from them soon! -- Alan Robertson [EMAIL PROTECTED] Linux-HA founder, Linux-HA project leader emeritus ___ Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org http://lists.linux-ha.org/mailman

[Linux-ha-dev] Announcing! Release 2.1.3 of Linux-HA is now available!

2007-12-21 Thread Alan Robertson
291749: GUI should not overwrite more specific settings of contained resources. + Remove autoconf and friends on make distclean -- PS: Special thanks to Dejan for making up this change log - it's an annoying and thankless task. -- Alan Robertson [EMAIL PROTECTED] Openness

Re: [Linux-ha-dev] Problem with heartbeat gui

2007-12-16 Thread Alan Robertson
information. Also, there is a 10-minute screencast video on using the GUI which gives a variety of tips on authorization and so on. You might look at it. http://linux-ha.org/Education/Newbie/IPaddrScreencast -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative

Re: AW: [Linux-ha-dev] Call for testers: 2.1.3

2007-12-14 Thread Alan Robertson
in the configure stuff: 1) It got the package name for pegasus wrong for Red Hat 2) It didn't work if you had pegasus installed but didn't enable the CIM provider. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let

Re: [Linux-ha-dev] build RPM without pegasus on RHEL4

2007-12-12 Thread Alan Robertson
, that would also be a reasonable thought. But, still please send the complete build output to my email address. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce

Re: [Linux-ha-dev] Call for testers: 2.1.3

2007-12-08 Thread Alan Robertson
Known Problem in 2.1.3: The STONITHd test seems to fail if fencing is enabled. I suspect this of being a testing quirk rather than a new problem. I'm working it. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you

Re: [Linux-ha-dev] [mgmt]Rewriting order and colocation configurations

2007-12-08 Thread Alan Robertson
also CC me directly, if you want to discuss this, that would help me. Thanks! -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce

Re: [Linux-ha-dev] [mgmt]Rewriting order and colocation configurations

2007-12-08 Thread Alan Robertson
Yan Gao wrote: On Sat, 2007-12-08 at 07:54 -0700, Alan Robertson wrote: Yan Gao wrote: I'm rewriting the order and colocation configurations of mgmt. Following features will be implemented: 1. Get the crm.dtd file from server end. 2. Dynamically adding gtk widgets for attributes of a type

Re: [Linux-ha-dev] Call for testers: 2.1.3

2007-12-08 Thread Alan Robertson
Alan Robertson wrote: Known Problem in 2.1.3: The STONITHd test seems to fail if fencing is enabled. I suspect this of being a testing quirk rather than a new problem. I'm working it. Now fixed in 'dev' and 'test'. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation

Re: [Linux-ha-dev] [mgmt]Rewriting order and colocation configurations

2007-12-08 Thread Alan Robertson
and still growing). -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce ___ Linux-HA-Dev: Linux-HA-Dev

[Linux-ha-dev] Request for feedback: Checking CIBs for errors...

2007-12-07 Thread Alan Robertson
I'm asking it to do] -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce ___ Linux-HA-Dev: Linux-HA-Dev

Re: [Linux-ha-dev] ANNOUNCE: Project Organization - CRM to become its own project

2007-12-07 Thread Alan Robertson
know when that's set up. Or maybe Andrew leaving and going his own way will work out for the best and things will be better than I could possibly imagine. Anyone who has ideas on how this can be made to happen, please email me. -- Alan Robertson [EMAIL PROTECTED] Openness

Re: [Linux-ha-dev] stonith plugin through HMC web interface

2007-12-07 Thread Alan Robertson
then. Why don't you want to use the current HMC interface? -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce

Re: [Linux-ha-dev] errors with 76c25be5c854

2007-12-06 Thread Alan Robertson
- in the testing STONITH module. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce ___ Linux-HA-Dev: Linux

Re: [Linux-ha-dev] Call for testers: 2.1.3

2007-12-06 Thread Alan Robertson
Serge Dubrouski wrote: It also build on FC6 but not on CentOS. Whiinnne... /me straightens up. Thanks for the info! -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions

Re: [Linux-ha-dev] Call for testers: 2.1.3

2007-12-06 Thread Alan Robertson
version, and the problem was reported for 64 bit. OK. So, this step should only be included if --enable-mgmt, I guess? -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions

Re: [Linux-ha-dev] Heartbeat - Dev: changeset 11642:54085bc025ce

2007-12-06 Thread Alan Robertson
. These changes are in changeset 11643:35a4edc666b8, which has now been pushed into 'dev'. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce

[Linux-ha-dev] Re: Heartbeat - Dev: changeset 11628:e4a4c6fd5649

2007-12-04 Thread Alan Robertson
by the remote logging daemon - or not. What have I missed here? -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce

[Linux-ha-dev] Re: Commit messages

2007-11-06 Thread Alan Robertson
. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce ___ Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org http

Re: [Linux-ha-dev] cl_log dropping messages

2007-11-06 Thread Alan Robertson
because you have a lot of processors doesn't mean that Xen is scheduling you properly. You have two different schedulers going on here - so the opportunities for problems go up rather rapidly. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let

[Linux-ha-dev] Recovering from unexpected bad things - is STONITH the answer?

2007-11-06 Thread Alan Robertson
judgment shouldn't happen. I'm bringing this to both lists, so that we can hear comments both from developers and users. Comments please... -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised

[Linux-ha-dev] Re: [Linux-HA] Recovering from unexpected bad things - is STONITH the answer?

2007-11-06 Thread Alan Robertson
Kevin Tomlinson wrote: On Tue, 2007-11-06 at 10:25 -0700, Alan Robertson wrote: We now have the ComponentFail test in CTS. Thanks Lars for getting it going! And, in the process, it's showing up some kinds of problems that we hadn't been looking for before. A couple examples

Re: [Linux-ha-dev] bug in failcount handling?

2007-10-30 Thread Alan Robertson
Lars Marowsky-Bree wrote: On 2007-10-29T19:44:48, Alan Robertson [EMAIL PROTECTED] wrote: Off hand, this sounds a bit like a bug to me. I've attached the relevant files - the output of cibadmin -Q, a spreadsheet with the output of the various ptest runs, and the logs from both machines

Re: [Linux-ha-dev] RFC: pkg/ and port/ directory location

2007-10-23 Thread Alan Robertson
creditable job of keeping up with their respective platforms. I too would vote for packaging rather than contrib. Unfortunately, I believe that the RPM spec file needs to be in the top level directory of the project :-( (correct me if I'm wrong). -- Alan Robertson [EMAIL PROTECTED] Openness

Re: [Linux-ha-dev] Starting heartbeat when interfaces are down

2007-10-23 Thread Alan Robertson
- but this behavior does not constitute a single point of failure for the cluster. This is the rationale for this behavior. It's not perfect behavior, but it's not completely irrational either... -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim

Re: [Linux-ha-dev] Starting heartbeat when interfaces are down

2007-10-23 Thread Alan Robertson
Simon Talbot wrote: All, Does anyone know of any Quagga/Zebra OCF Scripts in development/mature, if not I will put some proper effort into making some decent ones? We have some for one specific special case, but I'm not aware of any more general ones. -- Alan Robertson [EMAIL PROTECTED

Re: [Linux-ha-dev] RFC: pkg/ and port/ directory location

2007-10-23 Thread Alan Robertson
know what's going on. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce ___ Linux-HA-Dev: Linux-HA-Dev

Re: [Linux-ha-dev] mysql ocf patch - update documentation and fix a typo

2007-10-18 Thread Alan Robertson
Raoul Bhatia [IPAX] wrote: please find another ocf::heartbeat::mysql patch attached. When you attach patches, it would be nice if you're able to make them text/plain MIME types. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me

Re: [Linux-ha-dev] mysql ocf patch - update documentation and fix a typo

2007-10-18 Thread Alan Robertson
Raoul Bhatia [IPAX] wrote: Alan Robertson wrote: Raoul Bhatia [IPAX] wrote: please find another ocf::heartbeat::mysql patch attached. When you attach patches, it would be nice if you're able to make them text/plain MIME types. I didn't mean to say you should resend it, I just meant

[Linux-ha-dev] Announcing HA/DR Educational Blog

2007-10-11 Thread Alan Robertson
Alan Robertson has started a blog focusing on HA, DR, and related topics - largely from an educational perspective. You can find the blog here: http://techthoughts.typepad.com/managing_computers/ A few of the recent topics include: * Split-brain, Quorum, and Fencing * How to use

[Linux-ha-dev] Release 2.1.3 planned for 10 December, 2007

2007-10-10 Thread Alan Robertson
as appropriate. Dejan Muhamadagic has a tool which helps with this. Miscellaneous responsibilities These responsibilities belong to individuals and small groups of people. Alan Robertson * Communicate status, progress, and problems with other stakeholders using the linux-ha-dev email mailing

Re: [Linux-ha-dev] Release 2.1.3 planned for 10 December, 2007

2007-10-10 Thread Alan Robertson
Alan Robertson wrote: Hi, As noted in the subject, we're currently planning on putting out release 2.1.3 on 10 December 2007. Thanks to the good people at NTT and NEC we have more people to help with testing than in the past. I have documented a proposed release testing procedure here

Re: [Linux-ha-dev] Dynamic Modify the timeout values

2007-08-24 Thread Alan Robertson
errors) than it has ever been. How does that sound for an outline of a plan? -- Alan Robertson [EMAIL PROTECTED] Hi, Alan-san. I am sorry for delay. And we asked our sponsor and he admit to research what you suggest. Though I researched the parameters for ha.cf, they are over 50 and I think

Re: [Linux-ha-dev] OpenBSD port update

2007-08-16 Thread Alan Robertson
(that already went into dev, thanks) - some hardcoded replacement of pam with bsd_auth to allow compilation of mgmtd and usage of hb_gui - some other minor changes Any chance you'd send me the patch for the PAM/bsd_auth change? -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation

Re: [Linux-ha-dev] Local loopback HBcomm plugin?

2007-08-16 Thread Alan Robertson
on openSUSE 10.2 + hg version of hb2. Comments are welcome. You can also do a ucast 127.0.0.1 without any new code... That's what we do for testing in BasicSanityCheck. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you

Re: [Linux-ha-dev] Shared disk file Exclusiveness controlprogramforHB2

2007-08-15 Thread Alan Robertson
it over in more detail. -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions. - William Wilberforce ___ Linux-HA-Dev: Linux-HA-Dev

Re: [Linux-ha-dev] Shared disk file Exclusiveness controlprogramforHB2

2007-08-14 Thread Alan Robertson
to study. In this case, we are trying to create a lock, but I suspect the lockless methods would be a good way to synchronize the creation of a lock (even though this sounds odd). -- Alan Robertson [EMAIL PROTECTED] Openness is the foundation and preservative of friendship... Let me claim

Re: [Linux-ha-dev] Shared disk file Exclusiveness control program for HB2

2007-08-14 Thread Alan Robertson
Alan Robertson wrote: Andrew Beekhof wrote: On 8/8/07, NAKAHIRA Kazutomo [EMAIL PROTECTED] wrote: Hello, all. We wrote a Shared Disk File EXclusiveness Control Program, called SF-EX for short, could prevent a destruction of data on shared disk file system due to Split-Brain. This program

  1   2   3   4   >