Re: [Linux-HA] slave's drbd resource doesn't get promote when master dies

2008-03-22 Thread Lars Ellenberg
://thread.gmane.org/gmane.linux.network.drbd/14345/focus=14372 sory for puting out broken software. blame is on me, I did not double check QA, which would have been my job before committing anything. the fix we have pending undergoes thorough regression testing this time. -- : Lars Ellenberg

Re: [Linux-HA] Initial dead time is smaller than deadtime

2008-04-10 Thread Lars Ellenberg
). what does netstat -su say? what settings do you have for net.core.rmem_max and net.core.wmem_max? consider to up that to 8 to 10 MB. -- : Lars EllenbergTel +43-1-8178292-0 : : LINBIT Information Technologies GmbH Fax +43-1-8178292-82 : : Vivenotgasse 48

Re: [Linux-HA] Initial dead time is smaller than deadtime

2008-04-11 Thread Lars Ellenberg
On Thu, Apr 10, 2008 at 07:30:31PM +0200, Bernd Schubert wrote: On Thursday 10 April 2008 12:48:27 Lars Ellenberg wrote: On Wed, Apr 09, 2008 at 06:34:39PM +0200, Lars Marowsky-Bree wrote: On 2008-04-08T19:32:58, Bernd Schubert [EMAIL PROTECTED] wrote: Hello, I need to set

Re: [Linux-HA] status of dopd in 2.1.3

2008-06-02 Thread Lars Ellenberg
around. In my test environment (vmware) i run Ubuntu 8.0.4 with original packages of heartbeat 2.1.3 drbd 8.0.11 -- : Lars Ellenberg http://www.linbit.com : : DRBD/HA support and consulting sales at linbit.com : : LINBIT Information Technologies GmbH

Re: [Linux-HA] Wierd heartbeat problem.

2008-07-16 Thread Lars Ellenberg
sane input parameters, and be happy. -- : Lars Ellenberg http://www.linbit.com : : DRBD/HA support and consulting sales at linbit.com : : LINBIT Information Technologies GmbH Tel +43-1-8178292-0 : : Vivenotgasse 48, A-1120 Vienna/Europe Fax +43-1

Re: [Linux-HA] drbd and dopd

2008-07-22 Thread Lars Ellenberg
message corruption by fprintf(stderr) ... debian 2.1.3-5 missed it unfortunately. SLES 10 SP2 includes it. I don't know for sure for other distros/packages. -- : Lars Ellenberg http://www.linbit.com : : DRBD/HA support and consulting sales at linbit.com

Re: [Linux-HA] Accessing non-TCP services on clutser VIP.

2008-08-20 Thread Lars Ellenberg
.10 cluster ip: x.y.z.20 current master: x.y.z.21 from portto portremark x.y.z.10 1234x.y.z.20161 SNMP request to cluster x.y.z.21 161 x.y.z.101234SNMP reply from master -- : Lars Ellenberg

Re: [Linux-HA] Server reboots as soon as load average hits exactly 30

2008-08-25 Thread Lars Ellenberg
and setup. or get rid of it completely. after all, by using heartbeat, you already have your cluster nodes monitoring each other. and the watchdog ain't no use if it gets in the way, right? -- : Lars Ellenberg : LINBIT HA-Solutions GmbH : DRBD®/HA support and consultinghttp

Re: [Linux-HA] Server reboots as soon as load average hits exactly 30

2008-08-25 Thread Lars Ellenberg
the load, typically that something is client requests. during failover (and coming up cache cold on the other node), more client requests pile up. the strategy reboot and failover is likely to worsen any load problem. -- : Lars Ellenberg : LINBIT HA-Solutions GmbH : DRBD®/HA

Re: [Linux-HA] File System preferences

2008-09-02 Thread Lars Ellenberg
servers, and nfs-mount the content from some NFS server (HA NFS DRBD Cluster?) -- : Lars Ellenberg : LINBIT HA-Solutions GmbH : DRBD®/HA support and consultinghttp://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT Information Technologies GmbH

Re: [Linux-HA] File System preferences

2008-09-02 Thread Lars Ellenberg
On Tue, Sep 02, 2008 at 03:17:42PM +0100, Matthew Macdonald-Wallace wrote: On Tue, 2 Sep 2008 16:03:27 +0200 Lars Ellenberg [EMAIL PROTECTED] wrote: why not use NFS? add plenty RAM into your web servers, and nfs-mount the content from some NFS server (HA NFS DRBD Cluster?) Because

Re: [Linux-HA] Rename an arbitrary file on failover (and keeping 2 files in sync)

2008-12-01 Thread Lars Ellenberg
own pam restrictions that deny access on passive nodes. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA

Re: [Linux-HA] DRBD not syncing

2008-12-12 Thread Lars Ellenberg
for the reason why drbd was going to StandAlone. my best guess is that the before resync target handler returned some bad exit code because you faked the hostname in the OCF RA parameters. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http

Re: [Linux-HA] value too great for base?

2008-12-16 Thread Lars Ellenberg
;) -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux

[Linux-HA] [Announce] DRBD Management Console

2009-02-17 Thread Lars Ellenberg
below. Thanks, Lars Ellenberg - Forwarded message from Philipp Reisner philipp.reis...@linbit.com - Date: Tue, 17 Feb 2009 11:56:00 +0100 From: Philipp Reisner philipp.reis...@linbit.com To: drbd-annou...@lists.linbit.com, drbd...@lists.linbit.com Cc: drbd-u...@lists.linbit.com

Re: [Linux-HA] Stranges dead link and late heartbeat on sunny Sunday.

2009-04-09 Thread Lars Ellenberg
recommend to upgrade to heartbeat 2.1.4 from backports, and continue to use your config as is. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] Stranges dead link and late heartbeat on sunny Sunday.

2009-04-09 Thread Lars Ellenberg
On Thu, Apr 09, 2009 at 04:22:14PM +0200, julien WICQUART wrote: Date: Thu, 9 Apr 2009 10:36:02 +0200 From: Lars Ellenberg lars.ellenb...@linbit.com Subject: Re: [Linux-HA] Stranges dead link and late heartbeat on sunny Sunday. To: linux-ha@lists.linux-ha.org I already posted

Re: [Linux-HA] Split-brain condition after power failure: smb resource migrates unexpectedly.

2009-05-06 Thread Lars Ellenberg
by the wiki itself at http://linux-ha.org/wiki/Heartbeat -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list

Re: [Linux-HA] Heartbeat does not start drbd after System Start

2009-05-28 Thread Lars Ellenberg
it was responsible. that simply won't work. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA

Re: [Linux-HA] glib: ucast: error binding socket. Retrying: Address already in use

2009-06-10 Thread Lars Ellenberg
services are started, yes, even before portmapper or any rpc services with arbitrary ports. or tell portmapper to choose arbitrary ports from above 1024, if that is possible. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD

Re: [Linux-HA] getting started with ha2: 404

2009-06-19 Thread Lars Ellenberg
choice) is _definitely_ much more production ready than any integrated-in-heartbeat CRM you may find. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] Fwd: Startup of cluster with one Outdated node

2009-06-24 Thread Lars Ellenberg
the same: stop your current secondary heartbeat first, so it will not take over when you stop your primary heartbeat. DRBD should then be Connected Secondary/Secondary. From there you can shutdown and reboot your boxes in any order. -- : Lars Ellenberg : LINBIT | Your Way to High Availability

Re: [Linux-HA] Failover problem

2009-06-24 Thread Lars Ellenberg
. You need to - add more nodes (just to have a real quorum), and/or - add stonith, and/or - set quorum policy to ignore. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT

Re: [Linux-HA] [patch] clock_t wrapped around causing false resourcestart failure

2009-07-01 Thread Lars Ellenberg
; } #endif /* CLOCK_T_IS_LONG_ENOUGH */ -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA

Re: [Linux-HA] [patch] clock_t wrapped around causingfalse resourcestart failure

2009-07-01 Thread Lars Ellenberg
my proposed patch there. just to reiterate: the problem is not the uptime wrap or resulting longclock wrap, as that is handled correctly in cl_times() and time_longclock. but the cmp_longclock comparing unsigned values without checking for wrap. -- : Lars Ellenberg : LINBIT | Your Way to High

Re: [Linux-HA] [patch] clock_t wrapped around causingfalse resourcestart failure

2009-07-01 Thread Lars Ellenberg
On Wed, Jul 01, 2009 at 04:59:24PM +0200, Dejan Muhamedagic wrote: Hi Lars, On Wed, Jul 01, 2009 at 04:44:06PM +0200, Lars Ellenberg wrote: On Wed, Jul 01, 2009 at 10:22:58AM -0400, Tavanyar, Simon wrote: Aahh ... so it's measured after reboot. That makes sense. Thanks, Lars

Re: [Linux-HA] [patch] clock_t wrapped around causingfalse resourcestart failure

2009-07-02 Thread Lars Ellenberg
this in a VM, and try to reproduce? Or shall we just ignore this for now? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] socket of lrmd

2009-07-03 Thread Lars Ellenberg
inherited from the parent process. All those FDs should probably be opened with the O_CLOEXEC flag, or alternatively by flags = fcntl(fd, F_GETFD); /*error handling*/ fcntl(fd, F_SETFD, flags | FD_CLOEXEC); /*error handling*/ -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA

Re: [Linux-HA] socket of lrmd

2009-07-03 Thread Lars Ellenberg
) -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org

Re: [Linux-HA] all or none failover

2009-07-27 Thread Lars Ellenberg
resource monitoring _at all_? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA

Re: [Linux-HA] How to tell which box was master

2009-08-13 Thread Lars Ellenberg
. Cheers, -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http

Re: [Linux-HA] DRBD failover doesn't work

2009-09-08 Thread Lars Ellenberg
at the same time to a non-shared medium. OCFS2 on the disk is the legacy of the previous DRBD setup with two primaries. So should we migrate to ext3? We postponed it as it would induce quite a long downtime. What exactly is it that you are trying to achieve? -- : Lars Ellenberg : LINBIT | Your Way

Re: [Linux-HA] DRBD failover doesn't work

2009-09-09 Thread Lars Ellenberg
to ext3. If you do not need concurrent access, yes, I'd strongly recommend to get rid of the cluster file system. As long as you don't export via NFS or similar, I think the downtime can be reduced to about a minute. Don't think that is long for such a task. -- : Lars Ellenberg : LINBIT | Your Way

Re: [Linux-HA] DRBD failover doesn't work

2009-09-09 Thread Lars Ellenberg
On Wed, Sep 09, 2009 at 12:13:53PM +0200, Christoph Lechner wrote: Lars Ellenberg wrote: The question was if you would recommend a change to ext3. If you do not need concurrent access, yes, I'd strongly recommend to get rid of the cluster file system. As long as you don't export via

Re: [Linux-HA] Checking status of communications between nodes

2009-09-21 Thread Lars Ellenberg
, but I would expect there is something similar available. If there is not such thing yet, now is about time to make it available ;) Anyone? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered

Re: [Linux-HA] Virtual-IP with unnumbered interface

2009-09-21 Thread Lars Ellenberg
state UNKNOWN qlen 1000 link/ether 00:16:3e:0f:01:12 brd ff:ff:ff:ff:ff:ff inet 192.168.113.12/24 brd 192.168.113.255 scope global eth0 inet 192.168.113.33/24 brd 192.168.113.255 scope global secondary eth0 ) Thanks: -- : Lars Ellenberg : LINBIT | Your Way to High Availability

Re: [Linux-HA] Heartbeat considers the other node is dead

2009-09-28 Thread Lars Ellenberg
thing. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http

Re: [Linux-HA] Two node cluster switchback

2009-10-02 Thread Lars Ellenberg
on the preferred node. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org

Re: [Linux-HA] Two node cluster switchback

2009-10-02 Thread Lars Ellenberg
On Fri, Oct 02, 2009 at 10:21:53AM +0200, James Brackinshaw wrote: On Fri, Oct 2, 2009 at 10:17 AM, Lars Ellenberg lars.ellenb...@linbit.com wrote: On Wed, Sep 30, 2009 at 02:22:32PM +0200, James Brackinshaw wrote: Hello, I have a two node heartbeat setup on Centos 5.3. The two nodes

Re: [Linux-HA] WARN: Gmain_timeout_dispatch

2009-10-19 Thread Lars Ellenberg
messages, not for Gmain_timeout_dispatch thingies per se. so maybe upgrading to a more recent kernel + pacemaker is the _real_ solution? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks

Re: [Linux-HA] Configuring iSCSI failover, service stops when slave comes up

2009-11-25 Thread Lars Ellenberg
on that node, I guess you need a cleanup of that resource. Won't do much good, though, until you have also set up the contraints properly. I'm sure that the paper mentions those? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com

Re: [Linux-HA] 4 Servers heartbeat cluster

2009-12-10 Thread Lars Ellenberg
(hopefully) -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http

Re: [Linux-HA] Question about linux-ha.org

2010-01-04 Thread Lars Ellenberg
a couple of them. They can tell you more. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list

Re: [Linux-HA] heartbeat waits for initdead even after all nodes have joined

2010-01-14 Thread Lars Ellenberg
heartbeat in the startup sequence, it turns out I don't really need initdead after all. btw, wfc-timeout is only used by the drbd init script, which you should not use in case you go the ocf:linbit:drbd RA route -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support

Re: [Linux-HA] heartbeat waits for initdead even after all nodes have joined

2010-01-15 Thread Lars Ellenberg
). for AIS based clusters, there seem to be no real initdead, I think that is why the dc-timeout was changed to default to the ha.cf initdead setting. iirc there have been improvements to this startup behaviour in pacemaker somewhen, but I don't remember the details. -- : Lars Ellenberg : LINBIT

[Linux-HA] Heartbeat 3.0.2 release, status update

2010-01-20 Thread Lars Ellenberg
to give the stack a few more spins in the Cluster Test Suite. We're still confident of meeting the original within the month of January release goal, though. Thanks for everyone's patience! Cheers, Lars -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting

Re: [Linux-HA] Heartbeat 2 and DRBD issue

2010-01-26 Thread Lars Ellenberg
Pacemaker Cluster GUI and DRBD Management Console, integrated VNC viewer for Pacemaker controlled VMs. The various Wizards should get you up and running in no time. http://www.drbd.org/mc/management-console/ -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting

Re: [Linux-HA] ERROR: Device drbd0: another node is heartbeating in our slot!

2010-01-27 Thread Lars Ellenberg
cluster file system directly as well as via iSCSI. DRBD or not, if your initiators mount it, you must not mount OCFS2 directly on the iSCSI target. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered

Re: [Linux-HA] Which RPM should create /var/run/heartbeat/crm ?

2010-01-27 Thread Lars Ellenberg
changed to create its socket in /var/run/heartbeat/* directly. Even though Rasto fixed that half a year ago, I only committed that change yesterday, sorry for that. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD

[Linux-HA] Announce: Final Release of Heartbeat 3.0.2, Cluster-Glue 1.0.2, and Resource-Agents 1.0.2

2010-02-01 Thread Lars Ellenberg
packages. The hgweb view of the Mercurial repository can be found at http://hg.linux-ha.org/heartbeat-STABLE_3_0 Build instructions have been written up by Andrew on the Clusterlabs website (http://clusterlabs.org/wiki/Install#From_Source). -- : Lars Ellenberg : LINBIT | Your Way to High

Re: [Linux-HA] hb_reset in HB 2.x?

2010-02-16 Thread Lars Ellenberg
be if the boxes were to bootup at the same time? hb_standby foreign hb_takeover local -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] About hb_gui

2010-02-17 Thread Lars Ellenberg
On Tue, Feb 16, 2010 at 02:54:37PM +0100, Alain.Moulle wrote: Hi I can't find anymore on whatever linux distribution on www.clusterlabs.org/rpm the pacemaker-mgmt which gave us the hb_gui ... Maybe you want to try the DMC... http://www.drbd.org/mc/management-console/ Even though it is

Re: [Linux-HA] Compilation of the resource-agents source to a RPM file

2010-03-04 Thread Lars Ellenberg
(multibyte) character at line 348. The rest can be ignored. Please make sure docbook-dtds is installed, and do a rpm -V docbook-dtds. Any mismatch? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered

Re: [Linux-HA] permission denied error starting dopd w/ heartbeat

2010-03-18 Thread Lars Ellenberg
is Heartbeat-3-0-2fef843bab53.tar.bz2 Thanks, -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list

Re: [Linux-HA] node2 wont stay up

2010-03-18 Thread Lars Ellenberg
with another process. You can complain to your distributor. it is part of nfs, and you may explicitly tell it which port to use (-o and -p options), which you can usually also configure in some /etc/default/nfs* or /etc/sysconfig/nfs* file. hth, -- : Lars Ellenberg : LINBIT | Your Way to High

Re: [Linux-HA] Question about VirtualDomain

2010-03-18 Thread Lars Ellenberg
=120 interval=60 Maybe it is all a misunderstanding? Alain: can you post your hakvm script, and and example logs from a start attempt to the first monitoring failure? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD

Re: [Linux-HA] Question about VirtualDomain

2010-03-19 Thread Lars Ellenberg
, arp conflicts, IP conflicts? Load on the host or vm prevents the hastatus to return in time? Maybe you just need to add some timeout + retry around the ping vm and/or ssh vm in your hakvm script, or increase various timeouts? Lars -- : Lars Ellenberg : LINBIT | Your Way to High

Re: [Linux-HA] Question about VirtualDomain

2010-03-19 Thread Lars Ellenberg
the monitor action succeeds (that includes your script) Which means, Alain, you have to provide some logs and dig deeper ;) -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] Master/Slave OCF script

2010-03-19 Thread Lars Ellenberg
master-max to 1, so there should be only one master node, but after promoting first node Pacemaker keeps trying to promote second node. Known bug in that version with that target-role. upgrade (or delete the target-role setting, and upgrade anyways). -- : Lars Ellenberg : LINBIT | Your Way to High

Re: [Linux-HA] Link to CIB User guide

2010-03-25 Thread Lars Ellenberg
looking for is the ClusterInformationBase UserGuide. Every link to it that I seem to find refers me back to linux-ha.org which contains nothing of the sort. Anyone know where I can get this guide? thanks -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support

Re: [Linux-HA] Problem with .../pacemaker/ping OCF script

2010-04-09 Thread Lars Ellenberg
and the cluster behavior is like before. Has anybody an idea what could happen? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] hblinkstatus - dead on one interface

2010-04-16 Thread Lars Ellenberg
with the results. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http

Re: [Linux-HA] hblinkstatus - dead on one interface

2010-04-16 Thread Lars Ellenberg
8001.00:0b:5f:09:55:00.800e, length 43 STP? What exactly is this network, switched, virtual, direct link, ...? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] [Linux-ha-dev] Deprecated resource agents

2010-04-19 Thread Lars Ellenberg
of these already exist in Pacemaker, and Andrew has indicated he wants to maintain them there rather than fix them in the Linux-HA repo. Right. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered

Re: [Linux-HA] HA LDAP monitor

2010-04-19 Thread Lars Ellenberg
://www.clusterlabs.org/doc/crm_cli.html More reference documentation on pacemaker: http://www.clusterlabs.org/doc/ More reference documentation on heartbeat: http://www.linux-ha.org/doc/ -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD

Re: [Linux-HA] [Linux-ha-dev] Deprecated resource agents

2010-04-19 Thread Lars Ellenberg
On Mon, Apr 19, 2010 at 10:48:48PM +0200, Lars Marowsky-Bree wrote: On 2010-04-19T22:22:41, Lars Ellenberg lars.ellenb...@linbit.com wrote: - EvmsSCC and - Evmsd (both apply to EVMS, which is no longer maintained); Fine with me, as newer distributions don't ship with EVMS anymore

Re: [Linux-HA] Problems with latest pacemaker update on Debian

2010-04-20 Thread Lars Ellenberg
- but is there a better solution for someone running Squeeze? Rebuild the stack yourself. Or wait until a new build of those packages is available, and confirmed working by someone. Or, speaking for lmb, use an other distribution ;-) -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support

Re: [Linux-HA] Problems with latest pacemaker update on Debian

2010-04-20 Thread Lars Ellenberg
On Tue, Apr 20, 2010 at 02:39:56PM +0100, stephen mulcahy wrote: Lars Ellenberg wrote: Most likely this is a broken build of those debian packages. There is an other round of builds pending anyways, as new versions have been released. Yeah, looks like some of the new packages have made

Re: [Linux-HA] [Linux-ha-dev] Deprecated resource agents

2010-04-20 Thread Lars Ellenberg
, just in case they or the script or the phase of the moon messes anything up and things are restarted regardless. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] login error

2010-04-21 Thread Lars Ellenberg
out in the wiki? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org

Re: [Linux-HA] how to change an op parameter in pacemaker?

2010-04-29 Thread Lars Ellenberg
be nice to be able to do crm configure ping set params host_list=10.10.10.1 Just a humble feature request crm configure show your_ping_resource_id | sed -e 's/host_list=\([^]*\)/host_list=10.10.10.1/ | crm configure load update - ? -- : Lars Ellenberg : LINBIT | Your Way to High

Re: [Linux-HA] Corosync shutdown hangs server

2010-04-30 Thread Lars Ellenberg
itself for cpu every minute, so it will be killed in case it runs into an endless loop... does corosync behave similarly? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] [Linux-ha-dev] Tickle Ack function in portblock resource

2010-05-03 Thread Lars Ellenberg
=1234 Len=0 Right. So tickle acks are send out. All is well ;-) If some firewall in between filters out the tickle acks as invalid packets, well, the client is not tricked into sending anything, and thus needs to notice all by itself. -- : Lars Ellenberg : LINBIT | Your Way to High Availability

Re: [Linux-HA] Documentation mismatch

2010-05-04 Thread Lars Ellenberg
the random byte string into something ascii alphanumerical, with the welcome side effect of being of predetermined length. It has nothing to do with what auth method is used. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD

Re: [Linux-HA] Heartbeat and Postfix

2010-05-10 Thread Lars Ellenberg
(especially simpler to explain to anyone who may need to operate/manage that thing). BTW, alert mails from the nodes about the cluster misbehaving should probably not try to be delivered into that HA postfix ;-) -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting

Re: [Linux-HA] Issues with Heartbeat/DRBD over Internet connection

2010-05-12 Thread Lars Ellenberg
66.165.231.34 ucast eth1 67.218.128.19 You should add an additional link. Really. auto_failback on crm yes Are you short on memory, or under memory pressure? Are UDP packets dropped? Packet loss somewhere? Message corruption? Firewalled in one direction? -- : Lars Ellenberg : LINBIT | Your Way to High

Re: [Linux-HA] Issues with Heartbeat/DRBD over Internet connection

2010-05-14 Thread Lars Ellenberg
have started, until this Emergency Shutdown incident. And send that to your support contact. You do have a support contact, right? ;-) -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered

Re: [Linux-HA] How to identify HBREAD and HBWRITE processes

2010-05-14 Thread Lars Ellenberg
br0 \_ heartbeat: read: mcast br0 \_ /usr/lib/heartbeat/ccm \_ /usr/lib/heartbeat/cib \_ /usr/lib/heartbeat/lrmd -r \_ /usr/lib/heartbeat/attrd \_ /usr/lib/heartbeat/crmd -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD

Re: [Linux-HA] crm_mon has display problems

2010-05-26 Thread Lars Ellenberg
of: crm_mon -1rf | less -S crm_mon -1rfn | less -S ? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list

Re: [Linux-HA] stonith: failure using expect+ssh (solved)

2010-05-27 Thread Lars Ellenberg
will just leave it at that. It needs to work for you, that is what matters. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] stonith: failure using expect+ssh (solved)

2010-05-28 Thread Lars Ellenberg
On Thu, May 27, 2010 at 09:48:15PM -0600, Tim Serong wrote: On 5/27/2010 at 10:24 PM, Lars Ellenberg lars.ellenb...@linbit.com wrote: On Thu, May 27, 2010 at 12:46:14AM +0200, Matthias Ferdinand wrote: --On Wednesday, May 26, 2010 12:00:02 -0600 linux-ha-requ...@lists.linux-ha.org

Re: [Linux-HA] stonith: failure using expect+ssh (solved)

2010-05-28 Thread Lars Ellenberg
On Fri, May 28, 2010 at 12:32:33PM +0200, Dejan Muhamedagic wrote: On Fri, May 28, 2010 at 11:07:53AM +0200, Lars Ellenberg wrote: On Thu, May 27, 2010 at 09:48:15PM -0600, Tim Serong wrote: On 5/27/2010 at 10:24 PM, Lars Ellenberg lars.ellenb...@linbit.com wrote: On Thu, May 27

Re: [Linux-HA] Should not Delay RA be stateful?

2010-06-01 Thread Lars Ellenberg
resource agent, that almost always means there is a bug somewhere else, either in some resource agent, or interaction, or in the configuration concept or dependencies. Please don't mask bugs. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http

Re: [Linux-HA] Active-Active nfs storage

2010-06-02 Thread Lars Ellenberg
of any resource-agents release yet. Lars (Ellenberg), could you comment on this please? Typically references from NFS to the underlying file system even after unexport come from file locks. To verify that, try mounting from the clients with nolock. If you cannot do without locks, figure out

Re: [Linux-HA] Active-Active nfs storage

2010-06-03 Thread Lars Ellenberg
is that your client workload includes locking, and theirs do not. Please try with nolock on the client side, and see if that changes something. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered

Re: [Linux-HA] upgrade problem

2010-06-11 Thread Lars Ellenberg
-refresh name=last-lrm-refresh value=1276174323/ /cluster_property_set /crm_config -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] heartbeat drdb failover failed

2010-07-14 Thread Lars Ellenberg
thinking dopd might have had something to do with the failure of the drbd resource takeover. Anyone know what might have happened? Did you want to show us something else? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD

Re: [Linux-HA] New with HA / STONITH

2010-08-03 Thread Lars Ellenberg
will only work under certain conditions? http://www.linux-ha.org/ConfiguringStonithPlugins may help a bit. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] how to use mac hadware with ipaddr2

2010-08-04 Thread Lars Ellenberg
overloaded, unfortunately) -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux

Re: [Linux-HA] time to fork heartbeat?

2010-08-12 Thread Lars Ellenberg
are more the GUI kind, have a look at the DMC, http://www.drbd.org/mc/management-console/ At the time of writing, the latest release announcement is http://www.gossamer-threads.com/lists/drbd/announce/19904 -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting

Re: [Linux-HA] Revenge of the cluster-glue clplumbing ABI change (a public service announcement)

2010-08-22 Thread Lars Ellenberg
an incompatible ABI change. And that its still desirable to do this for the next release. I have CCed Lars Ellenberg, who made the -version-info change to see if he has any thoughts. The intention was to keep old stuff working even with new libs, but have new stuff require new libs. If someone wanted

Re: [Linux-HA] Perpeptual Newbie Question - gfs2 active/active

2010-09-14 Thread Lars Ellenberg
\ no-quorum-policy=ignore \ last-lrm-refresh=1284509691 -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA

Re: [Linux-HA] Perpeptual Newbie Question - gfs2 active/active

2010-09-15 Thread Lars Ellenberg
is the huge overhead with the solution. drbd looks like the way to go in passive/active solutions though. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] Standby Node Refuses to Take Over

2010-10-01 Thread Lars Ellenberg
. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux

Re: [Linux-HA] /etc/hosts VS node directive in ha.cf

2010-10-06 Thread Lars Ellenberg
long that resolving would take. BTW, is it a requirement from Heartbeat to assign IP on the interface that are being in the communication link? If you use IP for cluster communication, you better use an interface with IP enabled. Maybe I don't understand your question? -- : Lars Ellenberg

Re: [Linux-HA] Standby Node Refuses to Take Over

2010-10-06 Thread Lars Ellenberg
and XML libraries to operate. I am working on keeping it lightweight so it can be used in small systems. Wish me luck :) I certainly do. Regards, Steve -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT

Re: [Linux-HA] Emergency reboot by stonith-enabled=false

2010-10-08 Thread Lars Ellenberg
uid=root If you want to prevent reboots, use crm respawn. see also http://hg.linux-ha.org/dev/rev/9624b66a6b82 will be released with heartbeat 3.0.4 -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT

Re: [Linux-HA] Documentation of heartbeat protocol

2010-10-14 Thread Lars Ellenberg
by reverse engeneering the code, is probably not exactly wasted, but possibly close, sometimes, even though it may be an interesting and educating experience. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT

Re: [Linux-HA] possible race condition in OCF apache RA monitor

2010-10-15 Thread Lars Ellenberg
at the code ... Wait. It does that already, since May 2007. What versions are you using? Consider an update? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

Re: [Linux-HA] Reusable-Cluster-Components-glue: make error on 32-bit box

2010-10-27 Thread Lars Ellenberg
/libreplace.a(NoSuchFunctionName.o)' is incompatible with i386 output make clean ? -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria

  1   2   3   4   5   6   >