is lost, with RELP things received but not written is lost)
Really, the only way to not loose something is to have an application level
acknowlegement that's only sent after the data is safe on redundant non-volitile
media.
David Lang
___
Linux-HA
:-) This is 'good enough'
for load balancing syslog messages.
David Lang
vagrant@sendervm:~$ sudo crm configure show
node receivervm
node sendervm
primitive p_vip ocf:heartbeat:IPaddr2 \
params ip=10.114.90.251 cidr_netmask=24 nic=eth1
clusterip_hash=sourceip-sourceport \
op start
have things setup per your config, check that both nodes are online and
have the resource active. At that point, both systems should get some of the
traffic.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org
On Thu, 19 Sep 2013, Jakob Curdes wrote:
Am 19.09.2013 01:21, schrieb David Lang:
cman two_node=1 expected_votes=1
totem vsftype=none token=5000
token_retransmits_before_loss_const=10 join=60 consensus=4800
rrp_mode=none transport=udpu
interface ringnumber=0 bindnetaddr=10.1.18.0
On Thu, 19 Sep 2013, Jakob Curdes wrote:
Am 19.09.2013 11:49, schrieb David Lang:
On Thu, 19 Sep 2013, Jakob Curdes wrote:
That's the direction we started, but apparently the centos
pacemaker/corosync packages don't look at the corosync.conf file, they
expect to extract everything out
/
/interface
/totem
/cman
fencedevices/
rm
failoverdomains/
resources/
/rm
/cluster
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org
drop one of the
lists from the thread)?
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
Why is corosync replacing heartbeat? it seems like there are a number of things
(like multi-datacenter operation) that have worked for heartbeat for years that
corosync lists as new or upcoming features.
David Lang
On Mon, 1 Jul 2013, Digimer wrote:
On 07/01/2013 07:06 AM, Parkirat wrote
down this road is how
much damage you suffer in a split-brain situation.
If you have shared nothing HA nodes, the odds are that you really aren't
damaging much, the worst case tends to be the dup IP issue, and if one is off
the network, that really doesn't matter.
David Lang
On Wed, 22 Aug 2012, Jon Heese wrote:
On 21 Aug 2012, at 17:29, David Lang david_l...@intuit.com wrote:
On Tue, 21 Aug 2012, Jon Heese wrote:
Feel free to keep discussing alternatives, but I am not at liberty to
change this system from the current Heartbeat/Pacemaker/CRM\
architecture
the configuration so that instead of your
VIP moving from one box to another, it's instead shared between the systems.
This is a much smaller step than setting up an external load balancer system.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
On Tue, 21 Aug 2012, Lars Marowsky-Bree wrote:
On 2012-08-21T13:16:29, David Lang david_l...@intuit.com wrote:
with ldirectord you have an extra network hop, and you have all your
traffic going through one system. This is a scalability bottleneck as
well as bing a separate system
On Tue, 21 Aug 2012, Jon Heese wrote:
On 21 Aug 2012, at 22:06, David Lang david_l...@intuit.com wrote:
Well haproxy does all that out of the box, no tricks or tweakery
required...
Another thing I like about haproxy is that it's unnervingly fast;
start/stop/reload are effectively
On Fri, 8 Jun 2012, Lars Ellenberg wrote:
On Fri, Jun 08, 2012 at 02:07:17PM -0700, David Lang wrote:
I just updated one of my systems to the 3.4 kernel and findif appears to be
failing (generic error)
And your version of the resource agents is?
Did you try current upstream (3.9.3
I just updated one of my systems to the 3.4 kernel and findif appears to be
failing (generic error)
how can I test it from the command line?
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo
, and
then one of the nodes starts it.
this solves the dup-IP problem because starting the resource re-sends the
appropriate ARP packets to clean up the network.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org
, shutdown heartbeat on
the
new box and copy the file over (or update the value in the file) on the new box.
David Lang
On Fri, 29 Jul 2011, mike wrote:
Date: Fri, 29 Jul 2011 16:06:25 -0300
From: mike mgbut...@nbnet.nb.ca
Reply-To: General Linux-HA mailing list linux-ha@lists.linux-ha.org
stalls attempting to send the message (including failed
DNS lookups because the network is down), a graceful failover can take a long
time to take place.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman
On Thu, 12 Aug 2010, Dejan Muhamedagic wrote:
On Wed, Aug 11, 2010 at 05:22:56PM -0700, David Lang wrote:
On Thu, 12 Aug 2010, Dejan Muhamedagic wrote:
On Wed, Aug 11, 2010 at 03:59:34PM -0700, David Lang wrote:
On Thu, 12 Aug 2010, Dejan Muhamedagic wrote:
On Wed, Aug 11, 2010 at 02:44
that Igor is experiancing, and the inability to
take a simple config and convert it to the new format, it is sounding like it
may be time to fork.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman
; fi
Now, this goes against all good principles, etc. But I think that this
solution will work practically most of the time.
Any thoughts?
anything that works, especially with the lack of information.
when it does the failover, does something further then show up in the ha-log
file?
David
figuring out what heartbeat is doing. the
logs
aren't showing anything (including failures), which makes it incredibly
difficult to troubleshoot.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo
On Thu, 12 Aug 2010, Dejan Muhamedagic wrote:
On Wed, Aug 11, 2010 at 02:44:36PM -0700, David Lang wrote:
haresources2cib.py is obsolete and probably produces a bad
cib.xml. The recommended way is to create a configuration using
the crm shell.
Ok, so this means that there is officially
On Thu, 12 Aug 2010, Dejan Muhamedagic wrote:
On Wed, Aug 11, 2010 at 03:59:34PM -0700, David Lang wrote:
On Thu, 12 Aug 2010, Dejan Muhamedagic wrote:
On Wed, Aug 11, 2010 at 02:44:36PM -0700, David Lang wrote:
I currently manage over a hundred
clusters of machines. with v1 style configs
: dont_ask:
nice_failback: yes
given that this only happens on one machine, either you don't have heartbeat
installed the same way on both machines, or you don't have the same haresorces
file
can you please verify that haresources is _identical_ on both machines?
David Lang
On Tue, 10 Aug
On Tue, 10 Aug 2010, Igor Chudov wrote:
On Tue, Aug 10, 2010 at 12:51 PM, David Lang
david.l...@digitalinsight.com wrote:
one problem I see in ha-log-2.txt is the lines
Aug 10 10:38:06 pfs-srv4 ResourceManager[1241]: [1253]: ERROR: Cannot locate
resource script
Aug 10 10:38:06 pfs-srv4
On Tue, 10 Aug 2010, Igor Chudov wrote:
Dmitri, you are right.
In any case the name change did nothing.
did it eliminate the error from the log? does the log say anything else after
that point?
David Lang
They are still refuse to take over when rebooted simultaneously.
The symptoms
could you re-post the files (log files, ha.cf and haresources from each box)
David Lang
On Tue, 10 Aug 2010, Igor Chudov wrote:
Date: Tue, 10 Aug 2010 15:23:44 -0500
From: Igor Chudov ichu...@gmail.com
Reply-To: General Linux-HA mailing list linux-ha@lists.linux-ha.org
To: General Linux-HA
and a different one on the backup with each
being
failover for the other)
I have had many cases where one bad resource entry would prevent things from
starting. with the older version I am using I get a better error message than I
saw you post, which let me find it more clearly.
David
box shutdown, not that the other box has instructed it to shutdown.
this is looking to me like something external to heartbeat is killing the
process.
David Lang
On Tue, 10 Aug 2010, Igor Chudov wrote:
Date: Tue, 10 Aug 2010 17:57:26 -0500
From: Igor Chudov ichu...@gmail.com
Reply
and
doesn't hear anything, just in case the switch is preventing it from seeing
another system that's up.
David Lang
pfs-srv3:
Aug 10 18:04:41 pfs-srv3 logd: [955]: WARN: Core dumps could be lost
if multiple dumps occur.
Aug 10 18:04:41 pfs-srv3 logd: [955]: WARN: Consider setting
non-default
On Tue, 10 Aug 2010, Igor Chudov wrote:
On Tue, Aug 10, 2010 at 6:41 PM, David Lang
david.l...@digitalinsight.com wrote:
On Tue, 10 Aug 2010, Igor Chudov wrote:
Guys, I have a bit of clarification. In an attempt to avoid the timing
issues, an hour ago I tried adding a configuration change
On Tue, 10 Aug 2010, Igor Chudov wrote:
On Tue, Aug 10, 2010 at 7:05 PM, David Lang
david.l...@digitalinsight.com wrote:
On Tue, 10 Aug 2010, Igor Chudov wrote:
On Tue, Aug 10, 2010 at 6:41 PM, David Lang
david.l...@digitalinsight.com wrote:
On Tue, 10 Aug 2010, Igor Chudov wrote:
As I
does ha-log show on pfs-srv3?
David Lang
On Mon,
9 Aug 2010, Nick Calvert wrote:
Date: Mon, 9 Aug 2010 21:10:13 +0100
From: Nick Calvert nick.calv...@gmail.com
Reply-To: General Linux-HA mailing list linux-ha@lists.linux-ha.org
To: General Linux-HA mailing list linux-ha@lists.linux-ha.org
ha-log should give you a detailed picture of what each box is thinking as they
startup. I've always been able to track down the problem with that info for my
systems.
David Lang
On Mon, 9 Aug 2010, Igor Chudov wrote:
Pushkar, I will be at work tomorrow (took a couple of days off
, so when the
network is restored you are in a splitbrain situation, and that is what you are
recovering from.
David Lang
Thanks,
Dejan
Thanks.
On Fri, Aug 6, 2010 at 8:29 PM, Dejan Muhamedagic deja...@fastmail.fm
wrote:
Hi,
On Fri, Aug 06, 2010 at 04:57:43PM +0800, fengyandong
the real IP
addresses and the VIPs).
I have several dozen clusters with postfix on them, and it works just fine
without needing to know anything about heartbeat.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org
On Tue, 4 May 2010, Cameron Smith wrote:
On Tue, May 4, 2010 at 10:17 AM, David Lang
david.l...@digitalinsight.comwrote:
On Tue, 4 May 2010, Cameron Smith wrote:
I am currently using Heartbeat to manage http, mysql and a DRBD device
between two nodes.
I want to also manage Postfix
On Mon, 15 Feb 2010, Dejan Muhamedagic wrote:
On Thu, Feb 11, 2010 at 10:04:04AM -0800, David Lang wrote:
on my old systems that are still running heartbeat 1.x there is a hb_reset
command that moves all resources to the node they are configured to start on
(assuming auto_fallback is turned
detail to know if this
would be a factor or not.
David Lang
On Mon, 11 Jan 2010, Javen Wu wrote:
Can we change the limitation 32 nodes of COROSYNC to 64 ?
Thanks
Javen
2010/1/11 Alain.Moulle alain.mou...@bull.net
Hi,
I wonder what is for now the *real* limit in the number
of nodes in a HA
/log/ha-log
auto_failback off
apiauth cl_status gid=haclient
this is using udp broadcase instead of unicast
you may want to add debugfile and logfile options, then if things don't work
look at what those files have to say
David Lang
Thanks,
Marcus
for the cluster, which is what you are seeing.
David Lang
Thanks,
Dejan
heartbeat[4514]: 2009/07/07_09:11:35 ERROR: MSG[4] : [src=dcwvm-drbdnode-1]
heartbeat[4514]: 2009/07/07_09:11:35 ERROR: MSG[5] :
[(1)srcuuid=0x8137360(36 27)]
heartbeat[4514]: 2009/07/07_09:11:35 ERROR: MSG[6] : [seq=7e73f
do the rules on the ASA allow the real interfaces of the boxes to get to the
Internet, or only the VIP?
David Lang
On Fri, 8 May 2009, Ryan Thomson wrote:
Date: Fri, 8 May 2009 10:48:14 -0700
From: Ryan Thomson r...@pet.ubc.ca
Reply-To: General Linux-HA mailing list linux-ha@lists.linux
about using Linux Virtual Server. It also
integrates nicely into heartbeat.
CLUSTERIP and LVS serve very different purposes.
CLUSTERIP spreads the load across different servers in that heartbeat cluster.
LVS spreads the load across different servers outside that heartbeat cluster.
David Lang
.
David Lang
Brandon
Dejan Muhamedagic wrote:
Hi,
You need to find a more appropriate forum: heartbeat isn't
haproxy.
Thanks,
Dejan
On Fri, Jan 16, 2009 at 07:23:13PM +0530, kiran sarvabhotla wrote:
Hi all,
I tried to configure a 2 node cluster (courtesy
http://www.howtoforge.com/high
times.
David Lang
--
Eric Robinson
-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Imran Chaudhry
Sent: Friday, November 28, 2008 9:36 AM
To: linux-ha@lists.linux-ha.org
Subject: [Linux-HA] Rename an arbitrary file on failover (and keeping
2files
the resource manually with the command line that heartbeat would use
for example
/etc/ha.d/resouce.d/IPADDR 192.168.111.1 start
David Lang
Thanks!
Leon
=
Leon Miller-Out
President, Singlebrook Technology, Inc
p 607-330-1493
f 607-697-0457
516 W. State St. - Suite
heartbeat just do an ifconfig and you should see eth0:0
and eth0:1 with your VIPs on them.
The 2 real web servers are set up like this:
what do your load balancers look like?
David Lang
/etc/network/interfaces
auto lo
iface lo inet loopback
auto lo:0
iface lo:0 inet static
address
to this is to have the clients point at the individual boxes. the
NTP protocol is designed to deal with multiple sources so it doesn't need to
work through the VIP.
David Lang
Thanks! Phil
LNXPNTP1: primary system, eth0 = 10.255.213.232
CHARNTP1: backup system, eth0 = 10.255.213.233
Virtual address
in the haresources file where it can be
liegitimate for them to be different, but there are not many cases like this.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http
can match, no matter how much more you
spend on the hardware.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
-53.1.14.el5),
and (I believe) V2-style configuration.
haresources is the V1-style configuration
cibadmin is the V2-style configuration
V2 is significantly more powerful then V1, but also more complicated. if you
don't need any of the V2 features you can use V1.
David Lang
On Fri, 28 Mar 2008, Luis Motta Campos wrote:
David Lang wrote:
haresources is the V1-style configuration
cibadmin is the V2-style configuration
V2 is significantly more powerful then V1, but also more complicated. if
you don't need any of the V2 features you can use V1.
I'm not sure about
so does this mean that those of us who use systems that don't run X and don't
have windows boxes nearby shouldn't use heartbeat V2?
that's what it sounds like you guys are saying.
David Lang
On Tue, 12 Feb 2008, Dan Gahlinger wrote:
Date: Tue, 12 Feb 2008 13:14:26 -0500
From: Dan Gahlinger
On Wed, 13 Feb 2008, Andrew Beekhof wrote:
On Feb 12, 2008, at 7:23 PM, David Lang wrote:
so does this mean that those of us who use systems that don't run X and
don't have windows boxes nearby shouldn't use heartbeat V2?
that's what it sounds like you guys are saying.
no no no :-)
CLI
a version that gets patches, this is a useful option.
David Lang
On Wed, 13 Feb 2008, Andrew Beekhof wrote:
Date: Wed, 13 Feb 2008 08:23:08 +0100
From: Andrew Beekhof [EMAIL PROTECTED]
Reply-To: General Linux-HA mailing list linux-ha@lists.linux-ha.org
To: General Linux-HA mailing list linux-ha
that don't handle the loss of connection cleanly and the app has
required manual restarting, but never any problems with the AIX systems
themselves.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman
with
everything.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
did both everything that people were
running into was handled, and it's worked ever since.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org
the person
asking the question) should go back to the early archives to find the details.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
is a good
idea (and I would suggest that it may be appropriate to split the GUI from the
core CRM as well).
I will only start to get worried if I start to see interface changes takeing
place that aren't coordinated or compatible between the projects.
David Lang
is a good
idea (and I would suggest that it may be appropriate to split the GUI from the
core CRM as well).
I will only start to get worried if I start to see interface changes takeing
place that aren't coordinated or compatible between the projects.
David Lang
of interrupting
connections low enough that it hasn't been a high priority)
David Lang
Perhaps a good 'distribution' is pfsense, which packages it all
(FreeBSD+PF+CARP+more) including a web interface. There is plenty of
documentation on the web avaiable for such a setup...
- Joris
-Original Message
mean when you say you need to manage ssh sessions? if you mean they
go through the firewall, then the iptables stuff should fix this. if you mean
that people connect to the firewall itself and you want the ssh session to
failover to the backup, that's not possible.
David Lang
with the conntrack replication yet (once I get a
chance to experiment with it I may find it's trivial to setup and start useing
it everywhere, but it hasn't been important enough for me to do so yet)
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux
version and distributing slightly different things, but all named the same
version numbers is not the way I would want to see this project go. please try
to avoid this.
David Lang
___
Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org
http
On Mon, 16 Jul 2007, Lars Marowsky-Bree wrote:
On 2007-07-16T12:20:42, David Lang [EMAIL PROTECTED] wrote:
releasing substandard packages. Particularly given that I am the author of
the majority of the HAv2 code and therefor have arguably the most interest
in its quality.
the inferance
On Mon, 16 Jul 2007, Andrew Beekhof wrote:
On 7/16/07, David Lang [EMAIL PROTECTED] wrote:
On Mon, 16 Jul 2007, Andrew Beekhof wrote:
Lets call a spade a spade shall we...
This is a thinly veiled put-down of the people who have been doing
Alan's
job for the last 7 months.
this did
On Mon, 16 Jul 2007, Lars Marowsky-Bree wrote:
On 2007-07-14T20:52:43, Alan Robertson [EMAIL PROTECTED] wrote:
a random collection of mostly old cast-off computers. They range
between 300mhz and 2.4 ghz, and have disks ranging from
5200 to 7200 RPM with significantly
on the inactive box in a pair generate MailTo messages from both boxes, causing
management to freak out)
are there enough 'oneshot' type things that it is worth adding the concept to
the cib directly rather then trying to fake it out in the scripts?
David Lang
version and distributing slightly different things, but all named the same
version numbers is not the way I would want to see this project go. please try
to avoid this.
David Lang
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http
)
3. a limited number of 2u dual Athlon 2000+ systems with SCSI RAID controllers
(I'm short on drives, but can come up with a handful of 10k, and possibly a
couple 15K drives)
any interst from anyone for this?
David Lang
___
Linux-HA-Dev: Linux
On Mon, 16 Jul 2007, Lars Marowsky-Bree wrote:
On 2007-07-15T14:58:47, David Lang [EMAIL PROTECTED] wrote:
it would be great if somthing like the auto-build system could feed into a
cluster like this for automated testing (I don't know if there would be
enough testing with nightly builds
)
3. a limited number of 2u dual Athlon 2000+ systems with SCSI RAID controllers
(I'm short on drives, but can come up with a handful of 10k, and possibly a
couple 15K drives)
any interst from anyone for this?
David Lang
___
Linux-HA mailing list
Linux
On Mon, 16 Jul 2007, Lars Marowsky-Bree wrote:
On 2007-07-15T14:58:47, David Lang [EMAIL PROTECTED] wrote:
it would be great if somthing like the auto-build system could feed into a
cluster like this for automated testing (I don't know if there would be
enough testing with nightly builds
release locations?
David Lang
Since I am running
FreeBSD I can obviously not use binary packages for linux distributions.
On 7/6/07, Lars Marowsky-Bree [EMAIL PROTECTED] wrote:
On 2007-07-05T12:00:04, Matt Wilder [EMAIL PROTECTED] wrote:
I enabled logd and am having the same problem. Below
with heartbeat 2.0.8
On 7/6/07, David Lang [EMAIL PROTECTED] wrote:
On Fri, 6 Jul 2007, Andrew Beekhof wrote:
On 7/6/07, Matt Wilder [EMAIL PROTECTED] wrote:
There is no virtualization going on here. These are FreeBSD
6.2-RELEASEsystems with nearly stock kernels. These problems are not
occuring
that people not use the current
release)
I know that Alan has a torture test that he runs releases through before he does
the release, is this something that we can setup additional machines to help
with?
Last week Alan was out at Usenix but I'm surprised that he hasn't spoken up
since then.
David
that people not use the current
release)
I know that Alan has a torture test that he runs releases through before he does
the release, is this something that we can setup additional machines to help
with?
Last week Alan was out at Usenix but I'm surprised that he hasn't spoken up
since then.
David
79 matches
Mail list logo