[Linux-HA] Anyone successfully install PAcemaker/Corosync on Freebsd?

2015-12-19 Thread mike
Hi All, just curious if anyone has had any luck at one point installing Pacemaker and Corosync on FreeBSD. I've run into an issue when running ./configure while trying to install Corosync. The process craps out at nss with this error: checking for nss... configure: error: in

[Linux-HA] Adding mysql Resource - is this normal?

2014-01-14 Thread mike
? Why is this worded backwards? Thanks as always, Mike ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] Adding mysql Resource - is this normal?

2014-01-14 Thread mike
of that pound on fedora forum Sent from my Verizon Wireless 4G LTE DROID mike mgbut...@nbnet.nb.ca wrote: Hello All, I've worked with HA and pacemaker for a few years now and today I'm working with a Fedora 19 cluster that I just created. Following the excellent /Clusters from Scratch/ by Andrew

Re: [Linux-HA] Simple cluster - backup wont start

2013-07-31 Thread mike
on the primary node and HA thinking it was already up on the backup node when HA started there, there was an obvious problem and HA killed the backup node 1 or 2 seconds after it joined the cluster. Good guy LinuxHA strikes again! On 13-07-29 12:40 PM, mike wrote: Hi guys, I've got a rather odd issue

[Linux-HA] Simple cluster - backup wont start

2013-07-29 Thread mike
Hi guys, I've got a rather odd issue. We have a simple two node cluster running one VIP and mysql. Pretty sure I could create this cluster in my sleep. Anyway, the cluster has been up and running for months with no issues at all. Last night we had to shut down both nodes. On the primary node

Re: [Linux-HA] can not reboot or shutdown the server

2013-01-10 Thread mike
Is eth3 up at the time this thing goes into its loop? On 13-01-10 07:50 AM, 赵长松 wrote: Hi I use drbd and heartbeat to construct HA.But When I reboot or shutdown the server , it run into a infinite loop. The information in logfile as follows: crmd[3852]: 2013/01/10_10:22:18 info:

Re: [Linux-HA] I need to edit my cib.xml manually

2012-06-19 Thread mike
in the configuration file and start HA. What are the ramifications of editing it by hand and will HA allow me to do that? How about crm configure save file.crm then edit file.crm then crm load replace file.crm on the other side? HTH, Dan Thanks -mike

Re: [Linux-HA] I need to edit my cib.xml manually

2012-06-19 Thread mike
in the configuration file and start HA. What are the ramifications of editing it by hand and will HA allow me to do that? How about crm configure save file.crm then edit file.crm then crm load replace file.crm on the other side? HTH, Dan Thanks -mike ___ Linux-HA

Re: [Linux-HA] Apache wont start on VIP

2012-02-06 Thread mike
-- mike On 12-02-06 08:28 AM, Robert Schumann wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Hi Mike! Am 05.02.2012 01:57, schrieb mike: On the same 2 node cluster I have apache running and it fails back and forth fine as long as ports.conf is set to listen on all ip's. I do have

[Linux-HA] Apache wont start on VIP

2012-02-04 Thread mike
something obvious here? Why does apache start from the command line but not in HA unless ports.conf is set to listen on all interfaces? Thanks -mike ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See

[Linux-HA] Mixing LVS Tun and NAT

2012-01-30 Thread mike
on the same set of back end servers using LVS TUN but I cannot get it to work. It dawned on me that maybe I can't mix the two on the same backend servers because of the requirement of NAT to set the DFGW back to an IP on the director. What do you guys think? -thanks as always -Mike

[Linux-HA] What am I missing? HA ldirector

2012-01-28 Thread mike
Hi all, I've got a working LVS HA cluster with several load balanced applications running successfully. The HA cluster is a combination PROD and Test (something I dont agree with but there you have it). Last week we added an LVS Tunnel test pair to the Cluster. Here's what it looks like.

Re: [Linux-HA] Services does not react after IP Takeover

2012-01-23 Thread mike
On 12-01-23 04:59 AM, Niclas Müller wrote: Hey, I've build a cluster with one pacemaker resource named ClusterIP. The change by failover is very fast and is ok. My problem is that services like apache or mysql take at least 10-15 seconds to respond after IP takeover. Is there a change to

Re: [Linux-HA] ldirectord behavior on Blade Server

2011-11-11 Thread mike
On Thu, Nov 10, 2011 at 03:14:44PM -0400, mike wrote: On 11-11-09 09:33 PM, Simon Horman wrote: On Wed, Nov 09, 2011 at 12:18:22PM -0400, mike wrote: Thanks Nick: Here's the config that is the issue right now: #SERVER1.vip.intranet.mydomain.com virtual=172.28.191.194:8080

[Linux-HA] ldirectord behavior on Blade Server

2011-11-09 Thread mike
Got a bit of an odd issue and I'm hoping someone can help me figure this out. The set up is not ideal but here's the basic flow: Request goes to VIP1 on LVS on Blade A. It then routes it to one of 2 load balanced pairs on Blade A or B (works perfectly). That request then goes to one of 2 load

Re: [Linux-HA] ldirectord behavior on Blade Server

2011-11-09 Thread mike
Thanks Nick: Here's the config that is the issue right now: #SERVER1.vip.intranet.mydomain.com virtual=172.28.191.194:8080 protocol=tcp scheduler=lc checktype=connect checkport=8080 #negotiatetimeout=10 real=172.28.191.170:8080 masq real=172.28.191.171:8080 masq

Re: [Linux-HA] crm commands from a perl script

2011-11-08 Thread mike
On 11-11-08 12:07 AM, Tim Serong wrote: On 11/08/2011 12:24 PM, mike wrote: So I'm putting together a quick little perl script to monitor HA by running a few crm commands. When I run the script from the command line as root, it works perfectly. However, when I put it in root's crontab I get

[Linux-HA] crm commands from a perl script

2011-11-07 Thread mike
the correct value for use later in the script. From cron, the same script errs out with the msg above. Thanks for any suggestions. -mike ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http

Re: [Linux-HA] Does ANYTHING Work on RHEL6?

2011-10-31 Thread mike
On 11-10-31 07:56 AM, Robinson, Eric wrote: I can't get a cluster up on RHEL6. First I tried pacemaker+corosync, but corosync complains... Could not get the ring status, the error is: 6 ..and I cannot connect to the cluster. So then I tried pacemaker+heartbeat, only to learn that

Re: [Linux-HA] cib.xml missing on a cluster node

2011-10-26 Thread mike
On 11-10-26 07:20 PM, Alessandra Giovanardi wrote: Hi, I have a cluster based on Heartbeat v2 with two nodes (DEBIAN): gicdrupal01 gicdrupal02 with one RG active on gicdrupal02 (gicdrupal01 is in standby) with these pkt release: ii heartbeat 2.1.3-6lenny4 Subsystem for

Re: [Linux-HA] ldirectord error

2011-09-30 Thread mike
On 11-09-30 05:06 PM, Nick Khamis wrote: Can't locate Socket6.pm Socket6 is a perl module - install it and you should be fine -mike ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http

Re: [Linux-HA] ldirectord error

2011-09-30 Thread mike
On 11-09-30 08:02 PM, Nick Khamis wrote: Hey Mike! Thank you so much for your response. For those that may bump into this problem, in debian squeeeze I had to install: apt-get install libio-socket-inet6-perl apt-get install libwww-perl And everything is ok now... Nick. On Fri, Sep

[Linux-HA] LVS-NAT and DR on same HA cluster?

2011-09-28 Thread mike
Running ldirectord on HA with a couple of LVS-DR backend instances set up and working. Customer want to have another VIP in the cluster pointing to another load balanced tomcat pair. Only thing is its goign to have to be NAT. Any problems with NAT and DR co-existing on the same director?

Re: [Linux-HA] FW: Install problems with ha resource

2011-09-26 Thread mike
with Cluster Glue - is that installed? -mike ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] ldirector - jboss port

2011-09-26 Thread mike
On 11-09-25 08:04 AM, mike wrote: On 11-09-25 05:13 AM, Vladislav Bogdanov wrote: 25.09.2011 11:09, Vladislav Bogdanov wrote: 25.09.2011 02:29, mike wrote: On 11-09-24 02:43 PM, Vladislav Bogdanov wrote: 24.09.2011 16:21, mike wrote: On 11-09-24 05:02 AM, Vladislav Bogdanov wrote

Re: [Linux-HA] ldirector - jboss port

2011-09-25 Thread mike
On 11-09-25 05:13 AM, Vladislav Bogdanov wrote: 25.09.2011 11:09, Vladislav Bogdanov wrote: 25.09.2011 02:29, mike wrote: On 11-09-24 02:43 PM, Vladislav Bogdanov wrote: 24.09.2011 16:21, mike wrote: On 11-09-24 05:02 AM, Vladislav Bogdanov wrote: 23.09.2011 21:15, mike wrote: Last year I

Re: [Linux-HA] ldirector - jboss port

2011-09-24 Thread mike
On 11-09-24 05:02 AM, Vladislav Bogdanov wrote: 23.09.2011 21:15, mike wrote: Last year I set up an HA cluster with ldirector pointing to 2 load balanced real servers. We had jboss on the backend listening to the Real IP on port 8080. Initially, we could not get the backend to reply - we kept

Re: [Linux-HA] ldirector - jboss port

2011-09-24 Thread mike
On 11-09-24 02:43 PM, Vladislav Bogdanov wrote: 24.09.2011 16:21, mike wrote: On 11-09-24 05:02 AM, Vladislav Bogdanov wrote: 23.09.2011 21:15, mike wrote: Last year I set up an HA cluster with ldirector pointing to 2 load balanced real servers. We had jboss on the backend listening

[Linux-HA] ldirector - jboss port

2011-09-23 Thread mike
response was guys, start each instance listening to a different port - problem solved. They're not too happy with this solution so I'm here asking - is there any way possible to load balance to several jboss instances running on the same backend servers and on the same port? Thanks -mike

Re: [Linux-HA] Simple Architecture Questions

2011-09-22 Thread mike
On 11-09-22 10:41 AM, Nick Khamis wrote: Hello Everyone, We have almost setup a working prototype of what will be our production cluster. A few simple question I have are: i) We begin the installation by creating hacluster:haclient. How bad is it to proceed with the installation as user

Re: [Linux-HA] Simple Architecture Questions

2011-09-22 Thread mike
On 11-09-22 11:45 AM, Nick Khamis wrote: Hello Mike, Thank you so much for your response. You do not need to install cluster stack on real or backend servers just the nodes that are actually part of the cluster. This is the part that I am trying to make sure I absolutely understand

Re: [Linux-HA] Simple Architecture Questions

2011-09-22 Thread mike
On 11-09-22 02:45 PM, Nick Khamis wrote: Got it! On my way Thanks Mike! Nick. On Thu, Sep 22, 2011 at 1:05 PM, mikemgbut...@nbnet.nb.ca wrote: On 11-09-22 11:45 AM, Nick Khamis wrote: Hello Mike, Thank you so much for your response. You do not need to install cluster stack on real

[Linux-HA] Assign an IP to VLAN

2011-09-20 Thread mike
I've got HA set up on 2 nodes. Very simple setup. One VIP and ldirector. The issue I am having here is bringing up the VIP on a VLAN device. For example, here is an ifconfig output before HA is started 9: eth0.101@eth0: BROADCAST,MULTICAST,UP,LOWER_UP mtu 1500 qdisc noqueue link/ether

[Linux-HA] LVS DR

2011-09-15 Thread mike
I have this set up with ldirectord on HA. Do the directors and the real backend servers have to be on the same subnet? ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also:

Re: [Linux-HA] LVS DR

2011-09-15 Thread mike
Thanks Michael - I think this is the root of my problem with my previous post. I was using DR on different subnets. Guess I'll switch to tunnel. On 11-09-15 08:11 AM, Michael Schwartzkopff wrote: I have this set up with ldirectord on HA. Do the directors and the real backend servers have to be

[Linux-HA] LVS VIP on Different subnet

2011-09-15 Thread mike
I have an LVS HA set up. All is installed and the VIP comes up and ldirectord comes up as well. However, the VIP I have been asked to assign in the cluster will reside on a different subnet than the real ip that currently resides on the device. My gut says this wont work. Here is the device

[Linux-HA] Strange behavior ldirectord direct routing

2011-09-14 Thread mike
does HA suddenly make it impossible to connect to the backend servers from the primary node? Another problem I have is that I cannot telnet to the VIP on port 8080 no matter what node it is running on. I think if I can resolve the problem above, this one will go away too. Thanks -mike

Re: [Linux-HA] Strange behavior ldirectord direct routing

2011-09-14 Thread mike
/25 brd 172.28.89.127 scope global eth0.101 inet 172.28.191.155/25 brd 172.28.191.127 scope global eth0.101 -- VIP On 11-09-14 06:37 PM, mike wrote: Hope someone here can give me some pointers. I've set up a ldirectord LinuxHA cluster. When I start it up its a simple set up. I have one VIP

[Linux-HA] An odd question - HA and LVS

2011-08-15 Thread mike
I've set up a few LVS clusters in our current environment and now I've been asked to do the following. Install HA and LVS on server 1 and server 2 The VIP will point to 2 servers - server 3 and server 1, i.e. back to itself. Now don't ask why they want to do this, sometimes I just give up

Re: [Linux-HA] Heartbeat Restart is not same as Stop and Start

2011-08-03 Thread mike
Permission problem perhaps? Not really sure what you're doing but the fact that you have users configuring the cluster (why do you do this btw?) may be pointing to a permission issue. -mgb On 11-08-03 06:57 PM, Rahul Kanna wrote: Hi, Our system setup: Heartbeat 3.0.3 DRBD (to manage file

Re: [Linux-HA] string2msg_ll: node [?] failed authentication

2011-08-02 Thread mike
Are the cib.xml and cib.xml.sig IDENTICAL on both nodes? On 11-08-01 07:49 PM, Hai Tao wrote: also I found the heartbeat messages the node 2 get is weird: [node2 ha.d]# cat /dev/ttyS0 0 0.00 1/162 16586 ttl=3 auth= 3872bbb8a107925fcdd6ea4e3716d8 ts=4e372cb5 ld=0.00 0.00 0.0086 ttl=3

Re: [Linux-HA] string2msg_ll: node [?] failed authentication

2011-08-02 Thread mike
The cib.xml* files should be in the /var/lib/heartbeat/crm directory on both nodes. On 11-08-02 04:38 PM, Hai Tao wrote: these are no such files. Thanks. Hai Tao Date: Tue, 2 Aug 2011 09:30:47 -0300 From: mgbut...@nbnet.nb.ca To: linux-ha@lists.linux-ha.org Subject: Re: [Linux-HA]

Re: [Linux-HA] replace a dead node

2011-07-30 Thread mike
heartbeat on the new box and copy the file over (or update the value in the file) on the new box. David Lang On Fri, 29 Jul 2011, mike wrote: Date: Fri, 29 Jul 2011 16:06:25 -0300 From: mikemgbut...@nbnet.nb.ca Reply-To: General Linux-HA mailing listlinux-ha@lists.linux-ha.org To: General

Re: [Linux-HA] replace a dead node

2011-07-29 Thread mike
a failover. I set it with this entry in the cib.xml: nvpair id=nvpair.id17897906 name=default-resource-failure-stickiness value=50/ Hope this helps. -mike On 11-07-22 02:55 PM, Hai Tao wrote: Does HA monitor its resources? If I manually disable the floating IP, (for example, ifdown

Re: [Linux-HA] Does HA monitor its resources?

2011-07-23 Thread mike
depending on the Linux variant you are using) and ten start apache wit service apache start, for instance. -mike On 11-07-22 05:58 PM, Hai Tao wrote: How can I disable HA without stopping the resources then? I like to disable HA by stopping the heartbeat, but once I do that a failover

Re: [Linux-HA] Does HA monitor its resources?

2011-07-22 Thread mike
in the cib.xml: nvpair id=nvpair.id17897906 name=default-resource-failure-stickiness value=50/ Hope this helps. -mike On 11-07-22 02:55 PM, Hai Tao wrote: Does HA monitor its resources? If I manually disable the floating IP, (for example, ifdown eth0:0), will HA be able to detect

[Linux-HA] ldirector problem problem

2011-07-17 Thread mike
connections are not flowing through to the back end web server.* As usual - thanks for all replies and suggestions. - Mike ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http

Re: [Linux-HA] ldirector problem problem

2011-07-17 Thread mike
protocol=tcp scheduler=lc checktype=connect checkport=80 #negotiatetimeout=10 real=192.168.2.16:80 gate #real=172.28.185.38:389 ipip #service=ldap protocol=tcp checktimeout=10 checkinterval=10 Now it works! On 11-07-17 10:35 AM, mike wrote

[Linux-HA] Confusion between pacemaker and crm

2011-07-15 Thread mike
Hi guys, I've always been a bit confused when it comes to what really is pacemaker. Now, I've installed a few clusters and in my ha.cf file I enabled crm. Am I correct in understanding that crm *is* pacemaker? The reason I ask is that I've read some documentation from this site:

Re: [Linux-HA] Urgent help required ... Corosync not getting started ... !!!

2011-06-03 Thread Mike Caldwell
On Fri, Jun 3, 2011 at 3:18 AM, Amit Jathar amit.jat...@alepo.com wrote: The Selinux is disabled. I am launching corosync with command /usr/etc/init.d/corosync start How to strace lauch of corosync ? Also, how to check the UID? strace /usr/etc/init.d/corosync start

Re: [Linux-HA] Urgent help required ... Corosync not getting started ... !!!

2011-06-03 Thread Mike Caldwell
On Fri, Jun 3, 2011 at 8:54 AM, Mike Caldwell caldwell1...@gmail.comwrote: On Fri, Jun 3, 2011 at 3:18 AM, Amit Jathar amit.jat...@alepo.com wrote: The Selinux is disabled. I am launching corosync with command /usr/etc/init.d/corosync start How to strace lauch of corosync ? Also, how

Re: [Linux-HA] Urgent help required ... Corosync not getting started ... !!!

2011-06-01 Thread Mike Caldwell
I am not able to troubleshoot the issue after chasing it for more than a day . No hint, as no logs present in /var/log/messages/ ... :( Any help is appreciable. Let me know, if you need more information. Thanks, Amit I've had more luck with logging set up with to_logfile: yes

Re: [Linux-HA] ldirectord setup

2011-05-19 Thread mike
On 11-05-19 04:41 PM, Ariel wrote: I'm starting a new LVS-DR setup with ldirectord but am unable to get it working. I started with trying to set up a single director server with a single real server. My ldirectord.cf: -- checktimeout=8 checkinterval=5 autoreload=yes

Re: [Linux-HA] HA Nodes Port 691 UDP

2011-05-18 Thread mike
On 11-05-18 09:31 AM, Randy Katz wrote: Hi, does anyone on this list know why there are UDP requests on port 691 of the HA nodes? I turned on firewalling and my crm_mon would not show both nodes' status until I allowed UDP port 691 to flow through, please advise, Regards, Randy

Re: [Linux-HA] Problem WARN: Gmain_timeout_dispatch Again

2011-05-12 Thread mike
On 11-05-12 03:53 PM, gilmarli...@agrovale.com.br wrote: Hello!I'm using heartbeat version 3.0.3-2 on debian squeeze with dedicated gigabit ethernet interface for the heartbeat. But even this generates the following message:WARN: Gmain_timeout_dispatch: Dispatch function for send local

Re: [Linux-HA] [Heartbeat] my VIP doesn't work :(

2011-04-26 Thread mike
On 11-04-22 06:25 AM, SEILLIER Mathieu wrote: Hi all, First I'm french so sorry in advance for my English... I have to use Heartbeat for High Availability between 2 Tomcat 5.5 servers under Linux RedHat 5.3. The first server is active, the other one is passive. The master is called

[Linux-HA] JAVA sun.jnu.encoding ignored when process started from BP. Not when started manually

2011-04-21 Thread Mike Toler
I have a java process that I am started by Linux HA. I have create an OCF script called BillingProcessor. That script calls an outside script (pm.pl) which starts the process. The JAVA command is shown here. Note, I am including the -Dsun.jnu.encoding=UTF-8 directive. java

[Linux-HA] ldirector and LinuxHA

2011-01-27 Thread mike
Hello All, I've successfully set up a load balancing cluster using ldirectord and LinuxHA. ldirectord.cf contains several stanzas for load balancing several backend services. All seems to work as it should with the exception of one minor detail. I have one application on a backend server that

Re: [Linux-HA] ha config

2011-01-13 Thread mike
On 11-01-12 10:28 PM, Cody Zhang wrote: Hi,All Anybody help me? Found error when run heartbeat-2.1.3-3. My configruation example: *ha.cf:* logfile /var/log/ha-log logfacility local0 keepalive 2 deadtime 30 warntime 10 initdead 120 udpport 694 bcast eth0 ucast eth0 192.168.0.60

[Linux-HA] Help me adjust my ha.cf settings

2010-11-05 Thread mike
Hi all, I'm running a simple MySQL cluster on a very heavily loaded LPAR and experiencing some outages due to late heartbeat packets, Gmain timeouts and so on. I'd like to adjust these settings: # Thresholds (in seconds) keepalive 1 warntime 6

Re: [Linux-HA] Help me adjust my ha.cf settings

2010-11-05 Thread mike
On 10-11-05 06:40 PM, Pavlos Parissis wrote: On 5 November 2010 20:32, mikemgbut...@nbnet.nb.ca wrote: Hi all, I'm running a simple MySQL cluster on a very heavily loaded LPAR and experiencing some outages due to late heartbeat packets, Gmain timeouts and so on. Before we look

[Linux-HA] MySQL unknown exec error

2010-11-04 Thread mike
Looking for a more experienced person who can explain this issue we had last night. Our backups kicked in during the night at 1AM. At 1:01AM, our mysql cluster had issues. Specifically I can see in crm_mon where the cluster has it as failed due to an unknown exec error. Looking at the

Re: [Linux-HA] MySQL unknown exec error

2010-11-04 Thread mike
On 10-11-04 12:38 PM, Dejan Muhamedagic wrote: Hi, On Thu, Nov 04, 2010 at 11:06:48AM -0300, mike wrote: Looking for a more experienced person who can explain this issue we had last night. Our backups kicked in during the night at 1AM. At 1:01AM, our mysql cluster had issues

[Linux-HA] Recommended settings for keepalive

2010-11-02 Thread mike
Hi guys, Can you tell me what you would recommend for the following settings in the ha.cf file: Here are my settings. # Thresholds (in seconds) keepalive 1 warntime 6 deadtime 10 initdead 15 Are

Re: [Linux-HA] Recommended settings for keepalive

2010-11-02 Thread mike
On 10-11-02 11:52 AM, Dejan Muhamedagic wrote: Hi, On Tue, Nov 02, 2010 at 11:13:49AM -0300, mike wrote: Hi guys, Can you tell me what you would recommend for the following settings in the ha.cf file: Here are my settings. # Thresholds (in seconds) keepalive

[Linux-HA] Stonith log entries

2010-10-13 Thread mike
Hi all, I've started building a simple 2 node http cluster. I've built several clusters so this should be a joke. I got the first node fired up and noticed these entries over and over again in my logs. Oct 13 21:18:46 Firethorn crmd: [2403]: ERROR: te_connect_stonith: Sign-in failed:

Re: [Linux-HA] Stonith log entries

2010-10-13 Thread mike
Fedora 13 on i686 btw. On 10-10-13 09:26 PM, mike wrote: Hi all, I've started building a simple 2 node http cluster. I've built several clusters so this should be a joke. I got the first node fired up and noticed these entries over and over again in my logs. Oct 13 21:18:46 Firethorn crmd

Re: [Linux-HA] Orphan resource process(es) running

2010-10-07 Thread mike
the address all is working well. On Wed, 2010-10-06 at 20:45 -0300, mike wrote: On 10-10-06 07:09 PM, AR wrote: Hi, First let me say thank you to those of you that support the project. It appears that there are orphan processes running? How do I get rid

Re: [Linux-HA] Orphan resource process(es) running

2010-10-06 Thread mike
On 10-10-06 07:09 PM, AR wrote: Hi, First let me say thank you to those of you that support the project. It appears that there are orphan processes running? How do I get rid of these? # crm_verify -LVV crm_verify[31892]: 2010/10/06_14:55:10 WARN: process_orphan_resource: Nothing known

Re: [Linux-HA] Built in Affinity with LVS and LinuxHA?

2010-09-10 Thread Mike
server somehow took ownership of the VIP and as a result was grabbing all requests. A reboot resolved it. Can you tell me why this may have happened? What could be wrong on my backend servers that they would grab the VIP like this? Any help would be appreciated greatly. Mike On Thu, 2010-09-09

[Linux-HA] Built in Affinity with LVS and LinuxHA?

2010-09-07 Thread Mike
Hello all, I've implemented a LVS cluster using ldirectord and LinuxHA. Here is a snippet from my ldirectord.cf file: virtual=172.28.185.54:8080 protocol=tcp scheduler=wrr checktype=connect checkport=8080 #service=ldap real=172.28.185.57:8080 ipip

[Linux-HA] How much should I worry about these entries?

2010-07-21 Thread mike
in my ha logs I have the entries that appear several times a night. Now I know in a previous post I was told these were indicative of resource contention. These clusters that are seeing these messages are on a zVM LPAR so they share CPU, memory and so on. Previously when we saw these errors,

[Linux-HA] Significance of these entries

2010-07-16 Thread mike
Hi All, I have a very simple Apache cluster with 3 VIPs. I've seen a few failovers that have me stumped. The logs certainly indicate a problem but they don't fully tell the story. I'll give a summary of the logs here - anyone got any ideas why a failover occurred? Jul 15 16:00:09

Re: [Linux-HA] Significance of these entries

2010-07-16 Thread mike
On 10-07-16 09:24 AM, mike wrote: Hi All, I have a very simple Apache cluster with 3 VIPs. I've seen a few failovers that have me stumped. The logs certainly indicate a problem but they don't fully tell the story. I'll give a summary of the logs here - anyone got any ideas why a failover

[Linux-HA] Why the stonith messages?

2010-06-14 Thread mike
Got a simple 2 node active/standby cluster with stonith disabled in my cib. This morning I logged on to the server to see /var filled up over the weekend. Looking at the ha-log file I can see why. It was polluted with stonith messages. Specifically, messages that read Can't initiate connection

Re: [Linux-HA] HA configuration problems

2010-06-09 Thread mike
see what is running and what isn't. Mike ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems

[Linux-HA] Ldirectord and jboss port

2010-06-03 Thread mike
We've got an application serving up JBoss on port 8080. I'm using LVS to load balance and have incorporated ldirecord into LinuxHA to provide a higly available LVS-tunnel cluster. We found that by starting up Jboss on the real servers and listening on the RIP that for some reason, LVS cannot

Re: [Linux-HA] explain the difference between servers?

2010-05-31 Thread mike
Nikita Michalko wrote: Hi mike, it seems to be no HA-problem anymore though, but: Am Montag, 31. Mai 2010 01:29 schrieb mike: So I've got ldirector up and running just fine and providing ldap high availability to 2 backend real servers on port 389. Here is the output of netstat

Re: [Linux-HA] explain the difference between servers?

2010-05-31 Thread mike
mike wrote: Nikita Michalko wrote: Hi mike, it seems to be no HA-problem anymore though, but: Am Montag, 31. Mai 2010 01:29 schrieb mike: So I've got ldirector up and running just fine and providing ldap high availability to 2 backend real servers on port 389. Here

[Linux-HA] explain the difference between servers?

2010-05-30 Thread mike
So I've got ldirector up and running just fine and providing ldap high availability to 2 backend real servers on port 389. Here is the output of netstat on both real servers: tcp0 0 0.0.0.0:389 0.0.0.0:* LISTEN tcp0 0 :::389

[Linux-HA] odd issues with LinuxHA/ldirector

2010-05-28 Thread mike
Anyone ever see an issue where ldirector would not pass requests to 2 backend real servers on a certain port (in my case 8080) but if you change that to port 22, it works flawlessly? Its really strange that it would work on one port but not another. Any hints?

Re: [Linux-HA] odd issues with LinuxHA/ldirector

2010-05-28 Thread mike
Pushkar Pradhan wrote: From: linux-ha-boun...@lists.linux-ha.org on behalf of mike Sent: Fri 5/28/2010 10:01 AM To: General Linux-HA mailing list Subject: [Linux-HA] odd issues with LinuxHA/ldirector Anyone ever see an issue where ldirector would

Re: [Linux-HA] odd issues with LinuxHA/ldirector

2010-05-28 Thread mike
Pushkar Pradhan wrote: From: linux-ha-boun...@lists.linux-ha.org on behalf of mike Sent: Fri 5/28/2010 12:08 PM To: General Linux-HA mailing list Subject: Re: [Linux-HA] odd issues with LinuxHA/ldirector Pushkar Pradhan wrote

[Linux-HA] Help with LinuxHA/ldirector

2010-05-25 Thread mike
I'm at my wits end here folks and I'm looking for some help. Currently when the cluster is up and running on lvsuat1a, requests for ldap come through the VIP and then get passed out to each real server in a fairly round robin format. However, when I fail the cluster over I am seeing some odd

Re: [Linux-HA] Harmless log entries

2010-05-20 Thread mike
asking here is, is the 15 minute failure-timeout a rolling thing that gets reset or is it a one shot deal, i.e. once ignored the first time always ignored from that point on? Thank you Andrew Mike Andrew Beekhof wrote: On Wed, May 19, 2010 at 5:22 PM, mike mgbut...@nbnet.nb.ca wrote: Andrew

Re: [Linux-HA] Harmless log entries

2010-05-20 Thread mike
What did I miss? Must have been something. mike wrote: So to see if I understand correctly a couple scenarios: Assume a failure-timeout of 15 minutes 1. lets assume I have 2 failures within 5 minutes and then no failure for 20 minutes afterwards. After that 20 minutes I have a failure

Re: [Linux-HA] Harmless log entries

2010-05-20 Thread mike
Gianluca Cecchi wrote: On Thu, May 20, 2010 at 2:45 PM, mike mgbut...@nbnet.nb.ca wrote: ok, I actually went ahead and did a test on my cluster. The results did not occur as I would have expected. I failed ldirectord twice on the main node. I waited 20 minutes and saw this entry

Re: [Linux-HA] Harmless log entries

2010-05-19 Thread mike
I assume Andrew means 15 minutes * 60 = 900 seconds * 1000 = 90 milliseconds Vadym Chepkov wrote: On May 19, 2010, at 2:51 AM, Andrew Beekhof wrote: which is what my DBA was looking for. He wants mysql to failover if there are 3 successive failures of MySQL but only if those

Re: [Linux-HA] Harmless log entries

2010-05-19 Thread mike
Andrew Beekhof wrote: which is what my DBA was looking for. He wants mysql to failover if there are 3 successive failures of MySQL but only if those successive failures occur within 15 minutes. You want migration-threshold=3 and failure-timeout=90 (15 * 60 * 1000 Thanks Andrew,

[Linux-HA] ldirectord and ldap problems

2010-05-19 Thread mike
and correctly land on the proper server, i.e. the backup node. So I think this kinda rules out some weird Arp table issue on a switch somewhere. Help me out here guys, where do I look? Mike ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http

[Linux-HA] Harmless log entries

2010-05-18 Thread mike
the scores automatically. My only question is, do I only see this message if the scores have been rolled back and what is responsible to firing this thing off in the first place? Thanks Mike ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http

[Linux-HA] Help with ldirectord and round robin

2010-05-18 Thread mike
servers I issue the command from continually hitting the same server. Certainly the results in the slapd log files are anything BUT round robin. Can anyone help me out here with either my understanding of the round robin set up or tell me what to change in my config file above? Thanks guys Mike

[Linux-HA] Problems starting ldirectord

2010-05-17 Thread mike
Hello All, I've set up ldirectord so that I am able to start it from the command line like so: service ldirectord start The ldirectord.cf file is in place and ipvsadm shows the connections as I would expect them to. Incorporating this into heartbeat should be a joke but I'm running into a

Re: [Linux-HA] Issues with Heartbeat/DRBD over Internet connection

2010-05-12 Thread Mike Sweetser
On Wed, May 12, 2010 at 3:16 AM, Lars Ellenberg lars.ellenb...@linbit.comwrote: On Tue, May 11, 2010 at 01:35:17PM -0700, Mike Sweetser wrote: Hello, I've set up a DRBD and Heartbeat configuration communicating over an Internet connection, rather than internal. The servers are running

[Linux-HA] Issues with Heartbeat/DRBD over Internet connection

2010-05-11 Thread Mike Sweetser
Hello, I've set up a DRBD and Heartbeat configuration communicating over an Internet connection, rather than internal. The servers are running CentOS 5.4, with DRBD 8.3.2 and Heartbeat 3.0.3, out of the CentOS repository. I start seeing these in the ha-log. ERROR: Message hist queue is filling

Re: [Linux-HA] Issues with Heartbeat/DRBD over Internet connection

2010-05-11 Thread Mike Sweetser
Is that the problem or the solution? What should I change them to? On Tue, May 11, 2010 at 1:50 PM, Andrew Beekhof and...@beekhof.net wrote: A reliable (and fast) internet connection combined with very aggressive timeouts in ha.cf ___ Linux-HA

[Linux-HA] A good cibadmin guide

2010-05-05 Thread mike
Hi guys, I wonder if someone might be able to point me to a good cibadmin guide. Maybe its something someone wrote on their own, I really am not picky here. I would like to get my hands on a decent doc that I could read and get to know how to do a few things a little better. Thanks

Re: [Linux-HA] MySQL and 4 instances

2010-05-03 Thread mike
Florian Haas wrote: On 2010-05-03 09:24, Andrew Beekhof wrote: On Thu, Apr 29, 2010 at 7:37 PM, mike mgbut...@nbnet.nb.ca wrote: Hello all, We had a simple 2 node MySQL cluster - nothing special. One instance that worked perfectly. We recently added 3 instances and now we're

[Linux-HA] MySQL and 4 instances

2010-04-29 Thread mike
Hello all, We had a simple 2 node MySQL cluster - nothing special. One instance that worked perfectly. We recently added 3 instances and now we're having some issues. The problem is that Heartbeat issues a MySQL Status immediately after the MySQL Start .. and of course the MySQL Status will

Re: [Linux-HA] Clarify Apache failover please?

2010-04-08 Thread mike
, at least not right now. I want to simulate a case where httpd will not start. right now, all that appears to happen is the cluster keeps trying to start httpd on the primary node. I'm obviously missing something because this way it is set up is certainly not highly available. mike wrote

  1   2   >