Hi I hope I'm doing this correctly. I just joined this list after I tried looking for help with the HAProxy people.
I'm using HAProxy and Heartbeat on two Ubuntu 8.04 servers. I have two Xen nodes on each of my physical machines. One is the load balance and Heartbeat (LB1), the other is the actual LAMP web server (WEB1). Testing HAProxy/Heartbeat setup seems that it's working fine, by that I mean that shutting off one of the web servers, it switches to the other one. My problem is I keep loosing access to the HAProxy stats page. I know that isn't a huge problem, but I'm worried it might be a sign of a bigger problem somewhere. The stats show up fine for about 15-20 minutes, then I get a apache generic 404 error page. I also see: "File does not exist: /var/www/apache2-default/haproxy" show up in the apache error log as soon as I loose it. If I go back to my LB1 node and restart Heartbeat, it all comes back for another 15-20 minutes. There's nothing in any of the logs that I can see, other than it stops logging when it happens. I use http://192.168.31.100/haproxy?stats to get to that stats page. The .100 is the shared address between the the load balancers. If I use 192.168.31.201, which is LB1, I get the browser's 404 notice. If I use .100, it shows my apache generic 404 page. So somehow it stops seeing LB1, and goes to port 80 on my web server on the WEB1 node. That's where I see the apache error saying it can't find the HAProxy stats page. When I used the "tcpdump -q -i eth0 tcp port 80 and src host 192.168.31.100" command, it showed me looking at the stats, and the test web page: tcpdump: verbose output suppressed, use -v or -vv for full protocol decode listening on eth0, link-type EN10MB (Ethernet), capture size 96 bytes 11:23:16.106664 IP 192.168.31.100.www > 192.168.30.64.2289: tcp 0 11:23:16.254209 IP 192.168.31.100.www > 192.168.30.64.2289: tcp 0 11:23:16.254409 IP 192.168.31.100.www > 192.168.30.64.2289: tcp 262 11:23:16.254501 IP 192.168.31.100.www > 192.168.30.64.2289: tcp 0 11:23:17.460534 IP 192.168.31.100.www > 192.168.30.64.2290: tcp 0 11:23:17.628385 IP 192.168.31.100.www > 192.168.30.64.2290: tcp 0 11:23:17.628590 IP 192.168.31.100.www > 192.168.30.64.2290: tcp 2712 11:23:17.839448 IP 192.168.31.100.www > 192.168.30.64.2290: tcp 2712 11:23:17.839460 IP 192.168.31.100.www > 192.168.30.64.2290: tcp 524 Once I couldn't see the stats page again, the output stopped completely. I watched it on LB2 as well. It seems like it stops listing to the .100 IP address. If I use "tcpdump -q -i eth0 tcp port 80" I see LB1 checking web1 and web2, but nothing on the .100 address. The HAProxy people said they thought it might be a Heartbeat problem, because after they checked my HAProxy setup, they couldn't find any problems there. Sorry for the long post, I'm just getting desperate for some help. Thanks, Tom _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
