Hi,

On Sat, May 23, 2009 at 01:13:53PM -0400, Tom Potwin wrote:
> Hi
> 
> I hope I'm doing this correctly. I just joined this list after I tried
> looking for help with the HAProxy people.
> 
> I'm using HAProxy and Heartbeat on two Ubuntu 8.04 servers. I have two Xen
> nodes on each of my physical machines. One is the load balance and Heartbeat
> (LB1), the other is the actual LAMP web server (WEB1). Testing
> HAProxy/Heartbeat setup seems that it's working fine, by that I mean that
> shutting off one of the web servers, it switches to the other one. My
> problem is I keep loosing access to the HAProxy stats page. I know that
> isn't a huge problem, but I'm worried it might be a sign of a bigger problem
> somewhere. 
>
> The stats show up fine for about 15-20 minutes, then I get a apache generic
> 404 error page. I also see: "File does not exist:
> /var/www/apache2-default/haproxy" show up in the apache error log as soon as
> I loose it. If I go back to my LB1 node and restart Heartbeat, it all comes
> back for another 15-20 minutes. There's nothing in any of the logs that I
> can see, other than it stops logging when it happens. I use
> http://192.168.31.100/haproxy?stats to get to that stats page. The .100 is
> the shared address between the the load balancers. If I use 192.168.31.201,
> which is LB1, I get the browser's 404 notice. If I use .100, it shows my
> apache generic 404 page. So somehow it stops seeing LB1, and goes to port 80
> on my web server on the WEB1 node. That's where I see the apache error
> saying it can't find the HAProxy stats page.
> 
> When I used the "tcpdump -q -i eth0 tcp port 80 and src host 192.168.31.100"
> command, it showed me looking at the stats, and the test web page:
> tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
> listening on eth0, link-type EN10MB (Ethernet), capture size 96 bytes
> 11:23:16.106664 IP 192.168.31.100.www > 192.168.30.64.2289: tcp 0
> 11:23:16.254209 IP 192.168.31.100.www > 192.168.30.64.2289: tcp 0
> 11:23:16.254409 IP 192.168.31.100.www > 192.168.30.64.2289: tcp 262
> 11:23:16.254501 IP 192.168.31.100.www > 192.168.30.64.2289: tcp 0
> 11:23:17.460534 IP 192.168.31.100.www > 192.168.30.64.2290: tcp 0
> 11:23:17.628385 IP 192.168.31.100.www > 192.168.30.64.2290: tcp 0
> 11:23:17.628590 IP 192.168.31.100.www > 192.168.30.64.2290: tcp 2712
> 11:23:17.839448 IP 192.168.31.100.www > 192.168.30.64.2290: tcp 2712
> 11:23:17.839460 IP 192.168.31.100.www > 192.168.30.64.2290: tcp 524
> 
> Once I couldn't see the stats page again, the output stopped completely. I
> watched it on LB2 as well. It seems like it stops listing to the .100 IP
> address. If I use "tcpdump -q -i eth0 tcp port 80" I see LB1 checking web1
> and web2, but nothing on the .100 address. The HAProxy people said they
> thought it might be a Heartbeat problem, because after they checked my
> HAProxy setup, they couldn't find any problems there. Sorry for the long
> post, I'm just getting desperate for some help. 

OK. Doubt that this is a heartbeat problem, because they
typically get excercised immediately and not wait for 15 minutes
to do so. Anyway, can't say more unless you provide the
configuration and logs. Which heartbeat version do you use? What
kind of configuration (haresources or v2/CRM)?

BTW, did you check the apache logs, i.e. is that file (a cgi
script I guess) really missing or is there something else. Are
all processes which are supposed to be running there?

Thanks,

Dejan

> Thanks, Tom
> 
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to