Re: [Linux-HA] HA of virtual machiens

Andrew Beekhof Tue, 05 Feb 2008 00:24:31 -0800


On Feb 5, 2008, at 12:30 AM, Amos Shapira wrote:

On Feb 4, 2008 7:32 PM, Andrew Beekhof <[EMAIL PROTECTED]> wrote:
Crashing?
What was the subject?  I don't recall this.
I couldn't make CentOS 5's 2.1.3 talk to another node when configuring with
the version 2 style CRM, at some stage I learned that not all programs
manage to start and stay up. Later also found (I think) that "stonith -h" or
something like this always bombs on some interrupt.
I don't remember all the details but the thread where I asked about this is
archived in
http://lists.community.tummy.com/pipermail/linux-ha/2007-November/029068.html



Sorry, I must have missed this thread.

I eventually switched to using the old-style haresources config file and
things seem to work OK with that.

24 heartbeat[17482]: 2007/11/29_07:12:41 info: Status update for node drbd01.test.spammatters.local: status up
25 heartbeat[17482]: 2007/11/29_07:13:45 info: all clients are now paused

line 25 is sure to be part of the problem, but also I don't see any evidence that heartbeat even tried to start the crm processes.


this is also interesting...

13 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
14 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: bound send socket to device: eth0
15 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: bound receive socket to device: eth0
16 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: started on port 695 interface eth0 to 192.168.0.248
17 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
18 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: bound send socket to device: eth0
19 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: bound receive socket to device: eth0
20 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: started on port 695 interface eth0 to 192.168.0.249

I wonder if the fact that there are two IPs on eth0 could have been causing problems.
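
If it helps to check other logs for the same pattern, here is a rough sketch that lists the ucast peers heartbeat reports per interface. It assumes the "ucast: started on port ... interface ... to ..." message format seen in the log above, and it reads "two IPs on eth0" as the two ucast peer addresses, which is only one possible interpretation. The function name ucast_peers_by_interface is made up for this example and is not part of heartbeat.

```python
import re
from collections import defaultdict

# Matches heartbeat's "ucast: started on port N interface IF to IP" message
# (format assumed from the log excerpt in this thread).
UCAST_STARTED = re.compile(
    r"ucast:\s*started on port (\d+) interface (\S+) to (\d+\.\d+\.\d+\.\d+)"
)


def ucast_peers_by_interface(log_text):
    """Return {interface: [peer_ip, ...]} for every ucast link in the log."""
    peers = defaultdict(list)
    for match in UCAST_STARTED.finditer(log_text):
        _port, interface, peer_ip = match.groups()
        if peer_ip not in peers[interface]:
            peers[interface].append(peer_ip)
    return dict(peers)


# Two lines taken from the excerpt above, one per line.
sample = """\
heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: started on port 695 interface eth0 to 192.168.0.248
heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: started on port 695 interface eth0 to 192.168.0.249
"""

# Report any interface that carries more than one ucast peer.
for interface, ips in ucast_peers_by_interface(sample).items():
    if len(ips) > 1:
        print(f"{interface}: {len(ips)} ucast peers: {', '.join(ips)}")
```

An interface showing more than one peer is not necessarily wrong; this only makes the situation easy to spot before digging further.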

Oh, and the reason crm_mon was taking so long is related to your choice of deadtime which was quite high.



Thanks,

--Amos
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

