Dear HA man,

I have run into a problem while testing my HA system, as follows:



My HA system contains only two nodes. When I suddenly cut off the primary node's power, the secondary node takes over all the services immediately. There are two cases:

1> If I don't start the original primary node within 15 minutes (that is, I just keep the cluster in degraded mode), heartbeat reports errors like "Can not write to pipe 1" about 15 minutes later. Then it stops all services and kills itself.

2> If I start the original primary node again within 15 minutes, there is no error. Everything goes just as the website describes.




My guess is that heartbeat doesn't detect that the other node is dead.

Here is my environment:

OS: Red Hat Enterprise Linux WS 4.0, kernel 2.6.9-22, x86_64

Heartbeat: 2.1.2 (2.0.8 also tested, with the same problem)

Install: tar.gz

Cluster: works with drbd-8.0.0



The ha.cf, haresources, ha-log and ha-debug files are attached.



Thanks for your attention. I'm looking forward to your reply.



Good Luck!



                                yours,

                                        ajie
