On Feb 10, 2008, at 9:38 PM, Andreas Mather1 wrote:

***********************
Warning: Your file, report_1.tar.gz, contains more than 32 files after decompression and cannot be scanned.
***********************




Hi all,

Please find attached a hb_report for a problem I experienced when
implementing heartbeat.

Logs are only included for one node which is odd.



The environment:
It's an asymmetric 4 node cluster, running heartbeat 2.1.3. All nodes share a couple of filesystems, all GPFS formatted. Services inlcude WebSphere (modified RA), DB2 (modified RA), vsftpd (Xinetd), samba, nfs, MCS (self written RA), IHS and are put in 4 groups (filesvc, mcs, was, db). Dejan is
also familiar with the setup.
OS: SLES 9.3 (x86_64)
hearbeat: build via ./ConfigureMe package


The Problem:
In general, everything works fine (crm_standby works for every node, etc.), but, when I simulate a power loss of one node (via IBM RSA)*, a cluster split occurs when this node rejoins. Suddenly, on every node, crm_mon shows the node it is running on as 'online' while reporting the other nodes as 'OFFLINE'. After 1 - 2 min. the cluster is fully operational again (all nodes found themself again), but it seems as every resource gets restarted.

I can easily believe this would happen.

With all the membership changes I can see in the logs, you lost quorum at which point <nvpair id="cib-bootstrap-options-no_quorum-policy" name="no_quorum-policy" value="stop"/>
kicked in and all your resources would have been stopped.

Then later, when the ccm sorted itself out and the membership returned to normal, the resources would have been started again.


No idea why the CCM went haywire though.
Probably best to log a bug.



Please let me know, if I can provide further information.

Thanks,

Andreas


* Sorry, I forgot to test what happens, when I just stop and start
heartbeat on that node - would be useful too, I think... :(




(See attached file: report_1.tar.gz)

Mit freundlichen Grüßen / Best regards

Andreas MATHER
ESLT - Enterprise Services for Linux Technologies

IBM Austria, Obere Donaustrasse 95, 1020 Vienna
Phone : +43-1-21145/4799
Fax: +43-1-21145/8888
e-mail: [EMAIL PROTECTED]

IBM Österreich Internationale Büromaschinen Gesellschaft m.b.H.
Sitz: Wien
Firmenbuchgericht: Handelsgericht Wien, FN 80000y
< report_1 .tar.gz>_______________________________________________________
Linux-HA-Dev: [email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev
Home Page: http://linux-ha.org/

_______________________________________________________
Linux-HA-Dev: [email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev
Home Page: http://linux-ha.org/

Reply via email to