Dear Linux-ha,
I have had terrible results running V2.0.8 with the GUI.
I have been working on this for a month and have consulted all of the scant
documentation on this product.
These are off-line servers, servicing no data, and being used by nobody.
I dread to think what would happen if I let people use them for
real. I have listed some problems below, followed by my cib.xml.
This version seems to be very far from production-stable. I can
wait for a new version, or suggest to my managers that we purchase
something else, which I don't want to do because I actually like the product.
Can anybody give reassurance that there are future versions out there
which are going to be better?
Is there anything I can do to help the process?
Regards,
Ben
Some problems:
I've been working on several sets of SUSE 10.1 servers, all with a fresh
and successful compile.
1. The OCF resource agent for DRBD doesn't work.
2. The OCF resource agent for MySQL doesn't work.
3. Often the Start, Stop and Refresh of resources does nothing.
4. The GUI reports different state from the physical state of the servers.
5. Sometimes it is impossible to stop Heartbeat, and the servers have to be
power-cycled.
6. The 'Monitor' -> 'Restart' operation will return 'running', yet
Heartbeat will still restart the resource after every start.
This is only cured by a reboot.
7. There seems to be absolutely no way of telling Heartbeat to bring down
the node if the ping to the router fails. This is not helped by some
documentation on the subject suggesting options that don't exist.
8. More than once a day Heartbeat will restart some or all resources for
no apparent reason.
9. The /var/log/messages file reports so many warnings and errors, many
hundreds a minute, that checking it is impossible.
10. Options from the GUI are sometimes not saved.
11. Often the options the GUI displays differ from what Heartbeat is
holding in memory.
12. On a good clean compile of Heartbeat, the GUI is not installed.
13. Refreshing a stale resource sometimes does nothing, again requiring a restart.
14. A monitor which fails sometimes restarts the wrong resource.
etc.
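To make item 7 concrete, the behaviour wanted could be expressed roughly as the
location constraint sketched below. It is untested on 2.0.8 and rests on
assumptions: that the pingd daemon is running on both nodes and publishes a
node attribute named "pingd", and that the rule and expression attributes
shown are accepted by this version's DTD. The ids are made up for
illustration; only the group id comes from my cib.xml.

```xml
<!-- Sketch only: assumes pingd is running and sets a node attribute named "pingd". -->
<!-- All ids here are hypothetical; rsc refers to the first group in the cib.xml below. -->
<rsc_location id="G_dbms-07-03_connected" rsc="G_dbms-07-03">
  <rule id="G_dbms-07-03_connected_rule" score="-INFINITY" boolean_op="or">
    <!-- The node has never reported any connectivity value -->
    <expression id="G_dbms-07-03_connected_undef"
                attribute="pingd" operation="not_defined"/>
    <!-- The node currently reaches none of its ping targets -->
    <expression id="G_dbms-07-03_connected_zero"
                attribute="pingd" operation="lte" value="0"/>
  </rule>
</rsc_location>
```

Strictly, a rule like this would keep the group off a node that cannot reach
the router rather than bring the node itself down, and a matching constraint
would be needed for G_dbms-07-04.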
My cib.xml:
<cib generated="true" admin_epoch="0" have_quorum="true"
ignore_dtd="false" num_peers="2" cib_feature_revision="1.3"
ccm_transition="8" dc_uuid="eada0f23-7df8-41fd-834e-10983c269d9f"
epoch="11" num_updates="1868" cib-last-written="Fri Jun 22 07:36:41 2007">
<configuration>
<crm_config>
<cluster_property_set id="cib-bootstrap-options">
<attributes>
<nvpair id="id-no-quorum-policy" name="no-quorum-policy"
value="ignore"/>
<nvpair
id="cib-bootstrap-options-default-resource-stickiness"
name="default-resource-stickiness" value="INFINITY"/>
<nvpair name="last-lrm-refresh"
id="cib-bootstrap-options-last-lrm-refresh" value="1182407100"/>
</attributes>
</cluster_property_set>
</crm_config>
<nodes>
<node id="eada0f23-7df8-41fd-834e-10983c269d9f" uname="hp-tm-05"
type="normal"/>
<node id="8fc1a24b-e7da-400e-96d6-d3308ccbe6fa" uname="hp-tm-03"
type="normal"/>
</nodes>
<resources>
<group id="G_dbms-07-03">
<primitive class="ocf" type="IPaddr2" provider="heartbeat"
id="R_dbms-07-03_ip">
<instance_attributes id="R_dbms-07-03_ip_instance_attrs">
<attributes>
<nvpair id="2d10055d-33df-44ba-9500-2082959aecd9"
name="ip" value="172.16.14.99"/>
<nvpair id="47b640e2-e314-4cdd-b203-630691c82bc9"
name="nic" value="eth0"/>
<nvpair id="12c00c99-21d7-42bd-8518-98d0b551aa87"
name="cidr_netmask" value="255.255.255.0"/>
</attributes>
</instance_attributes>
</primitive>
<primitive class="heartbeat" type="drbddisk"
provider="heartbeat" id="R_dbms-07-03_drbd">
<instance_attributes id="R_dbms-07-03_drbd_instance_attrs">
<attributes>
<nvpair id="bfe8bff9-a38f-4892-8348-abbe6abebbe1"
name="1" value="dbms-07-03"/>
</attributes>
</instance_attributes>
</primitive>
<primitive class="ocf" type="Filesystem" provider="heartbeat"
id="R_dbms-07-03_mount">
<instance_attributes id="R_dbms-07-03_mount_instance_attrs">
<attributes>
<nvpair id="7164e75a-67c5-4ab2-a635-2f41db06494e"
name="device" value="/dev/drbd0"/>
<nvpair id="140b19d2-5e4f-40a6-b412-541973a137d3"
name="directory" value="/dbms-07-03"/>
<nvpair id="83313dba-d213-4e82-97c8-40aabf915971"
name="fstype" value="reiserfs"/>
</attributes>
</instance_attributes>
</primitive>
<primitive class="heartbeat" type="mysql-rt"
provider="heartbeat" id="R_dbms-07-03_mysql">
<instance_attributes id="R_dbms-07-03_mysql_instance_attrs">
<attributes>
<nvpair id="0d75ea45-4a7a-406b-ae71-753aa50cf59c"
name="1" value="/dbms-07-03"/>
<nvpair id="c1cf1472-214c-4722-8d14-df3117e9c1dd"
name="2" value="172.16.14.99"/>
</attributes>
</instance_attributes>
<operations>
<op id="95fa19ee-762a-4e59-b1c9-e66229759a92"
name="monitor" interval="20" timeout="20" start_delay="60"
prereq="nothing" on_fail="restart"/>
</operations>
</primitive>
<primitive class="ocf" type="MailTo" provider="heartbeat"
id="R_dbms-07-03_email">
<instance_attributes id="R_dbms-07-03_email_instance_attrs">
<attributes>
<nvpair name="email"
id="d0ae30eb-b413-49c3-9afa-de52bc47d9bb" value="[EMAIL PROTECTED]"/>
<nvpair id="cd578e7f-2b82-4c2d-8af1-c200cea5d6db"
name="subject" value="Heartbeat_Resource_dbms-07-03"/>
</attributes>
</instance_attributes>
</primitive>
<primitive class="ocf" type="AudibleAlarm"
provider="heartbeat" id="R_dbms-07-03_beep">
<instance_attributes id="R_dbms-07-03_beep_instance_attrs">
<attributes>
<nvpair id="1660eddf-a7a1-4765-aad9-1a3d22e2932a"
name="nodelist" value="hp-tm-03"/>
</attributes>
</instance_attributes>
</primitive>
<instance_attributes id="G_dbms-07-03_instance_attrs">
<attributes/>
</instance_attributes>
</group>
<group id="G_dbms-07-04">
<primitive class="ocf" type="IPaddr2" provider="heartbeat"
id="R_dbms-07-04_ip">
<instance_attributes id="R_dbms-07-04_ip_instance_attrs">
<attributes>
<nvpair id="22710abe-0b68-4cb9-a9e3-a5437da16507"
name="ip" value="172.16.14.100"/>
<nvpair id="1842d0fa-0936-48d0-a3a7-d8d09cdf465b"
name="nic" value="eth0"/>
<nvpair id="56345d27-9259-4ae0-9b1c-cf5d57477232"
name="cidr_netmask" value="255.255.255.0"/>
</attributes>
</instance_attributes>
</primitive>
<primitive class="heartbeat" type="drbddisk"
provider="heartbeat" id="R_dbms-07-04_drbd">
<instance_attributes id="R_dbms-07-04_drbd_instance_attrs">
<attributes>
<nvpair id="56af3e04-86b8-4ea0-b434-8f9679a4bc1a"
name="1" value="dbms-07-04"/>
</attributes>
</instance_attributes>
</primitive>
<primitive class="ocf" type="Filesystem" provider="heartbeat"
id="R_dbms-07-04_mount">
<instance_attributes id="R_dbms-07-04_mount_instance_attrs">
<attributes>
<nvpair id="06fb4dd9-36d6-4489-bc16-c008472129a2"
name="device" value="/dev/drbd1"/>
<nvpair id="d5e73893-7d50-4de2-91f0-559389536a23"
name="directory" value="/dbms-07-04"/>
<nvpair id="5696a879-f840-407a-acd3-4080143b6ccd"
name="fstype" value="reiserfs"/>
</attributes>
</instance_attributes>
</primitive>
<primitive class="heartbeat" type="mysql-rt"
provider="heartbeat" id="R_dbms-07-04_mysql">
<instance_attributes id="R_dbms-07-04_mysql_instance_attrs">
<attributes>
<nvpair id="fc21452e-bdde-44a5-968a-27864f3cd334"
name="1" value="/dbms-07-04"/>
<nvpair id="aaac255c-21e4-48a1-a1f8-c2eac42e49c7"
name="2" value="172.16.14.100"/>
</attributes>
</instance_attributes>
<operations>
<op name="monitor" timeout="20" start_delay="60"
on_fail="restart" role="Started" prereq="nothing" disabled="false"
id="97c5c7ea-9937-41e0-bac3-2f59a2265265" interval="20"/>
</operations>
</primitive>
<primitive class="ocf" type="MailTo" provider="heartbeat"
id="R_dbms-07-04_email">
<instance_attributes id="R_dbms-07-04_email_instance_attrs">
<attributes>
<nvpair name="email"
id="c3bd18a3-6046-4623-be14-98450eea77d5" value="[EMAIL PROTECTED]"/>
<nvpair id="45562bc4-04a7-46cf-9c56-099b005f45c0"
name="subject" value="Heartbeat_Reource_dbms-07-04"/>
</attributes>
</instance_attributes>
</primitive>
<primitive class="ocf" type="AudibleAlarm"
provider="heartbeat" id="R_dbms-07-04_beep">
<instance_attributes id="R_dbms-07-04_beep_instance_attrs">
<attributes>
<nvpair id="af204ba1-8a4e-4c5c-80bd-df641f5577c2"
name="nodelist" value="hp-tm-05"/>
</attributes>
</instance_attributes>
</primitive>
<instance_attributes id="G_dbms-07-04_instance_attrs">
<attributes/>
</instance_attributes>
</group>
</resources>
<constraints>
<rsc_location id="P_dbms-07-03" rsc="G_dbms-07-03">
<rule id="prefered_P_dbms-07-03" score="100">
<expression attribute="#uname"
id="3352abd4-cd7e-42dd-83e5-5dba2df870f9" operation="eq" value="hp-tm-05"/>
</rule>
</rsc_location>
<rsc_location id="P_dbms-07-04" rsc="G_dbms-07-04">
<rule id="prefered_P_dbms-07-04" score="100">
<expression attribute="#uname"
id="bcb58d7a-4609-4e8f-9707-2288aaeefb88" operation="eq" value="hp-tm-03"/>
</rule>
</rsc_location>
</constraints>
</configuration>
</cib>