Hello list!

I am trying to upgrade heartbeat 2.1.3 (pacemaker 0.6) to 3.0.3 on SLES11/SP1. After installing the new RPMs from
http://download.opensuse.org/repositories/network:/ha-clustering/SLE_11_SP1/x86_64/
I see the following errors in the ha-log:

...
WARN: do_cib_control: Couldn't complete CIB registration 30 times... pause and retry
ERROR: do_cib_control: Could not complete CIB registration 30 times... hard error
...

The cib.xml has the proper ownership and permissions (I think):

-rw------- 1 hacluster haclient 3474 2010-09-08 12:25 /var/lib/heartbeat/crm/cib.xml
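For anyone who wants to check the ownership and mode from a script rather than by eye, here is a minimal sketch. The function names `describe_cib_file` and `cib_file_ok` are made up for illustration, and the expected values (mode 0600, owner hacluster, group haclient) are simply taken from the `ls` output above; I am assuming, not asserting, that this is what the cib daemon requires.

```python
import grp
import os
import pwd
import stat


def describe_cib_file(path):
    """Return (mode, owner, group) for path, e.g. ('0600', 'hacluster', 'haclient')."""
    st = os.stat(path)
    # Permission bits only, as a zero-padded octal string.
    mode = format(stat.S_IMODE(st.st_mode), "04o")
    try:
        owner = pwd.getpwuid(st.st_uid).pw_name
    except KeyError:
        owner = str(st.st_uid)  # uid has no passwd entry
    try:
        group = grp.getgrgid(st.st_gid).gr_name
    except KeyError:
        group = str(st.st_gid)  # gid has no group entry
    return mode, owner, group


def cib_file_ok(path, want_owner="hacluster", want_group="haclient"):
    """True if path matches the ls output quoted above (assumed to be correct)."""
    mode, owner, group = describe_cib_file(path)
    return mode == "0600" and owner == want_owner and group == want_group


if __name__ == "__main__":
    cib = "/var/lib/heartbeat/crm/cib.xml"
    print(describe_cib_file(cib), cib_file_ok(cib))
```

A mismatch here would only rule permissions in or out as a factor; it says nothing about whether the CIB content itself is acceptable.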
Verifying the CIB:

crm_verify -VVV -x cib.xml
crm_verify[24558]: 2010/09/08_13:20:17 notice: update_validation: Upgrading (null)-style configuration to pacemaker-0.6 with no-op
crm_verify[24558]: 2010/09/08_13:20:17 notice: update_validation: Upgrading transitional-0.6-style configuration to pacemaker-1.0 with /usr/share/pacemaker/upgrade06.xsl
crm_verify[24558]: 2010/09/08_13:20:17 notice: update_validation: Upgrading pacemaker-1.1-style configuration to pacemaker-1.2 with no-op
crm_verify[24558]: 2010/09/08_13:20:17 notice: update_validation: Upgraded from <none> to pacemaker-1.2 validation
crm_verify[24558]: 2010/09/08_13:20:17 WARN: cluster_status: We do not have quorum - fencing and resource management disabled

With crm I cannot change anything in cib.xml:

crm configure
Signon to CIB failed: connection failed
Init failed, could not perform requested operations
ERROR: cannot parse xml: no element found: line 1, column 0

Installed software and versions:

heartbeat-3.0.3-2.14
libgssglue1-0.1-6.22
libglue2-1.0.6-2.1
cluster-glue-1.0.6-2.1
resource-agents-1.0.3-4.2
pacemaker-1.1.2.1-5.1

My cib.xml and ha-log are attached. I suppose my CIB is wrong. How can I update the old cib.xml? Could someone please point me to the right upgrade sequence/documentation?

Best regards
Nikita Michalko
<?xml version="1.0" ?>
<cib admin_epoch="0" epoch="0" num_updates="0">
  <configuration>
    <crm_config>
      <cluster_property_set id="cib-bootstrap-options">
        <attributes>
          <nvpair id="cib-bootstrap-options-symmetric-cluster" name="symmetric-cluster" value="true"/>
          <nvpair id="cib-bootstrap-options-no-quorum-policy" name="no-quorum-policy" value="stop"/>
          <nvpair id="cib-bootstrap-options-default-resource-stickiness" name="default-resource-stickiness" value="2"/>
          <nvpair id="cib-bootstrap-options-default-resource-failure-stickiness" name="default-resource-failure-stickiness" value="-6"/>
          <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
          <nvpair id="cib-bootstrap-options-stonith-action" name="stonith-action" value="reboot"/>
          <nvpair id="cib-bootstrap-options-startup-fencing" name="startup-fencing" value="true"/>
          <nvpair id="cib-bootstrap-options-stop-orphan-resources" name="stop-orphan-resources" value="true"/>
          <nvpair id="cib-bootstrap-options-stop-orphan-actions" name="stop-orphan-actions" value="true"/>
          <nvpair id="cib-bootstrap-options-remove-after-stop" name="remove-after-stop" value="false"/>
          <nvpair id="cib-bootstrap-options-short-resource-names" name="short-resource-names" value="true"/>
          <nvpair id="cib-bootstrap-options-transition-idle-timeout" name="transition-idle-timeout" value="3min"/>
          <nvpair id="cib-bootstrap-options-default-action-timeout" name="default-action-timeout" value="110s"/>
          <nvpair id="cib-bootstrap-options-is-managed-default" name="is-managed-default" value="true"/>
          <nvpair id="cib-bootstrap-options-cluster-delay" name="cluster-delay" value="60s"/>
          <nvpair id="cib-bootstrap-options-pe-error-series-max" name="pe-error-series-max" value="-1"/>
          <nvpair id="cib-bootstrap-options-pe-warn-series-max" name="pe-warn-series-max" value="-1"/>
          <nvpair id="cib-bootstrap-options-pe-input-series-max" name="pe-input-series-max" value="-1"/>
        </attributes>
      </cluster_property_set>
    </crm_config>
    <nodes/>
    <resources>
      <group id="group_1">
        <primitive class="ocf" id="IPaddr_193_27_40_54" provider="heartbeat" type="IPaddr">
          <operations>
            <op id="IPaddr_193_27_40_54_mon" interval="60s" name="monitor" timeout="60s"/>
          </operations>
          <instance_attributes id="IPaddr_193_27_40_54_inst_attr">
            <attributes>
              <nvpair id="IPaddr_193_27_40_54_attr_0" name="ip" value="193.27.40.54"/>
              <nvpair id="IPaddr_193_27_40_54_attr_1" name="cidr_netmask" value="26"/>
              <nvpair id="IPaddr_193_27_40_54_attr_3" name="broadcast" value="193.27.40.63"/>
            </attributes>
          </instance_attributes>
        </primitive>
        <primitive class="ocf" id="IPaddr_192_168_163_54" provider="heartbeat" type="IPaddr">
          <operations>
            <op id="IPaddr_192_168_163_54_mon" interval="60s" name="monitor" timeout="60s"/>
          </operations>
          <instance_attributes id="IPaddr_192_168_163_54_inst_attr">
            <attributes>
              <nvpair id="IPaddr_192_168_163_54_attr_0" name="ip" value="192.168.163.54"/>
              <nvpair id="IPaddr_192_168_163_54_attr_1" name="cidr_netmask" value="26"/>
              <nvpair id="IPaddr_192_168_163_54_attr_3" name="broadcast" value="192.168.163.63"/>
            </attributes>
          </instance_attributes>
        </primitive>
        <primitive class="lsb" id="ubis_udbmain_3" provider="heartbeat" type="ubis_udbmain">
          <operations>
            <op id="ubis_udbmain_3_mon" interval="120s" name="monitor" timeout="110s"/>
          </operations>
        </primitive>
      </group>
    </resources>
    <constraints>
      <rsc_location id="rsc_location_group_1" rsc="group_1">
        <rule id="prefered_location_group_1" score="1">
          <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="opteron"/>
        </rule>
      </rsc_location>
    </constraints>
  </configuration>
  <status/>
</cib>
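To see at a glance which old-style constructs a file like the one above still carries, something along these lines could be used. It is only a sketch: the function name `summarize_cib` is invented, and I am assuming that a missing `validate-with` attribute on `<cib>` and the `<attributes>` wrapper elements are what mark the configuration as 0.6-style (which would match the "(null)-style" message from crm_verify). The sample in the `__main__` part is a shortened copy of the attached file, not the whole thing.

```python
import xml.etree.ElementTree as ET


def summarize_cib(xml_text):
    """Report a few markers of an old-style CIB (assumed markers, see lead-in)."""
    root = ET.fromstring(xml_text)
    return {
        # Newer configurations name their schema here; None means it is absent.
        "validate_with": root.get("validate-with"),
        # Count of <attributes> wrapper elements anywhere in the tree.
        "attributes_wrappers": len(root.findall(".//attributes")),
        # Resource ids, handy for checking nothing got lost after a conversion.
        "primitives": [p.get("id") for p in root.iter("primitive")],
    }


if __name__ == "__main__":
    sample = """
    <cib admin_epoch="0" epoch="0" num_updates="0">
      <configuration>
        <crm_config>
          <cluster_property_set id="cib-bootstrap-options">
            <attributes>
              <nvpair id="o1" name="stonith-enabled" value="false"/>
            </attributes>
          </cluster_property_set>
        </crm_config>
        <resources>
          <primitive class="lsb" id="ubis_udbmain_3" type="ubis_udbmain"/>
        </resources>
      </configuration>
      <status/>
    </cib>
    """
    print(summarize_cib(sample))
```

Running the same function on the file before and after any conversion attempt would show whether the wrappers are gone and whether all three primitives survived.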
Sep 08 13:25:30 opteron heartbeat: [24661]: info: Pacemaker support: respawn
Sep 08 13:25:30 opteron heartbeat: [24661]: info: Pacemaker support: false
Sep 08 13:25:30 opteron heartbeat: [24661]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
Sep 08 13:25:30 opteron heartbeat: [24661]: info: **************************
Sep 08 13:25:30 opteron heartbeat: [24661]: info: Configuration validated. Starting heartbeat 3.0.2
Sep 08 13:25:30 opteron heartbeat: [24662]: info: heartbeat: version 3.0.2
Sep 08 13:25:31 opteron heartbeat: [24662]: info: Heartbeat generation: 1252594514
Sep 08 13:25:31 opteron heartbeat: [24662]: info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
Sep 08 13:25:31 opteron heartbeat: [24662]: info: glib: ucast: bound send socket to device: eth0
Sep 08 13:25:31 opteron heartbeat: [24662]: info: glib: ucast: bound receive socket to device: eth0
Sep 08 13:25:31 opteron heartbeat: [24662]: info: glib: ucast: started on port 694 interface eth0 to 193.27.40.16
Sep 08 13:25:31 opteron heartbeat: [24662]: info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth1
Sep 08 13:25:31 opteron heartbeat: [24662]: info: glib: ucast: bound send socket to device: eth1
Sep 08 13:25:31 opteron heartbeat: [24662]: info: glib: ucast: bound receive socket to device: eth1
Sep 08 13:25:31 opteron heartbeat: [24662]: info: glib: ucast: started on port 694 interface eth1 to 193.27.40.16
Sep 08 13:25:31 opteron heartbeat: [24662]: info: G_main_add_TriggerHandler: Added signal manual handler
Sep 08 13:25:31 opteron heartbeat: [24662]: info: G_main_add_TriggerHandler: Added signal manual handler
Sep 08 13:25:31 opteron heartbeat: [24662]: info: G_main_add_SignalHandler: Added signal handler for signal 17
Sep 08 13:25:31 opteron heartbeat: [24662]: info: Local status now set to: 'up'
Sep 08 13:27:12 opteron heartbeat: [24662]: info: Comm_now_up(): updating status to active
Sep 08 13:27:12 opteron heartbeat: [24662]: info: Local status now set to: 'active'
Sep 08 13:27:12 opteron heartbeat: [24662]: info: Starting child client "/usr/lib64/heartbeat/ccm" (90,90)
Sep 08 13:27:12 opteron heartbeat: [24662]: info: Starting child client "/usr/lib64/heartbeat/cib" (90,90)
Sep 08 13:27:12 opteron heartbeat: [24706]: info: Starting "/usr/lib64/heartbeat/ccm" as uid 90 gid 90 (pid 24706)
Sep 08 13:27:12 opteron heartbeat: [24662]: info: Starting child client "/usr/lib64/heartbeat/lrmd -r" (0,0)
Sep 08 13:27:12 opteron heartbeat: [24707]: info: Starting "/usr/lib64/heartbeat/cib" as uid 90 gid 90 (pid 24707)
Sep 08 13:27:12 opteron heartbeat: [24662]: info: Starting child client "/usr/lib64/heartbeat/stonithd" (0,0)
Sep 08 13:27:12 opteron heartbeat: [24708]: info: Starting "/usr/lib64/heartbeat/lrmd -r" as uid 0 gid 0 (pid 24708)
Sep 08 13:27:12 opteron heartbeat: [24662]: info: Starting child client "/usr/lib64/heartbeat/attrd" (90,90)
Sep 08 13:27:12 opteron heartbeat: [24709]: info: Starting "/usr/lib64/heartbeat/stonithd" as uid 0 gid 0 (pid 24709)
Sep 08 13:27:12 opteron heartbeat: [24662]: info: Starting child client "/usr/lib64/heartbeat/crmd" (90,90)
Sep 08 13:27:12 opteron heartbeat: [24710]: info: Starting "/usr/lib64/heartbeat/attrd" as uid 90 gid 90 (pid 24710)
Sep 08 13:27:12 opteron heartbeat: [24711]: info: Starting "/usr/lib64/heartbeat/crmd" as uid 90 gid 90 (pid 24711)
Sep 08 13:27:12 opteron lrmd: [24708]: info: G_main_add_SignalHandler: Added signal handler for signal 15
Sep 08 13:27:12 opteron lrmd: [24708]: info: G_main_add_SignalHandler: Added signal handler for signal 17
Sep 08 13:27:12 opteron lrmd: [24708]: info: enabling coredumps
Sep 08 13:27:12 opteron lrmd: [24708]: info: G_main_add_SignalHandler: Added signal handler for signal 10
Sep 08 13:27:12 opteron lrmd: [24708]: info: G_main_add_SignalHandler: Added signal handler for signal 12
Sep 08 13:27:12 opteron lrmd: [24708]: info: Started.
Sep 08 13:27:12 opteron cib: [24707]: info: Invoked: /usr/lib64/heartbeat/cib
Sep 08 13:27:12 opteron attrd: [24710]: info: Invoked: /usr/lib64/heartbeat/attrd
Sep 08 13:27:12 opteron crmd: [24711]: info: Invoked: /usr/lib64/heartbeat/crmd
Sep 08 13:27:12 opteron stonith-ng: [24709]: info: Invoked: /usr/lib64/heartbeat/stonithd
Sep 08 13:27:12 opteron attrd: [24710]: CRIT: get_cluster_type: This installation of Pacemaker does not support the '(null)' cluster infrastructure. Terminating.
Sep 08 13:27:12 opteron cib: [24707]: info: G_main_add_TriggerHandler: Added signal manual handler
Sep 08 13:27:12 opteron cib: [24707]: info: G_main_add_SignalHandler: Added signal handler for signal 17
Sep 08 13:27:12 opteron heartbeat: [24662]: WARN: Managed /usr/lib64/heartbeat/attrd process 24710 exited with return code 100.
Sep 08 13:27:12 opteron stonith-ng: [24709]: info: G_main_add_SignalHandler: Added signal handler for signal 17
Sep 08 13:27:12 opteron stonith-ng: [24709]: CRIT: get_cluster_type: This installation of Pacemaker does not support the '(null)' cluster infrastructure. Terminating.
Sep 08 13:27:12 opteron ccm: [24706]: info: Hostname: opteron
Sep 08 13:27:12 opteron heartbeat: [24662]: WARN: Managed /usr/lib64/heartbeat/stonithd process 24709 exited with return code 100.
Sep 08 13:27:12 opteron crmd: [24711]: info: G_main_add_SignalHandler: Added signal handler for signal 17
Sep 08 13:27:12 opteron cib: [24707]: WARN: validate_cib_digest: No on-disk digest present
Sep 08 13:27:12 opteron cib: [24707]: notice: update_validation: Upgraded from <none> to transitional-0.6 validation
Sep 08 13:27:12 opteron cib: [24707]: notice: readCibXmlFile: Enabling transitional-0.6 validation on the existing (sane) configuration
Sep 08 13:27:12 opteron cib: [24707]: CRIT: get_cluster_type: This installation of Pacemaker does not support the '(null)' cluster infrastructure. Terminating.
Sep 08 13:27:12 opteron heartbeat: [24662]: WARN: Managed /usr/lib64/heartbeat/cib process 24707 exited with return code 100.
Sep 08 13:27:13 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 1 times... pause and retry
Sep 08 13:27:15 opteron ccm: [24706]: info: G_main_add_SignalHandler: Added signal handler for signal 15
Sep 08 13:27:16 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 2 times... pause and retry
Sep 08 13:27:19 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 3 times... pause and retry
Sep 08 13:27:22 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 4 times... pause and retry
Sep 08 13:27:25 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 5 times... pause and retry
Sep 08 13:27:28 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 6 times... pause and retry
Sep 08 13:27:31 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 7 times... pause and retry
Sep 08 13:27:34 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 8 times... pause and retry
Sep 08 13:27:37 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 9 times... pause and retry
Sep 08 13:27:40 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 10 times... pause and retry
Sep 08 13:27:43 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 11 times... pause and retry
Sep 08 13:27:46 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 12 times... pause and retry
Sep 08 13:27:49 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 13 times... pause and retry
Sep 08 13:27:52 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 14 times... pause and retry
Sep 08 13:27:55 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 15 times... pause and retry
Sep 08 13:27:58 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 16 times... pause and retry
Sep 08 13:28:01 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 17 times... pause and retry
Sep 08 13:28:04 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 18 times... pause and retry
Sep 08 13:28:07 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 19 times... pause and retry
Sep 08 13:28:10 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 20 times... pause and retry
Sep 08 13:28:13 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 21 times... pause and retry
Sep 08 13:28:16 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 22 times... pause and retry
Sep 08 13:28:19 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 23 times... pause and retry
Sep 08 13:28:22 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 24 times... pause and retry
Sep 08 13:28:25 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 25 times... pause and retry
Sep 08 13:28:28 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 26 times... pause and retry
Sep 08 13:28:31 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 27 times... pause and retry
Sep 08 13:28:34 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 28 times... pause and retry
Sep 08 13:28:37 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 29 times... pause and retry
Sep 08 13:28:40 opteron crmd: [24711]: WARN: do_cib_control: Couldn't complete CIB registration 30 times... pause and retry
Sep 08 13:28:40 opteron crmd: [24711]: ERROR: do_cib_control: Could not complete CIB registration 30 times... hard error
Sep 08 13:28:40 opteron crmd: [24711]: ERROR: do_log: FSA: Input I_ERROR from do_cib_control() received in state S_STARTING
Sep 08 13:28:40 opteron crmd: [24711]: ERROR: do_recover: Action A_RECOVER (0000000001000000) not supported
Sep 08 13:28:40 opteron crmd: [24711]: CRIT: get_cluster_type: This installation of Pacemaker does not support the '(null)' cluster infrastructure. Terminating.
Sep 08 13:28:40 opteron heartbeat: [24662]: WARN: Managed /usr/lib64/heartbeat/crmd process 24711 exited with return code 100.
Sep 08 13:32:19 opteron heartbeat: [24662]: info: client [/usr/lib64/heartbeat/crmd] is not running.
Sep 08 13:32:19 opteron heartbeat: [24662]: info: client [/usr/lib64/heartbeat/attrd] is not running.
Sep 08 13:32:19 opteron heartbeat: [24662]: info: client [/usr/lib64/heartbeat/stonithd] is not running.
Sep 08 13:32:19 opteron heartbeat: [24662]: info: killing /usr/lib64/heartbeat/lrmd -r process group 24708 with signal 15
Sep 08 13:32:19 opteron lrmd: [24708]: info: lrmd is shutting down
Sep 08 13:32:19 opteron heartbeat: [24662]: info: client [/usr/lib64/heartbeat/cib] is not running.
Sep 08 13:32:19 opteron heartbeat: [24662]: info: killing /usr/lib64/heartbeat/ccm process group 24706 with signal 15
Sep 08 13:32:19 opteron ccm: [24706]: info: received SIGTERM, going to shut down
Sep 08 13:32:20 opteron heartbeat: [24662]: info: killing HBREAD process 24674 with signal 15
Sep 08 13:32:20 opteron heartbeat: [24662]: info: killing HBREAD process 24676 with signal 15
Sep 08 13:32:20 opteron heartbeat: [24662]: info: killing HBWRITE process 24675 with signal 15
Sep 08 13:32:20 opteron heartbeat: [24662]: info: killing HBFIFO process 24672 with signal 15
Sep 08 13:32:20 opteron heartbeat: [24662]: info: killing HBWRITE process 24673 with signal 15
Sep 08 13:32:20 opteron heartbeat: [24662]: info: Core process 24672 exited. 5 remaining
Sep 08 13:32:20 opteron heartbeat: [24662]: info: Core process 24673 exited. 4 remaining
Sep 08 13:32:20 opteron heartbeat: [24662]: info: Core process 24674 exited. 3 remaining
Sep 08 13:32:20 opteron heartbeat: [24662]: info: Core process 24675 exited. 2 remaining
Sep 08 13:32:20 opteron heartbeat: [24662]: info: Core process 24676 exited. 1 remaining
Sep 08 13:32:20 opteron heartbeat: [24662]: info: opteron Heartbeat shutdown complete.
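Since the 30 retry warnings tend to bury the more telling messages, a small filter over the ha-log may help when reading it. This is a rough sketch, assuming the log file has one entry per line in the `date host daemon: [pid]: LEVEL: message` layout seen above; the names `LINE_RE` and `count_serious` are invented for this example.

```python
import re
from collections import Counter

# Layout assumed from the log above: "Sep 08 13:27:12 opteron cib: [24707]: CRIT: ..."
LINE_RE = re.compile(
    r"^(?P<ts>\w{3} \d{2} \d{2}:\d{2}:\d{2}) (?P<host>\S+) "
    r"(?P<daemon>[\w-]+): \[(?P<pid>\d+)\]: (?P<level>\w+): (?P<msg>.*)$"
)


def count_serious(lines, levels=("CRIT", "ERROR")):
    """Count log entries per (daemon, level), keeping only the given levels."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m and m.group("level") in levels:
            counts[(m.group("daemon"), m.group("level"))] += 1
    return counts


if __name__ == "__main__":
    # The path is an assumption; adjust it to wherever ha-log lives on your system.
    with open("/var/log/ha-log") as fh:
        for (daemon, level), n in sorted(count_serious(fh).items()):
            print(f"{daemon:12s} {level:6s} {n}")
```

On the log above this would show the same CRIT `get_cluster_type` message coming from attrd, stonith-ng, cib and crmd, which suggests the CIB registration failures are a consequence of the cib daemon exiting rather than a separate problem.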
