Hi,
I am new to Linux HA.
I am using the 3.0.1 version of heartbeat with the 1.0.6 stable build of
pacemaker on a fresh fc11 system.
I installed everything as per:
http://clusterlabs.org/wiki/Install
# wget -O /etc/yum.repos.d/pacemaker.repo
http://clusterlabs.org/rpm/fedora-11/clusterlabs.repo
# yum install -y pacemaker corosync heartbeat
When I attempted to start heartbeat, it failed as follows:
r...@fc11-2:# /etc/init.d/heartbeat start
/etc/init.d/heartbeat: line 51: /etc/ha.d/shellfuncs: No such file or
directory
I commented out the call to shellfuncs and manually set HA_BIN and HA_NOARCHBIN
as follows:
HA_BIN=/usr/lib64/heartbeat
HA_NOARCHBIN=/usr/share/heartbeat
When I attempted to start heartbeat again I received this error message:
r...@fc11-2:# /etc/init.d/heartbeat start
Starting High-Availability services: /etc/init.d/heartbeat: line 147:
ha_parameter: command not found
/etc/init.d/heartbeat: line 182: ha_parameter: command not found
Done.
However, the crm was started.
Next, I attempted to bring an IPaddr resource online with the command:
r...@fc11-2# crm_resource -r SS-IP1 -p target_role -v started
My configuration information is:
r...@fc11-2# crm configure show
node $id="306cec3f-d583-4748-a4d5-98c8d745ee93" fc11-1
node $id="9f84dba6-90ae-4bc6-bd89-87f8b8c8828d" fc11-2
primitive SS-IP1 ocf:heartbeat:IPaddr \
params ip="172.16.80.250" target_role="started" \
op monitor interval="10s"
property $id="cib-bootstrap-options" \
dc-version="1.0.6-cebe2b6ff49b36b29a3bd7ada1c4701c7470febe" \
cluster-infrastructure="Heartbeat" \
stonith-enabled="false"
The log shows:
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: run_graph:
====================================================
Nov 12 14:01:16 fc11-2 crmd: [5301]: notice: run_graph: Transition 23
(Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0,
Source=/var/lib/pengine/pe-warn-12.bz2): Complete
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: print_graph: ## Empty transition
graph ##
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: te_graph_trigger: Transition 23 is
now complete
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: notify_crmd: Processing transition
completion in state S_TRANSITION_ENGINE
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: notify_crmd: Transition 23 status:
done - <null>
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: s_crmd_fsa: Processing
I_TE_SUCCESS: [ state=S_TRANSITION_ENGINE cause=C_FSA_INTERNAL
origin=notify_crmd ]
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace:
// A_LOG
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: do_state_transition: State
transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS
cause=C_FSA_INTERNAL origin=notify_crmd ]
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: do_state_transition: Starting
PEngine Recheck Timer
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: crm_timer_start: Started PEngine
Recheck Timer (I_PE_CALC:900000ms), src=111
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace:
// A_DC_TIMER_STOP
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace:
// A_INTEGRATE_TIMER_STOP
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace:
// A_FINALIZE_TIMER_STOP
Nov 12 14:01:16 fc11-2 pengine: [5346]: WARN: process_pe_message: Transition
23: WARNINGs found during PE processing. PEngine Input stored in:
/var/lib/pengine/pe-warn-12.bz2
Nov 12 14:01:16 fc11-2 pengine: [5346]: info: process_pe_message: Configuration
WARNINGs found during PE processing. Please run "crm_verify -L" to identify
issues.
Nov 12 14:01:19 fc11-2 cib: [5297]: debug: cib_process_xpath: cib_query:
//cib/configuration/resources//*...@id="SS-IP1"]//meta_attributes//nvpa...@name="target_role"]
does not exist
Nov 12 14:01:19 fc11-2 cib: [5297]: debug: cib_process_xpath: Processing
cib_query op for
//cib/configuration/resources//*...@id="SS-IP1"]//instance_attributes//nvpa...@name="target_role"]
(/cib/configuration/resources/primitive/instance_attributes/nvpair[2])
Nov 12 14:01:19 fc11-2 cib: [5297]: info: cib_process_request: Operation
complete: op cib_modify for section resources (origin=local/crm_resource/5,
version=0.14.11): ok (rc=0)
Nov 12 14:01:19 fc11-2 cib: [5297]: debug: xmlfromIPC: Peer disconnected
Nov 12 14:02:06 fc11-2 cib: [5297]: debug: cib_process_xpath: cib_query:
//cib/configuration/resources//*...@id="SS-IP1"]//meta_attributes//nvpa...@name="target_role"]
does not exist
Nov 12 14:02:06 fc11-2 cib: [5297]: debug: cib_process_xpath: Processing
cib_query op for
//cib/configuration/resources//*...@id="SS-IP1"]//instance_attributes//nvpa...@name="target_role"]
(/cib/configuration/resources/primitive/instance_attributes/nvpair[2])
Nov 12 14:02:06 fc11-2 cib: [5297]: info: cib_process_request: Operation
complete: op cib_modify for section resources (origin=local/crm_resource/5,
version=0.14.11): ok (rc=0)
I then did crm_verify as follows:
r...@fc11-2:# crm_verify -L
crm_verify[13367]: 2009/11/12_14:03:14 ERROR: unpack_rsc_op: Hard error -
SS-IP1_monitor_0 failed with rc=5: Preventing SS-IP1 from re-starting on fc11-2
crm_verify[13367]: 2009/11/12_14:03:14 ERROR: unpack_rsc_op: Hard error -
SS-IP1_monitor_0 failed with rc=5: Preventing SS-IP1 from re-starting on fc11-1
Warnings found during check: config may not be valid
Use -V for more details
I tried to debug this problem by running the /etc/ha.d/resource.d/IPaddr script
manually as follows:
r...@fc11-1:# sh -x /etc/ha.d/resource.d/IPaddr start
+ . /etc/ha.d/resource.d//hto-mapfuncs
++ . /etc/ha.d/shellfuncs
/etc/ha.d/resource.d//hto-mapfuncs: line 9: /etc/ha.d/shellfuncs: No such
file or directory
I commented out line 9 and then ran into this problem:
r...@fc11-1:# sh -x /etc/ha.d/resource.d/IPaddr 172.16.80.250 start
+ . /etc/ha.d/resource.d//hto-mapfuncs
++ OCF_ROOT_DIR=/usr/lib/ocf
++ OCF_RA_DIR=/usr/lib/ocf/resource.d/
++ OCF_RA_VERSION_MAJOR=1
++ export OCF_RA_VERSION_MAJOR
++ OCF_RA_VERSION_MINOR=0
++ export OCF_RA_VERSION_MINOR
++ OCF_ROOT=/usr/lib/ocf
++ export OCF_ROOT
++ LEGAL_ACTIONS='(start|stop|status|usage|meta-data)'
+ '[' 2 '!=' 2 ']'
++ echo 172.16.80.250
++ sed 's%/.*%%'
+ BASEIP=172.16.80.250
+ OCF_RESKEY_ip=172.16.80.250
+ export OCF_RESKEY_ip
++ echo 172.16.80.250
++ sed 's%^172.16.80.250/*%%'
+ str=
+ '[' '!' -z '' ']'
+ case $2 in
+ OCF_TYPE=IPaddr
+ OCF_RESKEY_lvs_support=0
+ OCF_RESOURCE_INSTANCE=IPaddr_172.16.80.250
+ export OCF_TYPE OCF_RESOURCE_INSTANCE OCF_RESKEY_lvs_support
+ ra_execocf start
+ '[' xstart = x ']'
+ . /usr/lib/ocf/resource.d//heartbeat/.ocf-shellfuncs
/etc/ha.d/resource.d//hto-mapfuncs: line 35:
/usr/lib/ocf/resource.d//heartbeat/.ocf-shellfuncs: No such file or directory
I am in the process of debug this problem.
Has anyone else already fixed these problems or other gotchas with heartbeat
3.0.1?
Thanks,
Bob
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems