Hi,

I am new to Linux HA.

I am using heartbeat 3.0.1 with the stable 1.0.6 build of pacemaker on a
fresh Fedora 11 (fc11) system.

I installed everything as per:

        http://clusterlabs.org/wiki/Install

        # wget -O /etc/yum.repos.d/pacemaker.repo \
              http://clusterlabs.org/rpm/fedora-11/clusterlabs.repo
        # yum install -y pacemaker corosync heartbeat

When I attempted to start heartbeat, it failed as follows:

        r...@fc11-2:# /etc/init.d/heartbeat start
        /etc/init.d/heartbeat: line 51: /etc/ha.d/shellfuncs: No such file or directory

I commented out the line that sources shellfuncs and manually set HA_BIN and
HA_NOARCHBIN as follows:

       HA_BIN=/usr/lib64/heartbeat
       HA_NOARCHBIN=/usr/share/heartbeat

When I attempted to start heartbeat again I received this error message:

        r...@fc11-2:#  /etc/init.d/heartbeat start
        Starting High-Availability services: /etc/init.d/heartbeat: line 147: ha_parameter: command not found
        /etc/init.d/heartbeat: line 182: ha_parameter: command not found
        Done.

However, the crm was started.
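
To see at a glance which of the paths mentioned so far actually exist on a node, something like the following could be run. This is only a minimal sketch: report_missing is a helper name I made up for illustration, and the paths are simply the ones quoted above, not an authoritative list of what the heartbeat package should install.

```shell
#!/bin/sh
# report_missing is a hypothetical helper, not part of heartbeat or pacemaker.
# It prints "missing: <path>" for every argument that does not exist and
# returns 0 only if all of the paths are present.
report_missing() {
    rc=0
    for f in "$@"; do
        if [ ! -e "$f" ]; then
            echo "missing: $f"
            rc=1
        fi
    done
    return $rc
}

# Paths taken from the error message and the manual settings above.
report_missing \
    /etc/ha.d/shellfuncs \
    /usr/lib64/heartbeat \
    /usr/share/heartbeat \
    || echo "some expected paths are not present"
```

Running it on both nodes would show whether the two installations differ.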

Next, I attempted to bring an IPaddr resource online with the command:

        r...@fc11-2# crm_resource -r SS-IP1 -p target_role -v started

My configuration information is:

        r...@fc11-2# crm configure show
        node $id="306cec3f-d583-4748-a4d5-98c8d745ee93" fc11-1
        node $id="9f84dba6-90ae-4bc6-bd89-87f8b8c8828d" fc11-2
        primitive SS-IP1 ocf:heartbeat:IPaddr \
              params ip="172.16.80.250" target_role="started" \
              op monitor interval="10s"
        property $id="cib-bootstrap-options" \
              dc-version="1.0.6-cebe2b6ff49b36b29a3bd7ada1c4701c7470febe" \
              cluster-infrastructure="Heartbeat" \
              stonith-enabled="false"

The log shows:

Nov 12 14:01:16 fc11-2 crmd: [5301]: info: run_graph: ====================================================
Nov 12 14:01:16 fc11-2 crmd: [5301]: notice: run_graph: Transition 23 (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pengine/pe-warn-12.bz2): Complete
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: print_graph: ## Empty transition graph ##
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: te_graph_trigger: Transition 23 is now complete
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: notify_crmd: Processing transition completion in state S_TRANSITION_ENGINE
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: notify_crmd: Transition 23 status: done - <null>
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: s_crmd_fsa: Processing I_TE_SUCCESS: [ state=S_TRANSITION_ENGINE cause=C_FSA_INTERNAL origin=notify_crmd ]
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace:       // A_LOG
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS cause=C_FSA_INTERNAL origin=notify_crmd ]
Nov 12 14:01:16 fc11-2 crmd: [5301]: info: do_state_transition: Starting PEngine Recheck Timer
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: crm_timer_start: Started PEngine Recheck Timer (I_PE_CALC:900000ms), src=111
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace:       // A_DC_TIMER_STOP
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace:       // A_INTEGRATE_TIMER_STOP
Nov 12 14:01:16 fc11-2 crmd: [5301]: debug: do_fsa_action: actions:trace:       // A_FINALIZE_TIMER_STOP
Nov 12 14:01:16 fc11-2 pengine: [5346]: WARN: process_pe_message: Transition 23: WARNINGs found during PE processing. PEngine Input stored in: /var/lib/pengine/pe-warn-12.bz2
Nov 12 14:01:16 fc11-2 pengine: [5346]: info: process_pe_message: Configuration WARNINGs found during PE processing.  Please run "crm_verify -L" to identify issues.
Nov 12 14:01:19 fc11-2 cib: [5297]: debug: cib_process_xpath: cib_query: //cib/configuration/resources//*...@id="SS-IP1"]//meta_attributes//nvpa...@name="target_role"] does not exist
Nov 12 14:01:19 fc11-2 cib: [5297]: debug: cib_process_xpath: Processing cib_query op for //cib/configuration/resources//*...@id="SS-IP1"]//instance_attributes//nvpa...@name="target_role"] (/cib/configuration/resources/primitive/instance_attributes/nvpair[2])
Nov 12 14:01:19 fc11-2 cib: [5297]: info: cib_process_request: Operation complete: op cib_modify for section resources (origin=local/crm_resource/5, version=0.14.11): ok (rc=0)
Nov 12 14:01:19 fc11-2 cib: [5297]: debug: xmlfromIPC: Peer disconnected
Nov 12 14:02:06 fc11-2 cib: [5297]: debug: cib_process_xpath: cib_query: //cib/configuration/resources//*...@id="SS-IP1"]//meta_attributes//nvpa...@name="target_role"] does not exist
Nov 12 14:02:06 fc11-2 cib: [5297]: debug: cib_process_xpath: Processing cib_query op for //cib/configuration/resources//*...@id="SS-IP1"]//instance_attributes//nvpa...@name="target_role"] (/cib/configuration/resources/primitive/instance_attributes/nvpair[2])
Nov 12 14:02:06 fc11-2 cib: [5297]: info: cib_process_request: Operation complete: op cib_modify for section resources (origin=local/crm_resource/5, version=0.14.11): ok (rc=0)

I then ran crm_verify as follows:

     r...@fc11-2:# crm_verify -L
     crm_verify[13367]: 2009/11/12_14:03:14 ERROR: unpack_rsc_op: Hard error - SS-IP1_monitor_0 failed with rc=5: Preventing SS-IP1 from re-starting on fc11-2
     crm_verify[13367]: 2009/11/12_14:03:14 ERROR: unpack_rsc_op: Hard error - SS-IP1_monitor_0 failed with rc=5: Preventing SS-IP1 from re-starting on fc11-1
     Warnings found during check: config may not be valid
       Use -V for more details
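
As far as I can tell, the rc=5 reported for SS-IP1_monitor_0 is the OCF return code OCF_ERR_INSTALLED, which would be consistent with files missing from the installation. For reference, here is a small sketch that maps the standard OCF return codes to their names. The function name ocf_rc_name is my own and is not something shipped with the resource agents.

```shell
#!/bin/sh
# ocf_rc_name is a hypothetical helper for reading logs; it maps a numeric
# OCF resource agent return code to its symbolic name.
ocf_rc_name() {
    case "$1" in
        0) echo OCF_SUCCESS ;;
        1) echo OCF_ERR_GENERIC ;;
        2) echo OCF_ERR_ARGS ;;
        3) echo OCF_ERR_UNIMPLEMENTED ;;
        4) echo OCF_ERR_PERM ;;
        5) echo OCF_ERR_INSTALLED ;;
        6) echo OCF_ERR_CONFIGURED ;;
        7) echo OCF_NOT_RUNNING ;;
        8) echo OCF_RUNNING_MASTER ;;
        9) echo OCF_FAILED_MASTER ;;
        *) echo UNKNOWN ;;
    esac
}

# The code reported by crm_verify above.
ocf_rc_name 5
```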

I tried to debug this problem by running the /etc/ha.d/resource.d/IPaddr script 
manually as follows:

      r...@fc11-1:# sh -x /etc/ha.d/resource.d/IPaddr start
      + . /etc/ha.d/resource.d//hto-mapfuncs
      ++ . /etc/ha.d/shellfuncs
      /etc/ha.d/resource.d//hto-mapfuncs: line 9: /etc/ha.d/shellfuncs: No such file or directory

I commented out line 9 and then ran into this problem:

     r...@fc11-1:# sh -x /etc/ha.d/resource.d/IPaddr 172.16.80.250 start
     + . /etc/ha.d/resource.d//hto-mapfuncs
     ++ OCF_ROOT_DIR=/usr/lib/ocf
     ++ OCF_RA_DIR=/usr/lib/ocf/resource.d/
     ++ OCF_RA_VERSION_MAJOR=1
     ++ export OCF_RA_VERSION_MAJOR
     ++ OCF_RA_VERSION_MINOR=0
     ++ export OCF_RA_VERSION_MINOR
     ++ OCF_ROOT=/usr/lib/ocf
     ++ export OCF_ROOT
     ++ LEGAL_ACTIONS='(start|stop|status|usage|meta-data)'
     + '[' 2 '!=' 2 ']'
     ++ echo 172.16.80.250
     ++ sed 's%/.*%%'
     + BASEIP=172.16.80.250
     + OCF_RESKEY_ip=172.16.80.250
     + export OCF_RESKEY_ip
     ++ echo 172.16.80.250
     ++ sed 's%^172.16.80.250/*%%'
     + str=
     + '[' '!' -z '' ']'
     + case $2 in
     + OCF_TYPE=IPaddr
     + OCF_RESKEY_lvs_support=0
     + OCF_RESOURCE_INSTANCE=IPaddr_172.16.80.250
     + export OCF_TYPE OCF_RESOURCE_INSTANCE OCF_RESKEY_lvs_support
     + ra_execocf start
     + '[' xstart = x ']'
     + . /usr/lib/ocf/resource.d//heartbeat/.ocf-shellfuncs
     /etc/ha.d/resource.d//hto-mapfuncs: line 35: /usr/lib/ocf/resource.d//heartbeat/.ocf-shellfuncs: No such file or directory
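
Since each run stops at the first missing file, it may help to pull the missing paths out of the trace output automatically. The following is a rough sketch that assumes the error lines have the "<script>: line N: <path>: No such file or directory" shape shown above. The name missing_from_trace is made up for illustration.

```shell
#!/bin/sh
# missing_from_trace is a hypothetical helper. It reads shell error output on
# stdin and prints only the path from lines of the form
#   <script>: line N: <path>: No such file or directory
missing_from_trace() {
    sed -n 's/^.*: line [0-9][0-9]*: \(.*\): No such file or directory$/\1/p'
}

# Demonstration using the error line from the trace above.
missing_from_trace <<'EOF'
     + . /usr/lib/ocf/resource.d//heartbeat/.ocf-shellfuncs
     /etc/ha.d/resource.d//hto-mapfuncs: line 35: /usr/lib/ocf/resource.d//heartbeat/.ocf-shellfuncs: No such file or directory
EOF
```

On a real node the input would come from something like "sh -x /etc/ha.d/resource.d/IPaddr 172.16.80.250 start 2>&1 | missing_from_trace".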

I am in the process of debugging this problem.

Has anyone else already fixed these problems, or run into other gotchas with
heartbeat 3.0.1?


Thanks,

Bob


      