Hello

I'm trying to get this working since two days now, but ldirectord
somehow does not work. Had no problem with it on older Heartbeat 2. Hope
you can give me a hint.


My setup:
- CentOS 5.3
- HA packages from
"http://download.opensuse.org/repositories/server:/ha-clustering/RHEL_
$releasever/":
  - heartbeat-3.0.0-33.2
  - openais-0.80.5-15.1
  - libopenais2-0.80.5-15.1
  - pacemaker-1.0.5-4.1
  - pacemaker-libs-1.0.5-4.1


The goal:
- ldirectord with failover to second node


The current config looks like this:
====================================
crm(live)# configure show
node ovz01.icrcom.ch
node ovz04.icrcom.ch
primitive failover-ip ocf:heartbeat:IPaddr \
        params ip="172.30.101.110" nic="eth0" netmask="24"
broadcast="172.30.101.255" \
        op monitor interval="5s" timeout="15s"
primitive ldirectord_1 ocf:heartbeat:ldirectord \
        params 1="ldirectord.cf" target_role="started" \
        op monitor interval="120s" role="Started" timeout="60s" start_delay="0"
disabled="false"
property $id="cib-bootstrap-options" \
        dc-version="1.0.5-462f1569a43740667daf7b0f6b521742e9eb8fa7" \
        cluster-infrastructure="Heartbeat" \
        symetric-cluster="true" \
        stonith-enabled="false" \
        no-quorum-policy="stop" \
        default-resource-stickiness="0" \
        default-resource-failure-stickiness="0" \
        stop-orphan-actions="true" \
        stop-orphan-resources="true" \
        remove-after-stop="false" \
        short-resource-names="true" \
        transition-idle-timeout="5min" \
        default-action-timeout="15s" \
        is-managed-default="true" \
        expected-quorum-votes="2" \
        last-lrm-refresh="1253609925"
====================================


The IP looks good, but not ldirectord:
====================================
# crm_mon --one-shot

============
Last updated: Tue Sep 22 11:57:06 2009
Stack: openais
Current DC: ovz04.icrcom.ch - partition with quorum
Version: 1.0.5-462f1569a43740667daf7b0f6b521742e9eb8fa7
2 Nodes configured, 2 expected votes
2 Resources configured.
============

Online: [ ovz04.icrcom.ch ovz01.icrcom.ch ]

failover-ip     (ocf::heartbeat:IPaddr):        Started ovz04.icrcom.ch
ldirectord_1    (ocf::heartbeat:ldirectord) Started [   ovz04.icrcom.ch
ovz01.icrcom.ch ]

Failed actions:
    ldirectord_1_monitor_0 (node=ovz04.icrcom.ch, call=3, rc=1,
status=complete): unknown error
    ldirectord_1_stop_0 (node=ovz04.icrcom.ch, call=4, rc=1,
status=complete): unknown error
    ldirectord_1_monitor_0 (node=ovz01.icrcom.ch, call=3, rc=1,
status=complete): unknown error
====================================


>From the logs:
====================================
Sep 22 11:56:40 ovz04 pengine: [12685]: info: determine_online_status:
Node ovz04.icrcom.ch is online
Sep 22 11:56:40 ovz04 pengine: [12685]: info: unpack_rsc_op:
ldirectord_1_monitor_0 on ovz04.icrcom.ch returned 1 (unknown error)
instead of the expected value: 7 (not running)
Sep 22 11:56:40 ovz04 pengine: [12685]: WARN: unpack_rsc_op: Processing
failed op ldirectord_1_monitor_0 on ovz04.icrcom.ch: unknown error
Sep 22 11:56:40 ovz04 pengine: [12685]: info: unpack_rsc_op:
ldirectord_1_stop_0 on ovz04.icrcom.ch returned 1 (unknown error)
instead of the expected value: 0 (ok)
Sep 22 11:56:40 ovz04 crmd: [12686]: info: process_lrm_event: LRM
operation failover-ip_start_0 (call=5, rc=0, cib-update=60,
confirmed=true) complete ok
Sep 22 11:56:40 ovz04 crmd: [12686]: info: match_graph_event: Action
failover-ip_start_0 (6) confirmed on ovz04.icrcom.ch (rc=0)
Sep 22 11:56:40 ovz04 crmd: [12686]: info: run_graph:
====================================================
Sep 22 11:56:40 ovz04 crmd: [12686]: notice: run_graph: Transition 6
(Complete=2, Pending=0, Fired=0, Skipped=1, Incomplete=0,
Source=/var/lib/pengine/pe-warn-336.bz2): Stopped
Sep 22 11:56:40 ovz04 crmd: [12686]: info: te_graph_trigger: Transition
6 is now complete
Sep 22 11:56:40 ovz04 pengine: [12685]: WARN: unpack_rsc_op: Processing
failed op ldirectord_1_stop_0 on ovz04.icrcom.ch: unknown error
Sep 22 11:56:40 ovz04 pengine: [12685]: info: native_add_running:
resource ldirectord_1 isnt managed
Sep 22 11:56:40 ovz04 pengine: [12685]: info: determine_online_status:
Node ovz01.icrcom.ch is online
Sep 22 11:56:40 ovz04 pengine: [12685]: info: unpack_rsc_op:
ldirectord_1_monitor_0 on ovz01.icrcom.ch returned 1 (unknown error)
instead of the expected value: 7 (not running)
Sep 22 11:56:40 ovz04 pengine: [12685]: WARN: unpack_rsc_op: Processing
failed op ldirectord_1_monitor_0 on ovz01.icrcom.ch: unknown error
====================================


Thank you
Urs

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to