Hello Dejan

On Tue, 2009-09-22 at 12:57 +0200, Dejan Muhamedagic wrote:
> Hi,
> 
> On Tue, Sep 22, 2009 at 12:06:48PM +0200, Urs Weiss wrote:
> > Hello
> > 
> > I'm trying to get this working since two days now, but ldirectord
> > somehow does not work. Had no problem with it on older Heartbeat 2. Hope
> > you can give me a hint.
> > 
> > 
> > My setup:
> > - CentOS 5.3
> > - HA packages from
> > "http://download.opensuse.org/repositories/server:/ha-clustering/RHEL_
> > $releasever/":
> >   - heartbeat-3.0.0-33.2
> >   - openais-0.80.5-15.1
> >   - libopenais2-0.80.5-15.1
> >   - pacemaker-1.0.5-4.1
> >   - pacemaker-libs-1.0.5-4.1
> > 
> > 
> > The goal:
> > - ldirectord with failover to second node
> > 
> > 
> > The current config looks like this:
> > ====================================
> > crm(live)# configure show
> > node ovz01.icrcom.ch
> > node ovz04.icrcom.ch
> > primitive failover-ip ocf:heartbeat:IPaddr \
> >     params ip="172.30.101.110" nic="eth0" netmask="24"
> > broadcast="172.30.101.255" \
> >     op monitor interval="5s" timeout="15s"
> > primitive ldirectord_1 ocf:heartbeat:ldirectord \
> >     params 1="ldirectord.cf" target_role="started" \
> >     op monitor interval="120s" role="Started" timeout="60s" start_delay="0"
> > disabled="false"
> > property $id="cib-bootstrap-options" \
> >     dc-version="1.0.5-462f1569a43740667daf7b0f6b521742e9eb8fa7" \
> >     cluster-infrastructure="Heartbeat" \
> >     symetric-cluster="true" \
> >     stonith-enabled="false" \
> >     no-quorum-policy="stop" \
> >     default-resource-stickiness="0" \
> >     default-resource-failure-stickiness="0" \
> >     stop-orphan-actions="true" \
> >     stop-orphan-resources="true" \
> >     remove-after-stop="false" \
> >     short-resource-names="true" \
> >     transition-idle-timeout="5min" \
> >     default-action-timeout="15s" \
> >     is-managed-default="true" \
> >     expected-quorum-votes="2" \
> >     last-lrm-refresh="1253609925"
> > ====================================
> > 
> > 
> > The IP looks good, but not ldirectord:
> > ====================================
> > # crm_mon --one-shot
> > 
> > ============
> > Last updated: Tue Sep 22 11:57:06 2009
> > Stack: openais
> > Current DC: ovz04.icrcom.ch - partition with quorum
> > Version: 1.0.5-462f1569a43740667daf7b0f6b521742e9eb8fa7
> > 2 Nodes configured, 2 expected votes
> > 2 Resources configured.
> > ============
> > 
> > Online: [ ovz04.icrcom.ch ovz01.icrcom.ch ]
> > 
> > failover-ip (ocf::heartbeat:IPaddr):        Started ovz04.icrcom.ch
> > ldirectord_1        (ocf::heartbeat:ldirectord) Started [   ovz04.icrcom.ch
> > ovz01.icrcom.ch ]
> > 
> > Failed actions:
> >     ldirectord_1_monitor_0 (node=ovz04.icrcom.ch, call=3, rc=1,
> > status=complete): unknown error
> >     ldirectord_1_stop_0 (node=ovz04.icrcom.ch, call=4, rc=1,
> > status=complete): unknown error
> >     ldirectord_1_monitor_0 (node=ovz01.icrcom.ch, call=3, rc=1,
> > status=complete): unknown error
> > ====================================
> > 
> > 
> > >From the logs:
> > ====================================
> > Sep 22 11:56:40 ovz04 pengine: [12685]: info: determine_online_status:
> > Node ovz04.icrcom.ch is online
> > Sep 22 11:56:40 ovz04 pengine: [12685]: info: unpack_rsc_op:
> > ldirectord_1_monitor_0 on ovz04.icrcom.ch returned 1 (unknown error)
> > instead of the expected value: 7 (not running)
> > Sep 22 11:56:40 ovz04 pengine: [12685]: WARN: unpack_rsc_op: Processing
> > failed op ldirectord_1_monitor_0 on ovz04.icrcom.ch: unknown error
> > Sep 22 11:56:40 ovz04 pengine: [12685]: info: unpack_rsc_op:
> > ldirectord_1_stop_0 on ovz04.icrcom.ch returned 1 (unknown error)
> > instead of the expected value: 0 (ok)
> > Sep 22 11:56:40 ovz04 crmd: [12686]: info: process_lrm_event: LRM
> > operation failover-ip_start_0 (call=5, rc=0, cib-update=60,
> > confirmed=true) complete ok
> > Sep 22 11:56:40 ovz04 crmd: [12686]: info: match_graph_event: Action
> > failover-ip_start_0 (6) confirmed on ovz04.icrcom.ch (rc=0)
> > Sep 22 11:56:40 ovz04 crmd: [12686]: info: run_graph:
> > ====================================================
> > Sep 22 11:56:40 ovz04 crmd: [12686]: notice: run_graph: Transition 6
> > (Complete=2, Pending=0, Fired=0, Skipped=1, Incomplete=0,
> > Source=/var/lib/pengine/pe-warn-336.bz2): Stopped
> > Sep 22 11:56:40 ovz04 crmd: [12686]: info: te_graph_trigger: Transition
> > 6 is now complete
> > Sep 22 11:56:40 ovz04 pengine: [12685]: WARN: unpack_rsc_op: Processing
> > failed op ldirectord_1_stop_0 on ovz04.icrcom.ch: unknown error
> > Sep 22 11:56:40 ovz04 pengine: [12685]: info: native_add_running:
> > resource ldirectord_1 isnt managed
> > Sep 22 11:56:40 ovz04 pengine: [12685]: info: determine_online_status:
> > Node ovz01.icrcom.ch is online
> > Sep 22 11:56:40 ovz04 pengine: [12685]: info: unpack_rsc_op:
> > ldirectord_1_monitor_0 on ovz01.icrcom.ch returned 1 (unknown error)
> > instead of the expected value: 7 (not running)
> > Sep 22 11:56:40 ovz04 pengine: [12685]: WARN: unpack_rsc_op: Processing
> > failed op ldirectord_1_monitor_0 on ovz01.icrcom.ch: unknown error
> > ====================================
> 
> Look for 'lrmd.*ldirector' on all nodes where it failed. That
> should show you what's happening with the resource.
> 

Ouch! The log i didn't looked at (ldirectord.log) ....
Did not found the ldirectord.cf . Oh man, thank you. Simple error,
stupid admin....

But one more (simple) question:
The IP and ldirectord are running now, but not at the some node. What
they have to of course in this setup. How can i make them always running
on the same node?

Thank you
Urs


> Thanks,
> 
> Dejan
> 
> > 
> > 
> > Thank you
> > Urs
> > 
> > _______________________________________________
> > Linux-HA mailing list
> > [email protected]
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to