Re: [Linux-HA] Resource Collocation v/s Resource Groups

Andrew Beekhof Mon, 01 Jul 2013 17:14:11 -0700

On 21/06/2013, at 7:01 PM, Parkirat <[email protected]> wrote:


> Hi Andrew,
> 
> Thanks for the reply. I did that experiment again, with Apache and Dummy
> Resource.
> 
> Below is my configuration:
> 
> ========================================================
> [root@prod-hb-nmn-002 ~]# crm configure show
> node $id="208b3c7f-f713-4c06-86c9-24aa948f0859" prod-hb-nmn-001
> node $id="45cbbd64-e048-4e8d-aff4-0f8cbc708b13" prod-hb-nmn-002
> primitive ApacheServer ocf:heartbeat:apache \
>       params configfile="/etc/httpd/conf/httpd.conf" \
>       op monitor interval="60s" timeout="120s" on-fail="restart" \
>       op start interval="0" timeout="40s" \
>       op stop interval="0" timeout="60s" \
>       meta target-role="Started" \
>       meta is-managed="true" \
>       meta resource-stickiness="100" \
>       meta migration-threshold="2" \
>       meta multiple-active="stop_start" \
>       meta failure-timeout="300" \
>       meta allow-migrate="true"
> primitive Dummy ocf:heartbeat:Dummy \
>       op monitor interval="60s" timeout="120s" on-fail="restart" \
>       op start interval="0" timeout="40s" \
>       op stop interval="0" timeout="60s" \
>       meta target-role="Started" \
>       meta is-managed="true" \
>       meta resource-stickiness="100" \
>       meta migration-threshold="2" \
>       meta multiple-active="stop_start" \
>       meta failure-timeout="300" \
>       meta allow-migrate="true"
> group DUMMY-AND-APACHE Dummy ApacheServer
> colocation DUMMY-WITH-APACHE inf: Dummy ApacheServer
> order APACHE-BEFORE-DUMMY inf: ApacheServer Dummy
> property $id="cib-bootstrap-options" \
>       dc-version="1.0.9-89bd754939df5150de7cd76835f98fe90851b677" \

There have been many fixes since 1.0.9 was released 3 years ago.
Can you try something newer first?

>       cluster-infrastructure="Heartbeat" \
>       no-quorum-policy="ignore" \
>       stonith-enabled="false" \
>       expected-quorum-votes="2" \
>       default-resource-stickiness="100"
> ========================================================
> 
> Then stopped the Apache Server manually using /etc/init.d/httpd stop
> and this time monitor did noticed a stopped resource:
> 
> ========================================================
> [root@prod-hb-nmn-002 ~]# crm_mon -1r
> ============
> Last updated: Fri Jun 21 08:11:12 2013
> Stack: Heartbeat
> Current DC: prod-hb-nmn-002 (45cbbd64-e048-4e8d-aff4-0f8cbc708b13) -
> partition with quorum
> Version: 1.0.9-89bd754939df5150de7cd76835f98fe90851b677
> 2 Nodes configured, 2 expected votes
> 1 Resources configured.
> ============
> 
> Online: [ prod-hb-nmn-001 prod-hb-nmn-002 ]
> 
> Full list of resources:
> 
> Resource Group: DUMMY-AND-APACHE
>     Dummy     (ocf::heartbeat:Dummy): Started prod-hb-nmn-002
>     ApacheServer      (ocf::heartbeat:apache):        Started prod-hb-nmn-002 
> FAILED
> 
> Failed actions:
>    ApacheServer_monitor_60000 (node=prod-hb-nmn-002, call=7, rc=7,
> status=complete): not running
> ========================================================
> 
> but the pacemaker failed to do the Transition. I have also attached the full 
> pacemaker-error.log
> <http://linux-ha.996297.n3.nabble.com/file/n14716/pacemaker-error.log>  log
> for pacemaker where it has failed.
> 
> Pacemaker Log:
> ========================================================
> Jun 21 07:42:13 prod-hb-nmn-002 pengine: [26325]: notice: RecurringOp: 
> Start recurring monitor (60s) for ApacheServer on prod-hb-nmn-002
> Jun 21 07:42:13 prod-hb-nmn-002 pengine: [26325]: notice: LogActions:
> Restart resource Dummy    (Started prod-hb-nmn-002)
> Jun 21 07:42:13 prod-hb-nmn-002 pengine: [26325]: notice: LogActions:
> Recover resource ApacheServer     (Started prod-hb-nmn-002)
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: info: do_state_transition:
> State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS
> cause=C_IPC_MESSAGE origin=handle_response ]
> Jun 21 07:42:13 prod-hb-nmn-002 pengine: [26325]: info: process_pe_message:
> Transition 9: PEngine Input stored in: /var/lib/pengine/pe-input-18.bz2
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: info: unpack_graph: Unpacked
> transition 9: 11 actions in 11 synapses
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: info: do_te_invoke:
> Processing graph 9 (ref=pe_calc-dc-1371800533-71) derived from
> /var/lib/pengine/pe-input-18.bz2
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: info: te_pseudo_action:
> Pseudo action 13 fired and confirmed
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: notice: run_graph:
> ====================================================
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: WARN: run_graph: Transition 9
> (Complete=1, Pending=0, Fired=0, Skipped=0, Incomplete=10,
> Source=/var/lib/pengine/pe-input-18.bz2): Terminated
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: ERROR: te_graph_trigger:
> Transition failed: terminated
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: WARN: print_graph: Graph 9
> (11 actions in 11 synapses): batch-limit=30 jobs, network-delay=60000ms
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: WARN: print_graph: Synapse 0
> is pending (priority: 0)
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: WARN: print_elem:     [Action
> 11]: Pending (id: DUMMY-AND-APACHE_start_0, type: pseduo, priority: 0)
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: WARN: print_elem:      *
> [Input 13]: Completed (id: DUMMY-AND-APACHE_stop_0, type: pseduo, priority:
> 0)
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: WARN: print_elem:      *
> [Input 14]: Pending (id: DUMMY-AND-APACHE_stopped_0, type: pseduo, priority:
> 0)
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: WARN: print_graph: Synapse 1
> is pending (priority: 0)
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: WARN: print_elem:     [Action
> 12]: Pending (id: DUMMY-AND-APACHE_running_0, type: pseduo, priority: 0)
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: WARN: print_elem:      *
> [Input 9]: Pending (id: Dummy_start_0, loc: prod-hb-nmn-002, priority: 0)
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: WARN: print_elem:      *
> [Input 10]: Pending (id: ApacheServer_start_0, loc: prod-hb-nmn-002,
> priority: 0)
> Jun 21 07:42:13 prod-hb-nmn-002 crmd: [26205]: WARN: print_elem:      *
> [Input 11]: Pending (id: DUMMY-AND-APACHE_start_0, type: pseduo, priority:
> 0)
> ========================================================
> 
> Regards,
> Parkirat Singh Bagga
> 
> 
> 
> --
> View this message in context: 
> http://linux-ha.996297.n3.nabble.com/Resource-Collocation-v-s-Resource-Groups-tp14699p14716.html
> Sent from the Linux-HA mailing list archive at Nabble.com.
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] Resource Collocation v/s Resource Groups

Reply via email to