Ehlers, Kolja wrote:
I would like to use groups for my resources. But always if I manually stop
one of the resources in the group all resources will be shutted down and
restarted by heartbeat. I have tryed to find any information if this
behaviour is normal, but I have not found anything about it. Only that the
resources are started and stopped sequentially. Is this normal, and can I
prevent that? The log does not tell me anything

at this point I stopped tomcat_21

crmd[32141]: 2008/07/09_11:53:14 info: process_lrm_event: LRM operation
tomcat_21_monitor_5000 (call=99, rc=7) complete
tengine[32148]: 2008/07/09_11:53:14 info: process_graph_event: Action
tomcat_21_monitor_5000 arrived after a completed transition
tengine[32148]: 2008/07/09_11:53:14 info: update_abort_priority: Abort
priority upgraded to 1000000
tengine[32148]: 2008/07/09_11:53:14 WARN: update_failcount: Updating
failcount for tomcat_21 on 3a325e23-2184-46ed-9e88-42a11f28c2be after failed
monitor: rc=7
crmd[32141]: 2008/07/09_11:53:14 info: do_state_transition: State transition
S_IDLE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_IPC_MESSAGE
origin=route_message ]
crmd[32141]: 2008/07/09_11:53:14 info: do_state_transition: All 1 cluster
nodes are eligible to run resources.
pengine[32149]: 2008/07/09_11:53:14 info: determine_online_status: Node
www1test is online
pengine[32149]: 2008/07/09_11:53:14 WARN: unpack_rsc_op: Processing failed
op tomcat_21_monitor_5000 on www1test: Error
pengine[32149]: 2008/07/09_11:53:14 notice: group_print: Resource Group:
group_1
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:
IPaddr_192_168_11_25      (ocf::heartbeat:IPaddr):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     apache_2
(ocf::heartbeat:apache):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: group_print: Resource Group:
group_2
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_21
(ocf::heartbeat:tomcat):        Started www1test FAILED
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_22
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_22sdb
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_30
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_34
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_35
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_36
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_37
(ocf::heartbeat:tomcat):        Started www1test
tengine[32148]: 2008/07/09_11:53:14 info: extract_event: Aborting on
transient_attributes changes for 3a325e23-2184-46ed-9e88-42a11f28c2be
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_38
(ocf::heartbeat:tomcat):        Started www1test
tengine[32148]: 2008/07/09_11:53:14 WARN: notify_crmd: Delaying completion
until all CIB updates complete
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
IPaddr_192_168_11_25   (www1test)
tengine[32148]: 2008/07/09_11:53:14 info: te_update_diff: Aborting on
transient_attributes deletions
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
apache_2       (www1test)
tengine[32148]: 2008/07/09_11:53:14 WARN: notify_crmd: Delaying completion
until all CIB updates complete
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Recover resource
tomcat_21    (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: StopRsc:   www1test Stop
tomcat_21
pengine[32149]: 2008/07/09_11:53:14 notice: StartRsc:  www1test Start
tomcat_21
pengine[32149]: 2008/07/09_11:53:14 notice: RecurringOp: www1test
tomcat_21_monitor_5000
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_22      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_22sdb   (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_30      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_34      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_35      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_36      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_37      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_38      (www1test)
pengine[32149]: 2008/07/09_11:53:14 info: process_pe_message: Transition 10:
PEngine Input stored in: /var/lib/heartbeat/pengine/pe-input-68.bz2
pengine[32149]: 2008/07/09_11:53:14 info: determine_online_status: Node
www1test is online
pengine[32149]: 2008/07/09_11:53:14 WARN: unpack_rsc_op: Processing failed
op tomcat_21_monitor_5000 on www1test: Error
pengine[32149]: 2008/07/09_11:53:14 notice: group_print: Resource Group:
group_1
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:
IPaddr_192_168_11_25      (ocf::heartbeat:IPaddr):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     apache_2
(ocf::heartbeat:apache):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: group_print: Resource Group:
group_2
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_21
(ocf::heartbeat:tomcat):        Started www1test FAILED
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_22
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_22sdb
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_30
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_34
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_35
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_36
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_37
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: native_print:     tomcat_38
(ocf::heartbeat:tomcat):        Started www1test
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
IPaddr_192_168_11_25   (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
apache_2       (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Recover resource
tomcat_21    (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: StopRsc:   www1test Stop
tomcat_21
pengine[32149]: 2008/07/09_11:53:14 notice: StartRsc:  www1test Start
tomcat_21
pengine[32149]: 2008/07/09_11:53:14 notice: RecurringOp: www1test
tomcat_21_monitor_5000
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_22      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_22sdb   (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_30      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_34      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_35      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_36      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_37      (www1test)
pengine[32149]: 2008/07/09_11:53:14 notice: NoRoleChange: Leave resource
tomcat_38      (www1test)
crmd[32141]: 2008/07/09_11:53:14 info: do_state_transition: State transition
S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS
cause=C_IPC_MESSAGE origin=route_message ]
tengine[32148]: 2008/07/09_11:53:14 info: unpack_graph: Unpacked transition
11: 32 actions in 32 synapses
tengine[32148]: 2008/07/09_11:53:14 info: te_pseudo_action: Pseudo action 43
fired and confirmed
tengine[32148]: 2008/07/09_11:53:14 info: send_rsc_command: Initiating
action 39: tomcat_38_stop_0 on www1test
crmd[32141]: 2008/07/09_11:53:14 info: do_lrm_rsc_op: Performing
op=tomcat_38_stop_0 key=39:11:a5a5ae88-f0aa-4e5a-9c45-59cfb6304a70)
lrmd[32138]: 2008/07/09_11:53:14 info: rsc:tomcat_38: stop

and here it is now stopping tomcat_38, tomcat_37 ... the whole group in
reverse order.

Groups are ordered by default. This means: if you stop the first resource in the group, all subsequent resources are stopped before.

If you do not want this and understand the change, you can set
<group id="whatever" ordered="false">

If you post your cib.xml, we might confirm this assumption.

Regards
Dominik
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to