On Dec 11, 2007, at 5:46 PM, Franck Ganachaud wrote:

If I may add a question: can I disable the start or stop action on a resource by adding the following to a primitive?
<op disabled="true" id="mysql_orb_start" name="start"/>

no.  start is implicit.  sorry.

Franck

Franck Ganachaud a écrit :
Ok, I think I'll have to deal with the start/stop.

2 questions.
- Every time I start heartbeat, the first thing it does is check mysql_orb; if it's running, it stops it and then tries to start it again. Why does heartbeat want to stop the cloned resource if it was previously started? Can I change this behaviour?

probably because it thinks each instance is running on multiple nodes.

try adding globally_unique=false as a meta attribute for the clone
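For reference, in this CIB format a clone meta attribute would look something like the sketch below, assuming your heartbeat version supports a <meta_attributes> block on the clone (the block and nvpair ids here are made up; adjust them to your naming scheme):

```xml
<clone id="MySQL_ORB" interleave="false" is_managed="true" notify="false" ordered="false">
  <meta_attributes id="MySQL_ORB_meta_attr">
    <attributes>
      <!-- hypothetical id; tells the PE the clone instances are interchangeable -->
      <nvpair id="MySQL_ORB_meta_0" name="globally_unique" value="false"/>
    </attributes>
  </meta_attributes>
  ...
</clone>
```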


- When the mysql_orb monitor returns KO on server_a, group_1 isn't relocated to server_b.

I don't understand - it's running on both, isn't it?


I copy the log for the second question.
------------ Log start --------------
Dec 11 14:32:38 server_a mysql_orb[24127]: [24133]: INFO: database is KO
Dec 11 14:32:38 server_a crmd: [15760]: info: process_lrm_event: LRM operation mysql_orb1:0_monitor_30000 (call=27, rc=7) complete
Dec 11 14:32:38 server_a cib: [15756]: info: cib_diff_notify: Update (client: 15760, call:85): 0.316.4076 -> 0.316.4077 (ok)
Dec 11 14:32:38 server_a tengine: [15767]: info: te_update_diff: Processing diff (cib_update): 0.316.4076 -> 0.316.4077
Dec 11 14:32:38 server_a tengine: [15767]: info: process_graph_event: Detected action mysql_orb1:0_monitor_30000 from a different transition: 5 vs. 9
Dec 11 14:32:38 server_a tengine: [15767]: info: update_abort_priority: Abort priority upgraded to 1000000
Dec 11 14:32:38 server_a tengine: [15767]: WARN: update_failcount: Updating failcount for mysql_orb1:0 on ffff098f-d059-4bf2-8aaa-58910ad9d9a0 after failed monitor: rc=7
Dec 11 14:32:38 server_a crmd: [15760]: info: do_state_transition: server_a: State transition S_IDLE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_IPC_MESSAGE origin=route_message ]
Dec 11 14:32:38 server_a crmd: [15760]: info: do_state_transition: All 2 cluster nodes are eligable to run resources.
Dec 11 14:32:38 server_a cib: [15756]: info: cib_diff_notify: Update (client: 15767, call:5): 0.316.4077 -> 0.316.4078 (ok)
Dec 11 14:32:38 server_a tengine: [15767]: info: te_update_diff: Processing diff (cib_modify): 0.316.4077 -> 0.316.4078
Dec 11 14:32:38 server_a tengine: [15767]: info: extract_event: Aborting on transient_attributes changes for ffff098f-d059-4bf2-8aaa-58910ad9d9a0
Dec 11 14:32:38 server_a tengine: [15767]: info: te_update_diff: Aborting on transient_attributes deletions
Dec 11 14:32:38 server_a pengine: [15768]: info: log_data_element: process_pe_message: [generation] <cib admin_epoch="0" cib-last-written="Tue Dec 11 14:08:37 2007" cib_feature_revision="1.3" epoch="316" generated="true" have_quorum="true" ignore_dtd="false" num_peers="2" num_updates="4077" ccm_transition="2" dc_uuid="ffff098f-d059-4bf2-8aaa-58910ad9d9a0"/>
Dec 11 14:32:38 server_a pengine: [15768]: notice: cluster_option: Using default value 'stop' for cluster option 'no-quorum-policy'
Dec 11 14:32:38 server_a pengine: [15768]: notice: cluster_option: Using default value '60s' for cluster option 'cluster-delay'
Dec 11 14:32:38 server_a pengine: [15768]: notice: cluster_option: Using default value '-1' for cluster option 'pe-error-series-max'
Dec 11 14:32:39 server_a pengine: [15768]: notice: cluster_option: Using default value '-1' for cluster option 'pe-warn-series-max'
Dec 11 14:32:39 server_a pengine: [15768]: notice: cluster_option: Using default value '-1' for cluster option 'pe-input-series-max'
Dec 11 14:32:39 server_a pengine: [15768]: notice: cluster_option: Using default value 'true' for cluster option 'startup-fencing'
Dec 11 14:32:39 server_a pengine: [15768]: info: determine_online_status: Node server_a is online
Dec 11 14:32:39 server_a pengine: [15768]: info: determine_online_status: Node server_b is online
Dec 11 14:32:39 server_a pengine: [15768]: info: clone_print: Clone Set: MySQL_ORB
Dec 11 14:32:39 server_a pengine: [15768]: info: native_print: mysql_orb1:0 (heartbeat::ocf:mysql_orb): Started server_a
Dec 11 14:32:39 server_a pengine: [15768]: info: native_print: mysql_orb1:1 (heartbeat::ocf:mysql_orb): Started server_b
Dec 11 14:32:39 server_a pengine: [15768]: info: group_print: Resource Group: group_1
Dec 11 14:32:39 server_a pengine: [15768]: info: native_print: IPaddr_192_168_87_100 (heartbeat::ocf:IPaddr): Started server_a
Dec 11 14:32:39 server_a pengine: [15768]: info: native_print: apache_2 (heartbeat::ocf:apache): Started server_a
Dec 11 14:32:39 server_a cib: [24134]: info: write_cib_contents: Wrote version 0.316.4078 of the CIB to disk (digest: 47227e2fc6e2d490dda64562d69194ac)
Dec 11 14:32:39 server_a pengine: [15768]: notice: NoRoleChange: Leave resource mysql_orb1:0 (server_a)
Dec 11 14:32:39 server_a pengine: [15768]: notice: NoRoleChange: Leave resource mysql_orb1:1 (server_b)
Dec 11 14:32:39 server_a pengine: [15768]: info: native_color: Combine scores from apache_2 and IPaddr_192_168_87_100
Dec 11 14:32:39 server_a pengine: [15768]: notice: NoRoleChange: Leave resource IPaddr_192_168_87_100 (server_a)
Dec 11 14:32:40 server_a pengine: [15768]: notice: NoRoleChange: Leave resource apache_2 (server_a)
------------ Log stop --------------


Andrew Beekhof a écrit :

On Dec 11, 2007, at 12:00 PM, Franck Ganachaud wrote:

Thanks Andrew

I changed the configuration to this:

  <resources>
    <clone id="MySQL_ORB" interleave="false" is_managed="true" notify="false" ordered="false">
      <instance_attributes id="MySQL_ORB_inst_attr">
        <attributes>
          <nvpair id="MySQL_ORB_attr_0" name="clone_max" value="2"/>
          <nvpair id="MySQL_ORB_attr_1" name="clone_node_max" value="1"/>
        </attributes>
      </instance_attributes>
      <primitive class="ocf" id="mysql_orb1" is_managed="false" provider="heartbeat" type="mysql_orb">
        <operations>
          <op id="mysql_orb_mon" interval="30s" name="monitor" on_fail="stop" timeout="30s"/>
        </operations>
      </primitive>
    </clone>
    <group id="group_1" restart_type="restart">
      <primitive class="ocf" id="IPaddr_Cluster" provider="heartbeat" type="IPaddr">
        <operations>
          <op id="IPaddr_Cluster_mon" interval="5s" name="monitor" timeout="5s"/>
        </operations>
        <instance_attributes id="IPaddr_Cluster_inst_attr">
          <attributes>
            <nvpair id="IPaddr_Cluster_attr_0" name="ip" value="192.168.87.100"/>
          </attributes>
        </instance_attributes>
      </primitive>
      <primitive class="ocf" id="apache_2" provider="heartbeat" type="apache">
        <operations>
          <op id="apache_2_mon" interval="30s" name="monitor" timeout="30s"/>
        </operations>
        <instance_attributes id="apache_2_inst_attr">
          <attributes>
            <nvpair id="apache_2_attr_0" name="configfile" value="/usr/local/apache/conf/httpd.conf"/>
          </attributes>
        </instance_attributes>
      </primitive>
    </group>
  </resources>
  <constraints>
    <rsc_location id="rsc_location_group_1" rsc="group_1">
      <rule id="prefered_location_group_1" score="100">
        <expression attribute="#uname" id="prefered_location_group_1_expr" operation="eq" value="server_a"/>
      </rule>
    </rsc_location>
    <rsc_colocation from="group_1" id="web_if_mysql" score="INFINITY" to="MySQL_ORB"/>
  </constraints>


With this configuration, DB monitoring no longer occurs. As for the is_managed parameter, I don't know what to set on the clone versus the primitive inside it to get the behaviour I want: don't start/stop the resource at startup, just monitor it.

hmmm... I forgot we don't start recurring monitor actions for unmanaged resources.

If the monitor was already running, we wouldn't stop it when you changed the resource to unmanaged... but that doesn't help you :-(
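For anyone reading along: the recurring monitor in the posted config is only scheduled while the resource is managed, so a sketch of the managed variant (same ids as the config above, only is_managed flipped) would be:

```xml
<!-- managed, so the recurring monitor op gets scheduled by the cluster -->
<primitive class="ocf" id="mysql_orb1" is_managed="true" provider="heartbeat" type="mysql_orb">
  <operations>
    <op id="mysql_orb_mon" interval="30s" name="monitor" on_fail="stop" timeout="30s"/>
  </operations>
</primitive>
```

The trade-off is the one described above: with the resource managed, the cluster will also start and stop the clone itself.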
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems



