Hi Pacemaker guru's

I have installed pacemaker successfully and I have setup a cluster with the
following configuration:

node tsparxdb01
node tsparxdb02
primitive ClusterIP ocf:heartbeat:IPaddr2 \
params ip="192.168.1.254" cidr_netmask="32" \
op monitor interval="30s" \
meta target-role="Started"
primitive Data ocf:linbit:drbd \
params drbd_resource="data" \
op monitor interval="60s" \
op start interval="0" timeout="240s" \
op stop interval="0" timeout="240s"
primitive DataFS ocf:heartbeat:Filesystem \
params device="/dev/drbd/by-res/data" directory="/data" fstype="ext4" \
op start interval="0" timeout="60s" \
op stop interval="0" timeout="60s"
primitive MySQL lsb:mysql \
op monitor interval="60s" timeout="120s" \
op start interval="20" timeout="120s" \
op stop interval="20s" timeout="120s"
group Server-MySQL DataFS MySQL ClusterIP
ms DataMaster Data \
meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1"
notify="true"
colocation DataFS_on_DataMaster inf: Server-MySQL DataMaster:Master
order DataFS-after-DataMaster inf: DataMaster:promote Server-MySQL:start
property $id="cib-bootstrap-options" \
dc-version="1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd" \
cluster-infrastructure="openais" \
expected-quorum-votes="2" \
stonith-enabled="false" \
no-quorum-policy="ignore"
rsc_defaults $id="rsc-options" \
resource-stickiness="100"


Everything except the MySQL resource is working correctly (state 1):

============
Last updated: Wed Jan 19 10:15:51 2011
Stack: openais
Current DC: tsparxdb02 - partition with quorum
Version: 1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd
2 Nodes configured, 2 expected votes
2 Resources configured.
============

Online: [ tsparxdb01 tsparxdb02 ]

 Master/Slave Set: DataMaster
     Masters: [ tsparxdb02 ]
     Slaves: [ tsparxdb01 ]
 Resource Group: Server-MySQL
     DataFS     (ocf::heartbeat:Filesystem):    Started tsparxdb02
     MySQL      (lsb:mysql):    Stopped
     ClusterIP  (ocf::heartbeat:IPaddr2):       Started tsparxdb02


The MySQL resource keeps trying to start, but after a while I will see this
(state 2):

 ============
Last updated: Wed Jan 19 10:17:18 2011
Stack: openais
Current DC: tsparxdb02 - partition with quorum
Version: 1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd
2 Nodes configured, 2 expected votes
2 Resources configured.
============

Online: [ tsparxdb01 tsparxdb02 ]

 Master/Slave Set: DataMaster
     Masters: [ tsparxdb02 ]
     Slaves: [ tsparxdb01 ]
 Resource Group: Server-MySQL
     DataFS     (ocf::heartbeat:Filesystem):    Started tsparxdb02
     MySQL      (lsb:mysql):    Stopped
     ClusterIP  (ocf::heartbeat:IPaddr2):       Started tsparxdb02

Failed actions:
    MySQL_monitor_60000 (node=tsparxdb02, call=62, rc=7, status=complete):
not running

And then it will go back to the previous state 1.

I have tried to use both RA ocf:heartbeat:mysql and lather the lsb:mysql.

My MySQL startup script seems to be LSB compliant since, it gives on all the
start/stop/status commands the correct result 0 or 3.

Anyone any idea what could be wrong. Attached you will find the snippet of
corosync.log that show when the switch between state 1 and state 2 occurs.

Hoping you can help me.

Regards,
Shurbann Martes
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: WARN: action_timer_callback: Timer popped (timeout=120000, abort_level=1000000, complete=false)
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: ERROR: print_elem: Aborting transition, action lost: [Action 41]: In-flight (id: MySQL_monitor_60000, loc: tsparxdb02, priority: 0)
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: abort_transition_graph: action_timer_callback:482 - Triggered transition abort (complete=0) : Action lost
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: WARN: cib_action_update: rsc_op 41: MySQL_monitor_60000 on tsparxdb02 timed out
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: run_graph: ====================================================
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: notice: run_graph: Transition 29 (Complete=11, Pending=0, Fired=0, Skipped=2, Incomplete=0, Source=/var/lib/pengine/pe-input-1834.bz2): Stopped
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: te_graph_trigger: Transition 29 is now complete
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_FSA_INTERNAL origin=notify_crmd ]
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_state_transition: All 2 cluster nodes are eligible to run resources.
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_pe_invoke: Query 230: Requesting the current CIB: S_POLICY_ENGINE
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: process_graph_event: Action MySQL_monitor_60000 arrived after a completed transition
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: abort_transition_graph: process_graph_event:467 - Triggered transition abort (complete=1, tag=lrm_rsc_op, id=MySQL_monitor_60000, magic=2:1;41:29:0:94d8bb73-f8db-45dc-851a-8c00d57b3867, cib=0.361.69) : Inactive graph
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: WARN: update_failcount: Updating failcount for MySQL on tsparxdb02 after failed monitor: rc=1 (update=value++, time=1295446939)
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_pe_invoke: Query 231: Requesting the current CIB: S_POLICY_ENGINE
Jan 19 10:22:19 tsparxdb02 attrd: [10188]: info: attrd_local_callback: Expanded fail-count-MySQL=value++ to 17
Jan 19 10:22:19 tsparxdb02 attrd: [10188]: info: attrd_trigger_update: Sending flush op to all hosts for: fail-count-MySQL (17)
Jan 19 10:22:19 tsparxdb02 attrd: [10188]: info: attrd_perform_update: Sent update 126: fail-count-MySQL=17
Jan 19 10:22:19 tsparxdb02 attrd: [10188]: info: attrd_trigger_update: Sending flush op to all hosts for: last-failure-MySQL (1295446939)
Jan 19 10:22:19 tsparxdb02 attrd: [10188]: info: attrd_perform_update: Sent update 128: last-failure-MySQL=1295446939
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_pe_invoke_callback: Invoking the PE: query=231, ref=pe_calc-dc-1295446939-173, seq=252, quorate=1
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: abort_transition_graph: te_update_diff:146 - Triggered transition abort (complete=1, tag=transient_attributes, id=tsparxdb02, magic=NA, cib=0.361.70) : Transient attribute: update
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: abort_transition_graph: te_update_diff:146 - Triggered transition abort (complete=1, tag=transient_attributes, id=tsparxdb02, magic=NA, cib=0.361.71) : Transient attribute: update
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_pe_invoke: Query 232: Requesting the current CIB: S_POLICY_ENGINE
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_pe_invoke: Query 233: Requesting the current CIB: S_POLICY_ENGINE
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: unpack_config: On loss of CCM Quorum: Ignore
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: determine_online_status: Node tsparxdb02 is online
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: WARN: unpack_rsc_op: Processing failed op MySQL_monitor_60000 on tsparxdb02: unknown error (1)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: determine_online_status: Node tsparxdb01 is online
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: clone_print:  Master/Slave Set: DataMaster
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: short_print:      Masters: [ tsparxdb02 ]
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: short_print:      Slaves: [ tsparxdb01 ]
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: group_print:  Resource Group: Server-MySQL
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: native_print:      DataFS	(ocf::heartbeat:Filesystem):	Started tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: native_print:      MySQL	(lsb:mysql):	Started tsparxdb02 FAILED
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: native_print:      ClusterIP	(ocf::heartbeat:IPaddr2):	Started tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: get_failcount: MySQL has failed 16 times on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: common_apply_stickiness: MySQL can fail 999984 more times on tsparxdb02 before being forced off
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: get_failcount: ClusterIP has failed 2 times on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: common_apply_stickiness: ClusterIP can fail 999998 more times on tsparxdb02 before being forced off
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from DataFS
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from MySQL
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from DataFS
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from MySQL
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: master_color: Promoting Data:0 (Master tsparxdb02)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: master_color: DataMaster: Promoted 1 instances of a possible 1 to master
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: master_color: Promoting Data:0 (Master tsparxdb02)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: master_color: DataMaster: Promoted 1 instances of a possible 1 to master
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataFS: Rolling back scores from MySQL
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring monitor (60s) for MySQL on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring start (20s) for MySQL on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring stop (20s) for MySQL on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring monitor (30s) for ClusterIP on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: LogActions: Leave resource Data:0	(Master tsparxdb02)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: LogActions: Leave resource Data:1	(Slave tsparxdb01)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: LogActions: Leave resource DataFS	(Started tsparxdb02)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: LogActions: Recover resource MySQL	(Started tsparxdb02)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: LogActions: Restart resource ClusterIP	(Started tsparxdb02)
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: handle_response: pe_calc calculation pe_calc-dc-1295446939-173 is obsolete
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_pe_invoke_callback: Invoking the PE: query=233, ref=pe_calc-dc-1295446939-174, seq=252, quorate=1
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: process_pe_message: Transition 30: PEngine Input stored in: /var/lib/pengine/pe-input-1835.bz2
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: unpack_config: On loss of CCM Quorum: Ignore
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: determine_online_status: Node tsparxdb02 is online
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: WARN: unpack_rsc_op: Processing failed op MySQL_monitor_60000 on tsparxdb02: unknown error (1)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: determine_online_status: Node tsparxdb01 is online
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: clone_print:  Master/Slave Set: DataMaster
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: short_print:      Masters: [ tsparxdb02 ]
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: short_print:      Slaves: [ tsparxdb01 ]
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: group_print:  Resource Group: Server-MySQL
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: native_print:      DataFS	(ocf::heartbeat:Filesystem):	Started tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: native_print:      MySQL	(lsb:mysql):	Started tsparxdb02 FAILED
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: native_print:      ClusterIP	(ocf::heartbeat:IPaddr2):	Started tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: get_failcount: MySQL has failed 17 times on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: common_apply_stickiness: MySQL can fail 999983 more times on tsparxdb02 before being forced off
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: get_failcount: ClusterIP has failed 2 times on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: common_apply_stickiness: ClusterIP can fail 999998 more times on tsparxdb02 before being forced off
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from DataFS
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from MySQL
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from DataFS
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from MySQL
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: master_color: Promoting Data:0 (Master tsparxdb02)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: master_color: DataMaster: Promoted 1 instances of a possible 1 to master
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: master_color: Promoting Data:0 (Master tsparxdb02)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: master_color: DataMaster: Promoted 1 instances of a possible 1 to master
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataFS: Rolling back scores from MySQL
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring monitor (60s) for MySQL on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring start (20s) for MySQL on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring stop (20s) for MySQL on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring monitor (30s) for ClusterIP on tsparxdb02
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: LogActions: Leave resource Data:0	(Master tsparxdb02)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: LogActions: Leave resource Data:1	(Slave tsparxdb01)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: LogActions: Leave resource DataFS	(Started tsparxdb02)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: LogActions: Recover resource MySQL	(Started tsparxdb02)
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: notice: LogActions: Restart resource ClusterIP	(Started tsparxdb02)
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: unpack_graph: Unpacked transition 31: 13 actions in 13 synapses
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_te_invoke: Processing graph 31 (ref=pe_calc-dc-1295446939-174) derived from /var/lib/pengine/pe-input-1836.bz2
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: te_pseudo_action: Pseudo action 49 fired and confirmed
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: te_rsc_command: Initiating action 44: stop ClusterIP_stop_0 on tsparxdb02 (local)
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_lrm_rsc_op: Performing key=44:31:0:94d8bb73-f8db-45dc-851a-8c00d57b3867 op=ClusterIP_stop_0 )
Jan 19 10:22:19 tsparxdb02 lrmd: [10187]: info: rsc:ClusterIP:117: stop
Jan 19 10:22:19 tsparxdb02 pengine: [10189]: info: process_pe_message: Transition 31: PEngine Input stored in: /var/lib/pengine/pe-input-1836.bz2
Jan 19 10:22:19 tsparxdb02 lrmd: [10187]: info: RA output: (ClusterIP:stop:stderr) logger: unknown facility name: none.

Jan 19 10:22:19 tsparxdb02 lrmd: [10187]: info: RA output: (ClusterIP:stop:stderr) logger: unknown facility name: none.

Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: process_lrm_event: LRM operation ClusterIP_stop_0 (call=117, rc=0, cib-update=234, confirmed=true) ok
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: match_graph_event: Action ClusterIP_stop_0 (44) confirmed on tsparxdb02 (rc=0)
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: te_rsc_command: Initiating action 1: stop MySQL_stop_0 on tsparxdb02 (local)
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_lrm_rsc_op: Performing key=1:31:0:94d8bb73-f8db-45dc-851a-8c00d57b3867 op=MySQL_stop_0 )
Jan 19 10:22:19 tsparxdb02 lrmd: [10187]: info: rsc:MySQL:118: stop
Jan 19 10:22:19 tsparxdb02 lrmd: [25367]: WARN: For LSB init script, no additional parameters are needed.
Jan 19 10:22:19 tsparxdb02 lrmd: [10187]: info: RA output: (MySQL:stop:stdout)  * MySQL manager or server PID file could not be found!

Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: process_lrm_event: LRM operation MySQL_stop_0 (call=118, rc=0, cib-update=235, confirmed=true) ok
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: match_graph_event: Action MySQL_stop_0 (1) confirmed on tsparxdb02 (rc=0)
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: te_pseudo_action: Pseudo action 50 fired and confirmed
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: te_pseudo_action: Pseudo action 3 fired and confirmed
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: te_pseudo_action: Pseudo action 47 fired and confirmed
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: te_rsc_command: Initiating action 40: start MySQL_start_0 on tsparxdb02 (local)
Jan 19 10:22:19 tsparxdb02 crmd: [10190]: info: do_lrm_rsc_op: Performing key=40:31:0:94d8bb73-f8db-45dc-851a-8c00d57b3867 op=MySQL_start_0 )
Jan 19 10:22:19 tsparxdb02 lrmd: [10187]: info: rsc:MySQL:119: start
Jan 19 10:22:19 tsparxdb02 lrmd: [25385]: WARN: For LSB init script, no additional parameters are needed.
Jan 19 10:22:19 tsparxdb02 lrmd: [10187]: info: RA output: (MySQL:start:stdout) Starting MySQL

Jan 19 10:22:19 tsparxdb02 lrmd: [10187]: info: RA output: (MySQL:start:stdout) .
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: process_lrm_event: LRM operation ClusterIP_monitor_30000 (call=73, rc=7, cib-update=236, confirmed=false) not running
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: process_graph_event: Detected action ClusterIP_monitor_30000 from a different transition: 18 vs. 31
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: abort_transition_graph: process_graph_event:462 - Triggered transition abort (complete=0, tag=lrm_rsc_op, id=ClusterIP_monitor_30000, magic=0:7;1:18:0:94d8bb73-f8db-45dc-851a-8c00d57b3867, cib=0.361.74) : Old event
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: update_abort_priority: Abort priority upgraded from 0 to 1000000
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: update_abort_priority: Abort action done superceeded by restart
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: WARN: update_failcount: Updating failcount for ClusterIP on tsparxdb02 after failed monitor: rc=7 (update=value++, time=1295446940)
Jan 19 10:22:20 tsparxdb02 attrd: [10188]: info: attrd_local_callback: Expanded fail-count-ClusterIP=value++ to 3
Jan 19 10:22:20 tsparxdb02 attrd: [10188]: info: attrd_trigger_update: Sending flush op to all hosts for: fail-count-ClusterIP (3)
Jan 19 10:22:20 tsparxdb02 attrd: [10188]: info: attrd_perform_update: Sent update 130: fail-count-ClusterIP=3
Jan 19 10:22:20 tsparxdb02 attrd: [10188]: info: attrd_trigger_update: Sending flush op to all hosts for: last-failure-ClusterIP (1295446940)
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: abort_transition_graph: te_update_diff:146 - Triggered transition abort (complete=0, tag=transient_attributes, id=tsparxdb02, magic=NA, cib=0.361.75) : Transient attribute: update
Jan 19 10:22:20 tsparxdb02 attrd: [10188]: info: attrd_perform_update: Sent update 132: last-failure-ClusterIP=1295446940
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: abort_transition_graph: te_update_diff:146 - Triggered transition abort (complete=0, tag=transient_attributes, id=tsparxdb02, magic=NA, cib=0.361.76) : Transient attribute: update
Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: RA output: (MySQL:start:stdout)  * 

Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: process_lrm_event: LRM operation MySQL_start_0 (call=119, rc=0, cib-update=237, confirmed=true) ok
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: match_graph_event: Action MySQL_start_0 (40) confirmed on tsparxdb02 (rc=0)
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: run_graph: ====================================================
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: notice: run_graph: Transition 31 (Complete=7, Pending=0, Fired=0, Skipped=6, Incomplete=0, Source=/var/lib/pengine/pe-input-1836.bz2): Stopped
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: te_graph_trigger: Transition 31 is now complete
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_FSA_INTERNAL origin=notify_crmd ]
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: do_state_transition: All 2 cluster nodes are eligible to run resources.
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: do_pe_invoke: Query 238: Requesting the current CIB: S_POLICY_ENGINE
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: do_pe_invoke_callback: Invoking the PE: query=238, ref=pe_calc-dc-1295446940-178, seq=252, quorate=1
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: unpack_config: On loss of CCM Quorum: Ignore
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: determine_online_status: Node tsparxdb02 is online
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: WARN: unpack_rsc_op: Processing failed op MySQL_monitor_60000 on tsparxdb02: unknown error (1)
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: WARN: unpack_rsc_op: Processing failed op ClusterIP_monitor_30000 on tsparxdb02: not running (7)
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: determine_online_status: Node tsparxdb01 is online
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: clone_print:  Master/Slave Set: DataMaster
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: short_print:      Masters: [ tsparxdb02 ]
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: short_print:      Slaves: [ tsparxdb01 ]
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: group_print:  Resource Group: Server-MySQL
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: native_print:      DataFS	(ocf::heartbeat:Filesystem):	Started tsparxdb02
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: native_print:      MySQL	(lsb:mysql):	Started tsparxdb02
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: native_print:      ClusterIP	(ocf::heartbeat:IPaddr2):	Stopped 
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: get_failcount: MySQL has failed 17 times on tsparxdb02
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: common_apply_stickiness: MySQL can fail 999983 more times on tsparxdb02 before being forced off
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: get_failcount: ClusterIP has failed 3 times on tsparxdb02
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: common_apply_stickiness: ClusterIP can fail 999997 more times on tsparxdb02 before being forced off
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from DataFS
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from MySQL
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from DataFS
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataMaster: Rolling back scores from MySQL
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: master_color: Promoting Data:0 (Master tsparxdb02)
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: master_color: DataMaster: Promoted 1 instances of a possible 1 to master
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: master_color: Promoting Data:0 (Master tsparxdb02)
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: master_color: DataMaster: Promoted 1 instances of a possible 1 to master
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: native_merge_weights: DataFS: Rolling back scores from MySQL
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring monitor (60s) for MySQL on tsparxdb02
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring start (20s) for MySQL on tsparxdb02
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring stop (20s) for MySQL on tsparxdb02
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: RecurringOp:  Start recurring monitor (30s) for ClusterIP on tsparxdb02
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: LogActions: Leave resource Data:0	(Master tsparxdb02)
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: LogActions: Leave resource Data:1	(Slave tsparxdb01)
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: LogActions: Leave resource DataFS	(Started tsparxdb02)
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: LogActions: Leave resource MySQL	(Started tsparxdb02)
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: notice: LogActions: Start ClusterIP	(tsparxdb02)
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: unpack_graph: Unpacked transition 32: 7 actions in 7 synapses
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: do_te_invoke: Processing graph 32 (ref=pe_calc-dc-1295446940-178) derived from /var/lib/pengine/pe-input-1837.bz2
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: te_pseudo_action: Pseudo action 46 fired and confirmed
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: te_rsc_command: Initiating action 41: monitor MySQL_monitor_60000 on tsparxdb02 (local)
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: do_lrm_rsc_op: Performing key=41:32:0:94d8bb73-f8db-45dc-851a-8c00d57b3867 op=MySQL_monitor_60000 )
Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: perform_op:2873: operation monitor[76] on lsb::mysql::MySQL for client 10190, its parameters: CRM_meta_name=[monitor] crm_feature_set=[3.0.1] CRM_meta_timeout=[120000] CRM_meta_interval=[60000]  for rsc is already running.
Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: perform_op:2883: postponing all ops on resource MySQL by 1000 ms
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: te_rsc_command: Initiating action 42: start MySQL_start_20000 on tsparxdb02 (local)
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: ERROR: construct_op: Start and Stop actions cannot have an interval: 20000
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: do_lrm_rsc_op: Performing key=42:32:0:94d8bb73-f8db-45dc-851a-8c00d57b3867 op=MySQL_start_0 )
Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: perform_op:2873: operation monitor[76] on lsb::mysql::MySQL for client 10190, its parameters: CRM_meta_name=[monitor] crm_feature_set=[3.0.1] CRM_meta_timeout=[120000] CRM_meta_interval=[60000]  for rsc is already running.
Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: perform_op:2877: operations on resource MySQL already delayed
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: te_rsc_command: Initiating action 43: stop MySQL_stop_20000 on tsparxdb02 (local)
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: ERROR: construct_op: Start and Stop actions cannot have an interval: 20000
Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: cancel_op: operation monitor[120] on lsb::mysql::MySQL for client 10190, its parameters: CRM_meta_name=[monitor] crm_feature_set=[3.0.1] CRM_meta_timeout=[120000] CRM_meta_interval=[60000]  cancelled
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: do_lrm_rsc_op: Performing key=43:32:0:94d8bb73-f8db-45dc-851a-8c00d57b3867 op=MySQL_stop_0 )
Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: perform_op:2873: operation monitor[76] on lsb::mysql::MySQL for client 10190, its parameters: CRM_meta_name=[monitor] crm_feature_set=[3.0.1] CRM_meta_timeout=[120000] CRM_meta_interval=[60000]  for rsc is already running.
Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: perform_op:2877: operations on resource MySQL already delayed
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: te_rsc_command: Initiating action 44: start ClusterIP_start_0 on tsparxdb02 (local)
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: do_lrm_rsc_op: Performing key=44:32:0:94d8bb73-f8db-45dc-851a-8c00d57b3867 op=ClusterIP_start_0 )
Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: rsc:ClusterIP:123: start
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: process_lrm_event: LRM operation MySQL_monitor_60000 (call=120, status=1, cib-update=0, confirmed=true) Cancelled
Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: rsc:MySQL:121: start
Jan 19 10:22:20 tsparxdb02 lrmd: [25603]: WARN: For LSB init script, no additional parameters are needed.
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: process_lrm_event: LRM operation MySQL_monitor_60000 (call=76, rc=0, cib-update=239, confirmed=false) ok
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: process_graph_event: Detected action MySQL_monitor_60000 from a different transition: 20 vs. 32
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: abort_transition_graph: process_graph_event:462 - Triggered transition abort (complete=0, tag=lrm_rsc_op, id=MySQL_monitor_60000, magic=0:0;36:20:0:94d8bb73-f8db-45dc-851a-8c00d57b3867, cib=0.361.78) : Old event
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: update_abort_priority: Abort priority upgraded from 0 to 1000000
Jan 19 10:22:20 tsparxdb02 crmd: [10190]: info: update_abort_priority: Abort action done superceeded by restart
Jan 19 10:22:20 tsparxdb02 pengine: [10189]: info: process_pe_message: Transition 32: PEngine Input stored in: /var/lib/pengine/pe-input-1837.bz2
Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: RA output: (ClusterIP:start:stderr) logger: unknown facility name: none.

Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: RA output: (ClusterIP:start:stderr) logger: unknown facility name: none.

Jan 19 10:22:20 tsparxdb02 lrmd: [10187]: info: RA output: (ClusterIP:start:stderr) logger: unknown facility name: none.

Jan 19 10:22:21 tsparxdb02 lrmd: [10187]: info: rsc:MySQL:122: stop
Jan 19 10:22:21 tsparxdb02 crmd: [10190]: info: process_lrm_event: LRM operation ClusterIP_start_0 (call=123, rc=0, cib-update=240, confirmed=true) ok
Jan 19 10:22:21 tsparxdb02 lrmd: [10187]: info: RA output: (MySQL:start:stdout) Starting MySQL
 * 

Jan 19 10:22:21 tsparxdb02 crmd: [10190]: info: process_lrm_event: LRM operation MySQL_start_0 (call=121, rc=0, cib-update=241, confirmed=true) ok
Jan 19 10:22:21 tsparxdb02 crmd: [10190]: info: match_graph_event: Action ClusterIP_start_0 (44) confirmed on tsparxdb02 (rc=0)
Jan 19 10:22:21 tsparxdb02 lrmd: [25646]: WARN: For LSB init script, no additional parameters are needed.
Jan 19 10:22:21 tsparxdb02 crmd: [10190]: info: match_graph_event: Action MySQL_start_0 (42) confirmed on tsparxdb02 (rc=0)
Jan 19 10:22:21 tsparxdb02 lrmd: [10187]: info: RA output: (MySQL:stop:stdout) Shutting down MySQL

Jan 19 10:22:21 tsparxdb02 lrmd: [10187]: info: RA output: (MySQL:stop:stdout) .
Jan 19 10:22:21 tsparxdb02 lrmd: [10187]: info: perform_op:2873: operation stop[122] on lsb::mysql::MySQL for client 10190, its parameters: CRM_meta_name=[stop] crm_feature_set=[3.0.1] CRM_meta_timeout=[120000] CRM_meta_interval=[20000]  for rsc is already running.
Jan 19 10:22:21 tsparxdb02 lrmd: [10187]: info: perform_op:2883: postponing all ops on resource MySQL by 1000 ms
Jan 19 10:22:22 tsparxdb02 lrmd: [10187]: info: RA output: (MySQL:stop:stdout)  * 

Jan 19 10:22:22 tsparxdb02 crmd: [10190]: info: process_lrm_event: LRM operation MySQL_stop_0 (call=122, rc=0, cib-update=242, confirmed=true) ok
Jan 19 10:22:22 tsparxdb02 crmd: [10190]: info: match_graph_event: Action MySQL_stop_0 (43) confirmed on tsparxdb02 (rc=0)
_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker

Reply via email to