Hi,

No, I think you've missed the point. RA did not answer at all. Monitor actions had been lost due to a cluster transition:

Mar 1 11:16:00 wapgw1-log crmd: [24547]: info: do_lrm_rsc_op: Performing key=33:1353:7:22dc5497-478f-49ff-b07f-9fcd6da325cd op=p-drbd-mdirect1-1:0_monitor_0 ) Mar 1 11:16:00 wapgw1-log crmd: [24547]: info: do_lrm_rsc_op: Discarding attempt to perform action monitor on p-drbd-mdirect1-1:0 in state S_ELECTION Mar 1 11:16:00 wapgw1-log crmd: [24547]: info: send_direct_ack: ACK'ing resource op p-drbd-mdirect1-1:0_monitor_0 from 33:1353:7:22dc5497-478f-49ff-b07f-9fcd6da325cd: lrm_invoke-lrmd-1298967360-58 Mar 1 11:16:00 wapgw1-log crmd: [24547]: info: process_te_message: Processing (N)ACK lrm_invoke-lrmd-1298967360-58 from wapgw1-log Mar 1 11:16:00 wapgw1-log crmd: [24547]: info: process_graph_event: Action p-drbd-mdirect1-1:0_monitor_0/33 (4:99;33:1353:7:22dc5497-478f-49ff-b07f-9fcd6da325cd) initiated by a different transitioner Mar 1 11:16:00 wapgw1-log crmd: [24547]: info: abort_transition_graph: process_graph_event:456 - Triggered transition abort (complete=1, tag=lrm_rsc_op, id=p-drbd-mdirect1-1:0_monitor_0, magic=4:99;33:1353:7:22dc5497-478f-49ff-b07f-9fcd6da325cd) : Foreign event

So, RA had not have a chance to answer anything.

Apart from this, should I fake all RA's which are supposed to be unused on the particular nodes in the cluster? It seemes to me like a partial solution only.

Suppose that I want to use Virtual machine "X" on hardware nodes A and B, and VM "Y" on nodes B and C. Using DRBD, this is very common configuration, because "X" cannot access it's disk device on hardware node "C". Currently, I must configure "X" and "Y" on every hardware node, or RA will fail with status "not configured". It's not minimalistic configuration, so it is more error prone than needed.

I would be happy to tell the cluster never to touch resource "X" on node C in this case. What do you think?


10.03.2011 14:09, Andrew Beekhof wrote:
Your basic problem is this...

Mar  1 11:17:21 wapgw1-2 pengine: [5748]: WARN: unpack_rsc_op:
Processing failed op vm-mproxy1-1_monitor_0 on wapgw1-log: unknown
error (1)

We asked what state the resource was in and it replied "arrrggghhhh"
instead of "not installed".
Had it replied with not installed, we'd have no reason to call stop or
fence the node to try and clean it up.


--
Pavel Levshin //flicker


_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker

Reply via email to