Hello everyone,

I've got a 2-node cluster that exposes iSCSI targets backed by LVM volumes on 
top of a DRBD device.  For the most part I've got everything working as I'd 
like.  Manually moving resources works just fine, either using 'move' or by 
putting a node on standby.  Shutting down the corosync service on one node is 
another story though.  I have an order constraint in place to make iscsi-scst 
shutdown before stopping the LVM volume group but in the logs I see this:

Mar 08 11:43:07 iscsitest2 crmd: [20755]: info: process_lrm_event: LRM 
operation clusterip_stop_0 (call=72, rc=0, cib-update=91, confirmed=true) ok
Mar 08 11:43:07 iscsitest2 crmd: [20755]: info: do_lrm_rsc_op: Performing 
key=88:89:0:1effe13b-3093-4bf7-ae29-f764aaf22933 op=iscsi_target_stop_0 )
Mar 08 11:43:07 iscsitest2 lrmd: [20752]: info: rsc:iscsi_target:73: stop
Mar 08 11:43:07 iscsitest2 lrmd: [24805]: WARN: For LSB init script, no 
additional parameters are needed.
Mar 08 11:43:07 iscsitest2 lrmd: [20752]: info: RA output: 
(iscsi_target:stop:stdout) Stopping iSCSI-SCST target service:
Mar 08 11:43:07 iscsitest2 lrmd: [20752]: info: RA output: 
(iscsi_target:stop:stdout) succeeded.
Mar 08 11:43:07 iscsitest2 lrmd: [20752]: info: RA output: 
(iscsi_target:stop:stdout) Removing iSCSI-SCST target modules:
Mar 08 11:43:07 iscsitest2 crmd: [20755]: info: do_lrm_rsc_op: Performing 
key=51:89:0:1effe13b-3093-4bf7-ae29-f764aaf22933 op=drbd_lvm_stor:1_demote_0 )
Mar 08 11:43:07 iscsitest2 lrmd: [20752]: info: rsc:drbd_lvm_stor:1:74: demote
Mar 08 11:43:07 iscsitest2 lrmd: [20752]: info: RA output: 
(drbd_lvm_stor:1:demote:stderr) 1: State change failed: (-12) Device is held 
open by someone

The first line is fine, the iSCSI target IP should be shutdown first, then the 
target service and its modules are stopped/unloaded.  Next though I see 
corosync trying to demote the DRBD device that sits 'under' the LVM volume 
group, BEFORE it shuts down LVM... why are these things being done out of 
order?  Based on my constraint corosync should:

1.      Shutdown iscsi IP
2.      Shutdown iscsi-scst
3.      Shutdown LVM
4.      Demote drbd device

The order constraint in my configuration looks like this:

order san_startup inf: ms_drbd_lvm_stor:promote lvm_vg0 iscsi_target clusterip

Lastly, I see, near the end of the log:

Mar 08 11:43:11 iscsitest2 lrmd: [20752]: info: RA output: 
(drbd_lvm_stor:1:demote:stdout)

Which, to me, looks like an incomplete line, followed by a number of attrd, 
crmd, stonithd & cib  ERROR messages indicating that the connection to the 
OpenAIS service has been lost.   I suppose this means that corosync doesn't 
wait for proper resource migration before it shuts down which seems very 
strange to me.  Unless I'm missing something here, has anyone else run into 
anything like this?

Beau


_______________________________________________
Openais mailing list
[email protected]
https://lists.linux-foundation.org/mailman/listinfo/openais

Reply via email to