We are having trouble replacing SBD devices on a live cluster running SLES 12 with:

  pacemaker 1.1.12-7.1
  sbd 1.2.1-8.7
  corosync 2.3.3-7.12

We sometimes see the errors below. We are not sure which device is "device 4" or why sbd thinks the header is bad. The replacement sometimes works fine, so the problem seems transient.

  Aug 03 11:04:38 usrv-tsegp1 sbd[2987]: [2987]: info: Watchdog enabled.
  Aug 03 11:04:38 usrv-tsegp1 sbd[2987]: [2987]: ERROR: Header magic does not match.
  Aug 03 11:04:38 usrv-tsegp1 sbd[2987]: [2987]: ERROR: header on device 4 is not valid.

We also see dependency errors in pacemaker, where it says corosync fails to start. A reboot seems to fix it.

Our SBD config is:

  SBD_DEVICE="/dev/disk/by-id/scsi-14945540000000000633a8f8ff9e6d4a36268317b4175f84e;/dev/disk/by-id/scsi-1494554000000000065418950b11404a6b7e8a6ba6a82a05c;/dev/disk/by-id/scsi-149455400000000009ad2f567db1a0c3280b1efc6d7feb0dd"
  SBD_WATCHDOG="yes"

A dump of the devices is:

  + sbd -d /dev/disk/by-id/scsi-14945540000000000633a8f8ff9e6d4a36268317b4175f84e dump
  ==Dumping header on disk /dev/disk/by-id/scsi-14945540000000000633a8f8ff9e6d4a36268317b4175f84e
  Header version     : 2.1
  UUID               : f74bf73f-a6b6-4541-b192-e36dacbae3d6
  Number of slots    : 255
  Sector size        : 512
  Timeout (watchdog) : 5
  Timeout (allocate) : 2
  Timeout (loop)     : 1
  Timeout (msgwait)  : 10
  ==Header on disk /dev/disk/by-id/scsi-14945540000000000633a8f8ff9e6d4a36268317b4175f84e is dumped

  + sbd -d /dev/disk/by-id/scsi-1494554000000000065418950b11404a6b7e8a6ba6a82a05c dump
  ==Dumping header on disk /dev/disk/by-id/scsi-1494554000000000065418950b11404a6b7e8a6ba6a82a05c
  Header version     : 2.1
  UUID               : 52b1539f-f18e-4266-8065-26ab49c472c0
  Number of slots    : 255
  Sector size        : 512
  Timeout (watchdog) : 5
  Timeout (allocate) : 2
  Timeout (loop)     : 1
  Timeout (msgwait)  : 10
  ==Header on disk /dev/disk/by-id/scsi-1494554000000000065418950b11404a6b7e8a6ba6a82a05c is dumped

  + sbd -d /dev/disk/by-id/scsi-149455400000000009ad2f567db1a0c3280b1efc6d7feb0dd dump
  ==Dumping header on disk /dev/disk/by-id/scsi-149455400000000009ad2f567db1a0c3280b1efc6d7feb0dd
  Header version     : 2.1
  UUID               : ad21ba2f-f09e-49c1-a48b-119a032dc9b2
  Number of slots    : 255
  Sector size        : 512
  Timeout (watchdog) : 5
  Timeout (allocate) : 2
  Timeout (loop)     : 1
  Timeout (msgwait)  : 10
  ==Header on disk /dev/disk/by-id/scsi-149455400000000009ad2f567db1a0c3280b1efc6d7feb0dd is dumped

We are sharing some of our SBD devices between SLES 11 and SLES 12 clusters. Is this allowed?

Thanks for any help that can be shared,
Diane Schaefer
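
P.S. For anyone who wants to run the same header check on each configured device, here is a minimal sketch, not necessarily the exact script behind the dumps in this message. It assumes `sbd` is on the PATH and that the device list uses the semicolon-separated form shown in our SBD_DEVICE setting. The function names `split_sbd_devices` and `dump_sbd_headers` are made up for illustration.

```shell
#!/bin/sh
# Hypothetical helpers; the names are illustrative, not part of sbd.

# Split a semicolon-separated SBD_DEVICE value into one path per line.
split_sbd_devices() {
    printf '%s\n' "$1" | tr ';' '\n'
}

# Run "sbd -d <device> dump" for every device in the list and report
# the exit status of each call next to the device path.
dump_sbd_headers() {
    split_sbd_devices "$1" | while IFS= read -r dev; do
        # Skip empty entries, e.g. from a trailing semicolon.
        [ -n "$dev" ] || continue
        sbd -d "$dev" dump
        echo "$dev: exit status $?"
    done
}
```

After defining the functions and sourcing /etc/sysconfig/sbd, the check would be invoked as `dump_sbd_headers "$SBD_DEVICE"`.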
_______________________________________________
Users mailing list: [email protected]
http://clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org
