Hi Dejan,
On 20/12/2007, at 7:50 PM, Dejan Muhamedagic wrote:
Hi,
On Thu, Dec 20, 2007 at 05:48:10PM +0900, Trent Lloyd wrote:
Hi All,
I have recently setup a 2-node iSCSI fail-over array backed onto
shared SAS MD3000 storage.
How is this thing connected: is it iSCSI or SAS?
Sorry that wasn't clear - to the nodes running heartbeat they are
connected via SAS - they then serve them up via iSCSI.
I have everything (including RDAC) working fine on my Debian Etch
nodes - however I am curious if it is possible to get heartbeat to
demote itself if it loses access to the disks - I am not sure if I am
missing something but it seems if the disks start failing on a node
there's no mechanism to cause it to failover.
The kernel should take care of that. If the computer hangs or
crashes, there won't be heartbeat and, after a successful fencing
operation (you do have a stonith device, right?), a failover will
occur. You can also configure a watchdog. Or did I misunderstand
your question?
I would expect that if a single disk array disappears - the machine
shouldn't hang - only processes that were depending on those would
hang. The same disk array does not contain the root array or anything
like that - only the data partition.
Is there anything to do this currently?I can't see anything. I
figure
it would be possible to write a plugin to monitor the dm-multipath
stuff - is this a reasonable approach?
It's been a long time since I used that. How can one monitor
dm-multipath? Isn't it fault tolerant?
It is, but I'm talking in a situation where for some reason both paths
are lost. I know this seems kinda paranoid but it just seemed like a
reasonable thing to do to me.
Example output:
filer2:~# multipath -ll
mpath0 (360019b9000b6b68e00001c2a46e8e656) dm-0 DELL ,MD3000
[size=1.9T][features=0][hwhandler=1 rdac]
\_ round-robin 0 [prio=3][enabled]
\_ 2:0:0:0 sdd 8:48 [active][ready]
\_ round-robin 0 [prio=0][enabled]
\_ 1:0:0:0 sdb 8:16 [active][ghost]
So we could parse or write some API that makes the same call this make
to make sure that the mpath0 has at least 1 active working path.
Regards,
Trent
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems