Re: [zfs-discuss] Making a zvol unavailable to iSCSI trips up ZFS

Maurice Volaski Mon, 19 Jul 2010 10:39:17 -0700

This is now CR 6970210.

I've been experimenting with a two system setup in snv_134 whereeach system exports a zvol via COMSTAR iSCSI. One system importsboth its own zvol and the one from the other system and puts themtogether in a ZFS mirror.
I manually faulted the zvol on one system by physically removingsome drives. What I expect to happen is that ZFS will fault the zvolpool and the iSCSI stack will detect this and fault the target. ThenZFS for the mirrored pool will detect a failed device and report it.Throughout all this the system should operate normally, perhaps willsmall delays as it waits on failed devices.
That isn't what happens.
The removed drives were detected and the zvol zpool was faulted.This eventually resulted in iSCSI "device is busy too long" errors,and that sounds about right so far.
But the top-level mirror, which is acting as an NFS share, suddenlyvanished from its NFS client! That is, the failure of a zvol tied toiSCSI seems to poison other parts of the OS causing the NFS to fail.Isn't that odd?
At the same time, zpool status on the mirrored pool detected nothingwrong. Eventually, it did detect errors on the failed device in themirror, but oddly it didn't offline it as the logs claimed it would.Instead, it seems that I/O stopped altogether. Also, it appears thatthe iSCSI timeout errors are taking way longer than what I have themset for and even after they have timed out, ZFS is ignoring that andstill keeps trying.
Somehow, I eventually got the pool to unmount and export, but when Itried to import it, the same thing is happening. First, the iSCSIerrors seem to be ignoring the parameters to timeout and are insteadtaking an arbitrarily long time, even longer than the defaults.Second, ZFS won't give up on trying to import the pool even thoughiSCSI is reporting to it that a device has failed. That is, ZFS getshung when trying to import pools that contain a failed device. Thepool is set to continue on failure, however. And technically, withjust one device in the mirror failed, it really isn't failed, justdegraded.
These are my iSCSI parameters:
recv-login-rsp-timeout=6
conn-login-max=3
polling-login-delay=2


--

Maurice Volaski, maurice.vola...@einstein.yu.edu
Computing Support, Rose F. Kennedy Center
Albert Einstein College of Medicine of Yeshiva University
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Re: [zfs-discuss] Making a zvol unavailable to iSCSI trips up ZFS

Reply via email to