On 02/26/12 06:39, David Dillow wrote:
> On Sat, 2012-01-14 at 12:57 +0000, Bart Van Assche wrote:
>> Add fast_io_fail_tmo and dev_loss_tmo sysfs attributes. Block
>> the SCSI target as soon as a transport layer error has been
>> detected (ping timeout, disconnect or IB error completion). Try
>> to reconnect until dev_loss_tmo elapses.
>>
>> Disconnect the IB connection earlier in srp_remove_target() to
>> make sure that error recovery is not triggered during host
>> removal. Swap the "connected" and "removed" tests in
>> srp_queuecommand() because of this change.
>>
>> Rescan LUNs after having unblocked a SCSI target controlled by
>> ib_srp.
>
> As I've previously said, this is better handled by user space and
> you add additional failure cases when you try to retrofit a transport
> ping into SRP.

Let's look at this from another point of view. If it's possible for
the SRP protocol to implement most of the transport layer checking and
error handling in user space, then that's possible for iSCSI and FC
too. Are you claiming that the device mapper layer and the multipath
software should be redesigned such that certain parts of the transport
checking and recovery mechanisms are moved from kernel space to user
space, including a backwards-incompatible change of the kernel ABI
(sysfs) the multipath software uses today?

Bart.
