On 2010-04-14 19:42, Mike Christie wrote:
On 04/14/2010 07:02 AM, Christian Iversen wrote:
What I'd like is the following:

- Never give up trying (or at least try for a month :)

The iscsi initiator almost always tries to reconnect to the target. If
it gets a successful login then that fails it will try to relogin until
the the user runs some iscsiadm command to logout.

If you mean you want it to hold onto IO and not fail it, then you want
the replacement_timeout/recovery_timeout. There should be info in the
README and iscsid.conf about this. If it is not clear let me know.

There's info about replacement_timeout, but no recovery_timeout. Maybe
only the former is a valid name?

replacement_timeout is the name of the setting in iscsid.conf, but for
some dumb reason I named it recovery_timeout in the kernel.

Ah, ok. I'll go with replacement_timeout then :)

If in the iscsid.conf you see this for
node.session.timeo.replacement_timeout then this is what I think you are
asking for (that is if you are saying you do not want IO failed) and you
want to set the value to 0.
# - If the value is 0, IO will be failed immediately.
# - If the value is less than 0, IO will remain queued until the session
# is logged back in, or until the user runs the logout command.

I'm a little unsure about the semantics for "failed io". What I want is
the iscsi client to see all IO as working, or hanging indefinitely if
the server cannot be contacted.


Then set the replacement_timeout to -1.

Ok.

What about these timeouts?

node.session.err_timeo.abort_timeout = x
node.session.err_timeo.lu_reset_timeout = y
node.session.err_timeo.host_reset_timeout = z

What are reasonable values for x, y and z, and when are they used?

If there is a low-level error, I'd like iscsi to detect this quickly and
reconnect right away. (this will happen when there's a failover). Will
the following settings work for this purpose:

node.conn[0].timeo.noop_out_interval = 2
node.conn[0].timeo.noop_out_timeout = 2
node.session.timeo.replacement_timeout = 86400

Yes.

I'll use this then.

Per my understanding: This will ping the server every 2. seconds, and
wait 2 seconds for a reply. If a connection problem is discovered, the
client will try for 24 hours (86400 seconds) to reestablish a connection
before giving up and returning IO errors to higher layers.

Is this correct? From your description it seems like replacement_timeout

Yes.

= 0 would cause immediate IO errors in case of connection problems? Or
did I misunderstand?


Yeah, on newer versions 0 causes the IO to be failed immediately. I
wrote that wrong before.

Was it different on old versions? In any case, I'll use a value of 86400 for that timeout :)

--
Med venlig hilsen / Best regards
Christian Iversen

Sikkerhed.org ApS
Fuglebakkevej 88                       E-mail:  [email protected]
1. sal                                 Web:     www.sikkerhed.org
DK-2000 Frederiksberg                  Direkte: [email protected]

--
You received this message because you are subscribed to the Google Groups 
"open-iscsi" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/open-iscsi?hl=en.

Reply via email to