Persistent connections across initiator reboot

2009-04-20 Thread HIMANSHU

I discovered 2 machine,192.168.7.50(containing 10 targets,out of which
performed login on 2) and  192.168.7.51(containing 5 targets,out of
which performed login on 3).I am using IET target and 'iscsi-initiator-
utils-6.2.0.868-0.7.el5' initiator.

So at present there are 5 connections on initiator.suddenly if
initiator reboots.

we are having following code in 'iscsi' service
iscsiadm -m node --logoutall=all(stop function)
iscsiadm -m node --loginall=automatic(start function)

But disadvantage is that it will login to all 15 targets from both
7.50 and 7.51 machines,and not 5 which already were there.This is
because previous discovery entry is still there in /var/lib/iscsi/
nodes and /var/lib/iscsi/send_targets.

If I commented above 2 lines from 'iscsi' service,It works fine for
stop,start,restart service operations.i.e.initiator is restoring only
connections which were there previously.

But this doesn't work across reboot.I lost my 5 connections after
reboot though above 2 lines were commented.

Thus,what should I do for persistent connections across reboot?
--~--~-~--~~~---~--~~
You received this message because you are subscribed to the Google Groups 
open-iscsi group.
To post to this group, send email to open-iscsi@googlegroups.com
To unsubscribe from this group, send email to 
open-iscsi+unsubscr...@googlegroups.com
For more options, visit this group at http://groups.google.com/group/open-iscsi
-~--~~~~--~~--~--~---



Re: Advice on device recovery

2009-04-20 Thread Ulrich Windl

On 17 Apr 2009 at 15:22, dave wrote:

 
 Long post, but please bear with me :)
 
 This post is related to my previous post at:
 http://groups.google.com/group/open-iscsi/browse_thread/thread/4569bc674383145a/e6c2e4320ec401f5?lnk=gstq=device+not+ready#e6c2e4320ec401f5
 
 My situation:
 
 I have linux initiators running open-iscsi 2.0-869 with dm-multipath,
 queue_if_no_path enabled. The target is an OpenSolaris box sharing
 zvols from a mirrored zpool, which means the target LUNs are virtual
 devices with storage backed by the ZFS zpool.
 
 The problem:
 
 When one of the disks in the solaris zpool dies, ZFS halts reads/
 writes to the zpool for a minute or two while it waits for the disk
 controller/driver to determine if the device should be offlined. The
 side effect of this is that because the iSCSI targets are virtual
 devices with their data store being the ZFS zpool, the iSCSI read/
 writes are also halted as long as ZFS is waiting for a device to fail.
 The iSCSI targets don't disappear, they are just unable to complete
 read/write ops - they still respond fine to logins and target
 discovery. Once ZFS continues operation, the iSCSI devices also resume
 normal operation. Since I am using multipath on the linux initiators,
 the linux boxes can wait patiently for read/writes to resume, but it
 seems that the scsi system does not retry TUR messages which can cause
 the device to never be put back into operation on the initiator node.

Hi!

I guess the TUR are queued, just as any other SCSI command. Thus the TURs will 
be 
processed after your virtual LUNs did recover.

This brings up an issue I was thinking about already: Does it make any sense 
for 
mutipath to run a path checker, wehen there was regular traffic 
(command/response) 
recently? Likewise, (I don't know if iopen-iscsi does that) does it make sense 
to 
send No-Ops to a target if the target had sent a reply recently?

I'm seeing quite busy switch ports, een if the iSCSI data traffic is expected 
to 
be very low. With 16 paths per LUN here, frequent probing by multipath and open-
iscsi would be one explanation.

[...]

Regards,
Ulrich


--~--~-~--~~~---~--~~
You received this message because you are subscribed to the Google Groups 
open-iscsi group.
To post to this group, send email to open-iscsi@googlegroups.com
To unsubscribe from this group, send email to 
open-iscsi+unsubscr...@googlegroups.com
For more options, visit this group at http://groups.google.com/group/open-iscsi
-~--~~~~--~~--~--~---



Re: Persistent connections across initiator reboot

2009-04-20 Thread Ulrich Windl

On 20 Apr 2009 at 2:46, HIMANSHU wrote:

 
 I discovered 2 machine,192.168.7.50(containing 10 targets,out of which
 performed login on 2) and  192.168.7.51(containing 5 targets,out of
 which performed login on 3).I am using IET target and 'iscsi-initiator-
 utils-6.2.0.868-0.7.el5' initiator.
 
 So at present there are 5 connections on initiator.suddenly if
 initiator reboots.
 
 we are having following code in 'iscsi' service
 iscsiadm -m node --logoutall=all(stop function)
 iscsiadm -m node --loginall=automatic(start function)

Wouldn't your problem be solved if you set some of the automatic connections 
to 
manual?

 
 But disadvantage is that it will login to all 15 targets from both
 7.50 and 7.51 machines,and not 5 which already were there.This is
 because previous discovery entry is still there in /var/lib/iscsi/
 nodes and /var/lib/iscsi/send_targets.
[...]

Ulrich


--~--~-~--~~~---~--~~
You received this message because you are subscribed to the Google Groups 
open-iscsi group.
To post to this group, send email to open-iscsi@googlegroups.com
To unsubscribe from this group, send email to 
open-iscsi+unsubscr...@googlegroups.com
For more options, visit this group at http://groups.google.com/group/open-iscsi
-~--~~~~--~~--~--~---



Re: Data Digest - Unit Attention Problem

2009-04-20 Thread Joer

A few more comments on this failure:
This may be a problem with how the check response is being processed
by the iSCSI stack...possibly due to data digest.

The Inquiry command goes out, the response is received with data
digest, and appears to be accepted.  A Test Unit Ready command then
goes out, response received, and 500 ms later the initiator attempts
to open a new connection/login.   The target then resets the original
TCP connection.  This is repeated several times

Is it possible that the check response with data digest is not being
handled correctly?

The /var/log/message contains a series of the following messages:

connection2:0:0 detected conn error (1011)


On Apr 17, 9:17 am, Joer jre...@yahoo.com wrote:
 I ran into an odd interaction between having Data Digest enabled for a
 session and the classic Unit Attention check condition for the first
 command issued during a session other than Inquiry.  My base setup is
 as follows:

 Open iSCSI Initiator: 2.0-869.2
 RHEL 5.2 32 bit running in an ESX VM

 My test has four target entities behind a dual portal -portal group.
 The targets are setup to negotiate digest as follow:
 Target1: no header digest / no data digest
 Target2: no header digest / data digest
 Target3: header digest / no data digest
 Target4: header digest / data digest

 The problem:
 Open iSCSI posts a connection error and resets the connection when the
 Unit Attention check condition is received on Targets 2 and 4
 following a Test Unit Ready command.  My first guess was that there
 was a problem with Data Digest generated by for the response PDU
 containing the check.  However, the Wireshark capture performed on the
 nitiator side is saying the digest is good.  I checked the response
 PDU format and padding and it looks correct.

 Has anyone seen anything like this before?
--~--~-~--~~~---~--~~
You received this message because you are subscribed to the Google Groups 
open-iscsi group.
To post to this group, send email to open-iscsi@googlegroups.com
To unsubscribe from this group, send email to 
open-iscsi+unsubscr...@googlegroups.com
For more options, visit this group at http://groups.google.com/group/open-iscsi
-~--~~~~--~~--~--~---



Re: Persistent connections across initiator reboot

2009-04-20 Thread Mike Christie

HIMANSHU wrote:
 I discovered 2 machine,192.168.7.50(containing 10 targets,out of which
 performed login on 2) and  192.168.7.51(containing 5 targets,out of
 which performed login on 3).I am using IET target and 'iscsi-initiator-
 utils-6.2.0.868-0.7.el5' initiator.
 
 So at present there are 5 connections on initiator.suddenly if
 initiator reboots.
 
 we are having following code in 'iscsi' service
 iscsiadm -m node --logoutall=all(stop function)
 iscsiadm -m node --loginall=automatic(start function)
 
 But disadvantage is that it will login to all 15 targets from both
 7.50 and 7.51 machines,and not 5 which already were there.This is

Do you have only the target records you want to log into automatically 
marked as node.startup=automatic?


 because previous discovery entry is still there in /var/lib/iscsi/
 nodes and /var/lib/iscsi/send_targets.

I didn't follow you when you meantioned the previous entry. Are you 
running the iscsiadm discovery command at some point during startup?

 
 If I commented above 2 lines from 'iscsi' service,It works fine for
 stop,start,restart service operations.i.e.initiator is restoring only
 connections which were there previously.


Aare you running a custom script that logs into specific targets or how 
do you log into targets if the init script does not run loginall=automatic?

 
 But this doesn't work across reboot.I lost my 5 connections after
 reboot though above 2 lines were commented.
 
 Thus,what should I do for persistent connections across reboot?

I think you just want to set the targets you want to log into with the 
node.startup = automatic. When that loginall=automatic gets run, it will 
only log into those targets.

iscsiadm -m node -T target -p ip:port -o update -n node.startup -v startup


I think with IET you can set up lists to control how can log into which 
target. Does that work for you?

--~--~-~--~~~---~--~~
You received this message because you are subscribed to the Google Groups 
open-iscsi group.
To post to this group, send email to open-iscsi@googlegroups.com
To unsubscribe from this group, send email to 
open-iscsi+unsubscr...@googlegroups.com
For more options, visit this group at http://groups.google.com/group/open-iscsi
-~--~~~~--~~--~--~---



Re: Data Digest - Unit Attention Problem

2009-04-20 Thread Mike Christie

Joer wrote:
 A few more comments on this failure:
 This may be a problem with how the check response is being processed
 by the iSCSI stack...possibly due to data digest.
 
 The Inquiry command goes out, the response is received with data
 digest, and appears to be accepted.  A Test Unit Ready command then
 goes out, response received, and 500 ms later the initiator attempts
 to open a new connection/login.   The target then resets the original
 TCP connection.  This is repeated several times


Is this easy to replicated? Does it happen every time?

Were you using the initiator that comes with RHEL 5.2 or 2.0-869.2 from 
open-iscsi.org?

 
 Is it possible that the check response with data digest is not being
 handled correctly?
 
 The /var/log/message contains a series of the following messages:
 
 connection2:0:0 detected conn error (1011)
 

Do you see any 1015 errors? 1015 is for data digest errors. 1011 is 
generic connection problems or cases where we dropped it for some 
generic reason.

Is there anything else in the log? Something about a ping or nop timing out?

--~--~-~--~~~---~--~~
You received this message because you are subscribed to the Google Groups 
open-iscsi group.
To post to this group, send email to open-iscsi@googlegroups.com
To unsubscribe from this group, send email to 
open-iscsi+unsubscr...@googlegroups.com
For more options, visit this group at http://groups.google.com/group/open-iscsi
-~--~~~~--~~--~--~---