Re: error (1011) state (3)

Mike Christie Tue, 01 Mar 2011 18:53:38 -0800

On 03/01/2011 11:54 AM, oxide23 wrote:

I am connecting 2 iscsi arrays to the same network port on a linux box
over the network. One device works fine. The other gets the dreaded
conn(1011) errors.

What version of open-iscsi are you using? What is the kernel and are youusing the kernel modules with the kernel?



Is it always the same device?


eth1      Link encap:Ethernet  HWaddr 00:1D:09:2F:15:B2
           inet addr:128.128.101.92  Bcast:128.128.103.255  Mask:
255.255.252.0

iscsiadm -m iface
default tcp,<empty>,<empty>,<empty>,<empty>
iser iser,<empty>,<empty>,<empty>,<empty>
bnx2i.00:00:00:00:00:00 bnx2i,
00:00:00:00:00:00,<empty>,<empty>,<empty>
bnx2i.00:1d:09:2f:15:b1 bnx2i,00:1d:09:2f:
15:b1,<empty>,<empty>,<empty>
bnx2i.00:1d:09:2f:15:b3 bnx2i,00:1d:09:2f:
15:b3,<empty>,<empty>,<empty>
iface1 tcp,00:1D:09:2F:15:B2,<empty>,<empty>,<empty>

iscsiadm -m node
128.128.181.69:3260,1 iqn.1994-12.com.promise.dd.64.55.55.1.0.0.20
128.128.181.99:3260,1 iqn.2010.09.maxtronic.com:732af838098e3a24:ist1
128.128.181.51:3260,1 iqn.2010.09.maxtronic.com:732af838098e3a24:ist0


What is the output of "iscsiadm -m node -P 1"?

do I need to create an iface for each? is each considered a "network
object"?  or can they share the one I created:


They can share one.


more /var/lib/iscsi/ifaces/iface1
iface.transport_name = tcp
iface.hwaddress = 00:1D:09:2F:15:B2

could this be the source of my conn(1011) disconnect errors in /var/
log/messages?

Normally, if the iface binding is not working then we would fail to evenconnect, so you would not see the login successful message and you wouldnot get a target or device.

If your network configuration is changing while iscsi is running, thenthat could cause a disruption and cause you get to get the 1011 errors.


Has anyone out there been able to get an "Arena SS-6603R" to work with
any iscsi version?

target discovery seems to work, and I can log in. However, the logs
fill up with this:

/var/log/messages.1:Feb 25 15:32:37 ndsfdh1 iscsid: Kernel reported
iSCSI connection 6:0 error (1011) state (3)
/var/log/messages.1:Feb 25 15:32:40 ndsfdh1 iscsid: connection6:0 is
operational after recovery (1 attempts)

I can then actually create a partition, create a file system on it and
then mount it. But after about 10-15 minutes, the filesystem becomes
unreadable and has to be unmounted/redone.

When you say it fills up with this messages do you mean it iscontinuously happening after login, and also occurs while creating thepartition?


Is there anything else in the logs? Could you attach it?

Do you get those errors even when you are not doing any IO. So beforeyou do your partition and FS creation test when the system is more orless idle, do you see those messages? If so check the target logs.


I've tried changing timeouts, however I don't know which timeouts are
the correct ones to change, by how much and if it will negatively
effect the working iscsi array. Are there some general troubleshooting
guidelines on which values are likely to have an effect and what
acceptable ranges are for operation? Do vendors of these iscsi devices
set their own acceptable ranges based on the RFC or is it not
standard?


Vendors will a lot of times recommend their preferred values.

If those are the only log messages you are seeing then the iscsitimeouts are not coming into play. If you see iscsi nop/ping errors thenyou would want to change the noop values. There are some generalguidlines for setting those and the scsi cmd timeout in the currentiscsi readme in here:

http://kernel.org/pub/linux/kernel/people/mnc/open-iscsi/releases/open-iscsi-2.0-872.tar.gz

If you are seeing only those messages you mentioned when you do thepartition and mkfs test then the scsi commands could be timing out. Youcould turn on iscsi debugging


echo 1 > /sys/module/libiscsi/paramters/debug_libiscsi_eh

then run your fdisk/mkfs test and then send the logs. If a scsi commandis timeout you would want to increase that timer(/sys/block/sdX/device/timeout - there is info on it in the README).


iscsiadm -m node -T iqn.2010.09.maxtronic.com:732af838098e3a24:ist1 |
grep timeout :

node.session.timeo.replacement_timeout = 120
node.session.err_timeo.abort_timeout = 15
node.session.err_timeo.lu_reset_timeout = 20
node.session.err_timeo.host_reset_timeout = 60
node.conn[0].timeo.logout_timeout = 15
node.conn[0].timeo.login_timeout = 15
node.conn[0].timeo.auth_timeout = 45
node.conn[0].timeo.noop_out_timeout = 10


thank you for any suggestions


--
You received this message because you are subscribed to the Google Groups 
"open-iscsi" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/open-iscsi?hl=en.

Re: error (1011) state (3)

Reply via email to