On 03/01/2011 11:54 AM, oxide23 wrote:
I am connecting 2 iscsi arrays to the same network port on a linux box
over the network. One device works fine. The other gets the dreaded
conn(1011) errors.
What version of open-iscsi are you using? What is the kernel and are you
using the kernel modules with the kernel?
Is it always the same device?
eth1 Link encap:Ethernet HWaddr 00:1D:09:2F:15:B2
inet addr:128.128.101.92 Bcast:128.128.103.255 Mask:
255.255.252.0
iscsiadm -m iface
default tcp,<empty>,<empty>,<empty>,<empty>
iser iser,<empty>,<empty>,<empty>,<empty>
bnx2i.00:00:00:00:00:00 bnx2i,
00:00:00:00:00:00,<empty>,<empty>,<empty>
bnx2i.00:1d:09:2f:15:b1 bnx2i,00:1d:09:2f:
15:b1,<empty>,<empty>,<empty>
bnx2i.00:1d:09:2f:15:b3 bnx2i,00:1d:09:2f:
15:b3,<empty>,<empty>,<empty>
iface1 tcp,00:1D:09:2F:15:B2,<empty>,<empty>,<empty>
iscsiadm -m node
128.128.181.69:3260,1 iqn.1994-12.com.promise.dd.64.55.55.1.0.0.20
128.128.181.99:3260,1 iqn.2010.09.maxtronic.com:732af838098e3a24:ist1
128.128.181.51:3260,1 iqn.2010.09.maxtronic.com:732af838098e3a24:ist0
What is the output of "iscsiadm -m node -P 1"?
do I need to create an iface for each? is each considered a "network
object"? or can they share the one I created:
They can share one.
more /var/lib/iscsi/ifaces/iface1
iface.transport_name = tcp
iface.hwaddress = 00:1D:09:2F:15:B2
could this be the source of my conn(1011) disconnect errors in /var/
log/messages?
Normally, if the iface binding is not working then we would fail to even
connect, so you would not see the login successful message and you would
not get a target or device.
If your network configuration is changing while iscsi is running, then
that could cause a disruption and cause you get to get the 1011 errors.
Has anyone out there been able to get an "Arena SS-6603R" to work with
any iscsi version?
target discovery seems to work, and I can log in. However, the logs
fill up with this:
/var/log/messages.1:Feb 25 15:32:37 ndsfdh1 iscsid: Kernel reported
iSCSI connection 6:0 error (1011) state (3)
/var/log/messages.1:Feb 25 15:32:40 ndsfdh1 iscsid: connection6:0 is
operational after recovery (1 attempts)
I can then actually create a partition, create a file system on it and
then mount it. But after about 10-15 minutes, the filesystem becomes
unreadable and has to be unmounted/redone.
When you say it fills up with this messages do you mean it is
continuously happening after login, and also occurs while creating the
partition?
Is there anything else in the logs? Could you attach it?
Do you get those errors even when you are not doing any IO. So before
you do your partition and FS creation test when the system is more or
less idle, do you see those messages? If so check the target logs.
I've tried changing timeouts, however I don't know which timeouts are
the correct ones to change, by how much and if it will negatively
effect the working iscsi array. Are there some general troubleshooting
guidelines on which values are likely to have an effect and what
acceptable ranges are for operation? Do vendors of these iscsi devices
set their own acceptable ranges based on the RFC or is it not
standard?
Vendors will a lot of times recommend their preferred values.
If those are the only log messages you are seeing then the iscsi
timeouts are not coming into play. If you see iscsi nop/ping errors then
you would want to change the noop values. There are some general
guidlines for setting those and the scsi cmd timeout in the current
iscsi readme in here:
http://kernel.org/pub/linux/kernel/people/mnc/open-iscsi/releases/open-iscsi-2.0-872.tar.gz
If you are seeing only those messages you mentioned when you do the
partition and mkfs test then the scsi commands could be timing out. You
could turn on iscsi debugging
echo 1 > /sys/module/libiscsi/paramters/debug_libiscsi_eh
then run your fdisk/mkfs test and then send the logs. If a scsi command
is timeout you would want to increase that timer
(/sys/block/sdX/device/timeout - there is info on it in the README).
iscsiadm -m node -T iqn.2010.09.maxtronic.com:732af838098e3a24:ist1 |
grep timeout :
node.session.timeo.replacement_timeout = 120
node.session.err_timeo.abort_timeout = 15
node.session.err_timeo.lu_reset_timeout = 20
node.session.err_timeo.host_reset_timeout = 60
node.conn[0].timeo.logout_timeout = 15
node.conn[0].timeo.login_timeout = 15
node.conn[0].timeo.auth_timeout = 45
node.conn[0].timeo.noop_out_timeout = 10
thank you for any suggestions
--
You received this message because you are subscribed to the Google Groups
"open-iscsi" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to
[email protected].
For more options, visit this group at
http://groups.google.com/group/open-iscsi?hl=en.