Hi all,
I am running into iscsid ping timeouts on some (not all) of my
connections, and it appears to happen when the EqualLogic system is
busier.
Example:
=======
Nov 29 01:03:47 oim6102505 kernel: connection90:0: ping timeout of 10 secs expired, recv timeout 5, last rx 198077764, last ping 198079014, now 198081514
Nov 29 01:03:47 oim6102505 kernel: connection90:0: detected conn error (1011)
Nov 29 01:03:47 oim6102505 multipathd: sdam: readsector0 checker reports path is down
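For what it's worth, the counters in that first log line look internally consistent. Below is a small sketch of the arithmetic, assuming the three values are kernel tick counts (jiffies) and reading the wrapped "last ping" value as 198079014. The tick rate is not stated anywhere in the log; it is inferred here from the 5 second recv timeout.

```shell
# Values copied from the kernel log line above.
last_rx=198077764
last_ping=198079014
now=198081514
recv_timeout=5    # seconds, from "recv timeout 5"

# Assumption: the ping was sent recv_timeout seconds after the last
# received data, so the gap gives the ticks-per-second rate.
hz=$(( (last_ping - last_rx) / recv_timeout ))
echo "implied ticks per second: $hz"

# Seconds between the ping being sent and the timeout firing.
echo "ping outstanding for: $(( (now - last_ping) / hz )) s"

# Seconds since anything at all was received on the connection.
echo "no rx for: $(( (now - last_rx) / hz )) s"
```

If that reading is right, the connection received nothing for about 15 seconds: 5 seconds idle before the ping went out, then the full 10 second ping timeout with no reply.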
GIVENS:
=======
[r...@servernamehere ~]# iscsiadm -m host -P 1
Host Number: 10
State: running
Transport: tcp
Initiatorname: <empty>
IPaddress: 192.168.9.9
HWaddress: 00:10:18:3B:e5:23
Netdev: <empty>
[r...@servernamehere ~]# rpm -qa | grep iscsi
iscsi-initiator-utils-6.2.0.871-0.7.el5
[r...@servernamehere ~]# uname -a
Linux servernamehere 2.6.18-128.2.1.4.27.el5xen #1 SMP Sat Jul 24 02:16:40 EDT 2010 i686 i686 i386 GNU/Linux
I've run into issues in the past where this was related to the nop-out
code. Mike Christie provided patches that appear to have resolved it
in the open-iscsi 871 code. I worked with Oracle support (this is an
Oracle VM 2.2.1 environment), and they were able to update their yum
repos to reflect this open-iscsi update.
Now (a year or so later), I'm starting to see more connection timeout
messages. After digging into this, it looks like we may be hitting an
EqualLogic problem where it sends pings in a different way than the
nop-out standard/code expects.
I found this thread which may be related:
http://groups.google.com/group/open-iscsi/browse_thread/thread/a220595ec4f5f1d2/e90fc5d983a6186c?lnk=gst&q=bnx2i#e90fc5d983a6186c
QUESTIONS:
===========
1) I guess what I'm wondering (and I've asked Oracle support to dig
further into this as well, btw) is whether anyone knows if bnx2 is
affected by the same kind of nop-out bugs as bnx2i?
2) If I disable nop-outs, this will likely remove these errors. But
will it negatively affect my connections? Even if the EQLX is 100%
busy doing stuff, will the SCSI and dm-multipath code just handle that
outside of the iSCSI code? In other words, I guess I don't know what
question I'm really asking here, but I'm just nervous about disabling
nop-outs :/
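For reference, this is a sketch of the settings I would be changing. It assumes the stock open-iscsi option names in /etc/iscsi/iscsid.conf and that a value of 0 disables nop-outs. My guess is that the "recv timeout 5" and "ping timeout of 10" in the log above correspond to the interval and the timeout, but that mapping is an assumption on my part.

```
# /etc/iscsi/iscsid.conf (assumed location)

# Seconds of idle time before a nop-out ping is sent; 0 should disable pings.
node.conn[0].timeo.noop_out_interval = 0

# Seconds to wait for the ping reply before failing the connection.
node.conn[0].timeo.noop_out_timeout = 0
```

As far as I understand, values in iscsid.conf only apply to node records created after the change, so existing records would need to be updated and the sessions logged out and back in.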
Any help/advice is appreciated :)
Thanks,
Joe