The iscsiadm crash is sort of expected due to where the kernel is crashing initially.
When I get some time at work, I will make a debug patch for you to run with that should spit out some extra info.

On 07/28/2013 03:33 AM, Alex Lyakas wrote:
> Hi Mike,
> Attached is a trace of another repro, maybe it will give more info.
> The crashing iscsiadm process (2704) was spawned via fork/exec by our
> "zadara_vam" process (2657).
>
> Thanks,
> Alex.
>
>
> -----Original Message----- From: Mike Christie
> Sent: 22 July, 2013 7:14 PM
> To: [email protected]
> Cc: Alex Lyakas; Lev Vainblat; Yair Hershko; Liran Strugano
> Subject: Re: NULL pointer deref in iscsi_sw_tcp_host_get_param
>
> For me to replicate the problem I just login to the target using bnx2i.
> Just one instance of iscsiadm -m node --login using the bnx2i driver
> causes the problem.
>
> I have been trying to replicate when adding debug comments in the kernel
> but that made the problem go away. I also tried getting a crash report
> but with that enabled the problem went away.
>
>
> On 07/22/2013 09:24 AM, Alex Lyakas wrote:
>> Hi Mike,
>> any advice on how to proceed further with this issue?
>>
>> Thanks,
>> Alex.
>>
>>
>> -----Original Message----- From: Alex Lyakas
>> Sent: 02 July, 2013 9:41 PM
>> To: Mike Christie; [email protected]
>> Cc: Lev Vainblat; Yair Hershko
>> Subject: Re: NULL pointer deref in iscsi_sw_tcp_host_get_param
>>
>> Hi Mike,
>> For us it happened only once till now; and from our kernel log, I don't
>> think anything special was going on during that time, except that we were
>> reading the sysfs entry. Can you pls share how do you replicate the
>> problem with the Oracle kernel? If this narrows us down a bit on how to
>> replicate, then, yes, we can apply a debugging patch.
>>
>> One thing our application is doing, is to run several iscsiadm commands
>> (via fork/exec) in parallel. Is this, in general, a safe thing to do, i.e.,
>> running multiple iscsiadm processes in parallel? Each iscsiadm process
>> operates against a different iSCSI target.
>>
>> Thanks,
>> Alex.
>>
>>
>> -----Original Message----- From: Mike Christie
>> Sent: 02 July, 2013 8:23 PM
>> To: [email protected]
>> Cc: Alex Lyakas; Lev Vainblat; Yair Hershko
>> Subject: Re: NULL pointer deref in iscsi_sw_tcp_host_get_param
>>
>> Hey,
>>
>> Is it easy for you to replicate this problem and if so would it be
>> possible to run with a patch that spits out some extra debugging info?
>>
>> It is easy for me to replicate with the Oracle linux kernel, but when I
>> add debugging it seems to move around or become difficult to hit.
>>
>>
>> On 06/27/2013 04:54 AM, Alex Lyakas wrote:
>>> Hello Mike,
>>> thank you for responding to my bug report.
>>> Here is the information you asked for:
>>>
>>> This issue happened within a virtual machine. The network interface,
>>> that is used for iscsi within the VM is a SR-IOV Virtual Function. The
>>> VM runs a stock ixgbevf driver from 3.8.13 mainline kernel. On the
>>> physical machine, the Virtual Function is spawned out of Intel 82599EB
>>> card. The ixgbe driver for the Intel card on the physical machine is
>>> 3.11.33. The physical machine runs stock Ubuntu Precise kernel
>>> "3.2.0-29-generic #46-Ubuntu", while the VM runs mainline 3.8.13 kernel.
>>> From within the VM, we connect to targets that live both on the same
>>> physical machine and on other physical machines.
>>>
>>> I am attaching a .config file for the VM kernel. We did not build the
>>> kernel ourselves, this is a mainline build done by Ubuntu here:
>>> http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.8.13-raring/
>>> I am also attaching the full kernel log with the information about the
>>> crash, it has more stack traces.
>>>
>>> The iscsiadm commands that we use (from another application via
>>> fork()/exec()) are:
>>> iscsiadm --mode node --portal <portal> --targetname <name> --op new
>>> iscsiadm --mode node --portal <portal> --targetname <name> --login
>>> iscsiadm --mode node --portal <portal> --targetname <name> --logout
>>> iscsiadm --mode node [--print <level>]
>>> iscsiadm --mode session [--print <level>]
>>> iscsiadm --mode host [--print <level>]
>>>
>>> Occasionally we also read sysfs attributes, with a script that collects
>>> all system's sysfs entries for further analysis.
>>>
>>> We use version 2.0-871 of open-iscsi, we install them via apt-get:
>>> dpkg -l:
>>> ii open-iscsi 2.0.871-0ubuntu9.12.04.1
>>> High performance, transport independent iSCSI implementation
>>> ii open-iscsi-utils 2.0.871-0ubuntu9.12.04.1
>>> iSCSI initiatior administrative utility
>>>
>>> We open one session to each target, but we connect to multiple targets.
>>>
>>> Please let me know if any other info is needed.
>>>
>>> Thanks for your help,
>>> Alex.
>>>
>>>
>>> -----Original Message----- From: Mike Christie
>>> Sent: 27 June, 2013 10:13 AM
>>> To: [email protected]
>>> Cc: Alex Lyakas
>>> Subject: Re: NULL pointer deref in iscsi_sw_tcp_host_get_param
>>>
>>> On 06/26/2013 07:50 PM, Mike Christie wrote:
>>>> On 06/26/2013 05:27 PM, Mike Christie wrote:
>>>>> We have not seen it before. I am not seeing it here.
>>>>
>>>> Oh wait, I can hit it when using bnx2i and the OEL kernel
>>>> 2.6.39-400.17.1.el6uek.x86_64 kernel. I do not hit it with iscsi_tcp
>>>> though. Have not tried other upstream kernels with offload yet.
>>>>
>>>
>>> Huh. I tried upstream 2.6.39 to 3.8 and also 3.8.10 and could not hit
>>> the problem. I only hit it with that OEL kernel when using bnx2x. Also I
>>> hit the oops in a slightly different place.
>>>
