> On Nov 5, 2015, at 6:56 PM, Chris Leech <cle...@redhat.com> wrote:
> 
> On Thu, Nov 05, 2015 at 02:33:44PM -0200, Guilherme G. Piccoli wrote:
>> Hello Shlomo and Or,
>> 
>> I'm Guilherme Piccoli from LTC/IBM - firstly, sorry to bother you.
>> 
>> 
>> We are running some tests with iSCSI and we found an issue caused possibly
>> by commit 659743b02c41 ("libiscsi: Reduce locking contention in fast path").
>> 
>> After some time (+/- 1 hour) of testing with a hardware target (using fio
>> benchmark tool), we got a kernel oops; the following link is a pastebin of
>> the error message (we got lots of these messages, since our system has
>> multiple cores): http://codepad.org/KS2C9Jjt
> 
> Interesting. From the trace, the list debugging code is detecting
> corruption when removing a task from some list.  Could be the connection
> mgmtqueue, cmdqueue, or requeue.
> 
> After the locking change adding a task to any of those lists is under
> the session fwrd_lock, but the call to iscsi_complete_task which deletes
> the task from whatever list it's on is under the back_lock.
> 
> Am I missing something, or is splitting a linked list across two locks a
> major failing of this change?

You are right. The patch is really wrong. Not sure what I was thinking when I
merged it. It has had multiple issues now.

Or and Shlomo, I am going to just revert the patch for now until I can do some
performance testing with a patch to add back correct locking and also
re-review the patch. Maybe instead of locks I can just use some kfifos too.
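
For anyone following along, here is a rough userspace sketch of the rule Chris
is pointing at: every add to and delete from a given list has to be done under
the same lock. This is not the libiscsi code. The names (struct queue,
queue_add_tail, queue_del, queue_len) are made up for illustration, and a
pthread mutex stands in for the session spinlocks. The assert in queue_del
only mimics the kind of check the kernel list debugging code does.

```c
#include <assert.h>
#include <pthread.h>
#include <stddef.h>

/* Hypothetical stand-ins for illustration; not the libiscsi structures. */
struct node {
    struct node *prev, *next;
};

struct queue {
    pthread_mutex_t lock;   /* the one lock that guards 'head' */
    struct node head;       /* sentinel of a circular doubly linked list */
};

static void queue_init(struct queue *q)
{
    pthread_mutex_init(&q->lock, NULL);
    q->head.prev = q->head.next = &q->head;
}

static void node_init(struct node *n)
{
    n->prev = n->next = n;  /* an unlinked node points at itself */
}

/* Submission path: link the node while holding q->lock. */
static void queue_add_tail(struct queue *q, struct node *n)
{
    pthread_mutex_lock(&q->lock);
    n->prev = q->head.prev;
    n->next = &q->head;
    q->head.prev->next = n;
    q->head.prev = n;
    pthread_mutex_unlock(&q->lock);
}

/*
 * Completion path: unlink while holding the SAME lock. If this path took a
 * different lock, it could race with queue_add_tail() and leave neighbour
 * pointers inconsistent, which is what the sanity check below would catch.
 */
static void queue_del(struct queue *q, struct node *n)
{
    pthread_mutex_lock(&q->lock);
    assert(n->prev->next == n && n->next->prev == n);
    n->prev->next = n->next;
    n->next->prev = n->prev;
    n->prev = n->next = n;
    pthread_mutex_unlock(&q->lock);
}

/* Count the linked nodes, again under the same lock. */
static size_t queue_len(struct queue *q)
{
    size_t len = 0;
    struct node *n;

    pthread_mutex_lock(&q->lock);
    for (n = q->head.next; n != &q->head; n = n->next)
        len++;
    pthread_mutex_unlock(&q->lock);
    return len;
}
```

In this sketch a caller would queue_init() the queue once, node_init() each
node, and then call queue_add_tail() and queue_del() from any thread. Whether
one lock per list performs acceptably in the fast path is exactly what still
needs to be tested.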
