Same again unfortunately

This is with nosoftlockup and both packages updated

BUG: soft lockup detected on CPU#3!

Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
       <ffffffff80289151>{update_process_times+66}
<ffffffff802713fe>{smp_local_timer_interrupt+35}
       <ffffffff80271463>{smp_apic_timer_interrupt+65}
<ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
       <ffffffff80224b87>{tcp_sendmsg+0}
<ffffffff803e0b9f>{sk_stream_wait_memory+89}
       <ffffffff80291608>{autoremove_wake_function+0}
<ffffffff80232136>{__tcp_push_pending_frames+1367}
       <ffffffff80291608>{autoremove_wake_function+0}
<ffffffff804043cb>{tcp_sendpage+1317}
       <ffffffff8812ed7f>{:iscsi_trgt:istd+2343}
<ffffffff80403ea6>{tcp_sendpage+0}
       <ffffffff8027fef6>{__wake_up_common+67}
<ffffffff8029131c>{keventd_create_kthread+0}
       <ffffffff8812e458>{:iscsi_trgt:istd+0}
<ffffffff8029131c>{keventd_create_kthread+0}
       <ffffffff80231a7d>{kthread+200} <ffffffff8025f8a2>{child_rip+8}
       <ffffffff8029131c>{keventd_create_kthread+0}
<ffffffff802319b5>{kthread+0}
       <ffffffff8025f89a>{child_rip+0}

It is on a different CPU now though

I also noticed when updating iscsi_trgt-kernel that there are 2 warnings
displayed, they are

..../scsi/stex.ko needs unknown symbol map mem_map

And the second is 

..../block/kvblade.ko needs unknows symbol map mem_map

As far a sI know I'm not using either of those so they shouldn't be a
problem but I thought I'd mention it

-----Original Message-----
From: Rafiu Fakunle [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, 28 November 2006 1:30 a.m.
To: Dave Watkins
Cc: [email protected]
Subject: Re: [OF-users] iSCSI bug?

Dave Watkins wrote:
> Still the same, both with bonding enabled and disabled unfortunatly
>   
First:

try adding "nosoftlockup" to the grub boot options and then run the 
benchmarks again.

Next:

http://www.openfiler.com/download/PACKAGES/iscsi_trgt-kernel-r78.ccs
http://www.openfiler.com/download/PACKAGES/iscsi_trgt-r78.ccs

(kernel and userland)

same as before (--replace-files)

Finally:

Also a bit more detail about your test set-up (components, parameters, 
triggers etc) would be great.

R.
> -----Original Message-----
> From: Rafiu Fakunle [mailto:[EMAIL PROTECTED] 
> Sent: Monday, 27 November 2006 2:52 p.m.
> To: Rafiu Fakunle
> Cc: Dave Watkins; [email protected]
> Subject: Re: [OF-users] iSCSI bug?
>
> Rafiu Fakunle wrote:
>   
>> Dave Watkins wrote:
>>     
>>> Ok, UP is fine. To be sure it wasn't the e1000 driver I also tried
>>>       
> using
>   
>>> only the Broadcom NIC's as well. Under UP there is no error, under
>>>       
> SMP
>   
>>> the error reoccurs even with e1000 not loaded and no bonding.
>>>
>>> Hope this helps
>>>   
>>>       
>> Immensely. I'm just doing up a changeset for you.
>>     
>
>
http://www.openfiler.com/download/PACKAGES/iscsi_trgt-kernel-0.4.14.ccs
>
> conary update iscsi_trgt-kernel-0.4.14.ccs --replace-files
>
> Then test again with 2.6.17.14-0.3.smp.x86_64 (with and without
bonding)
>
>
>
>
> Thx,
>
> R.
>   
>> R.
>>
>>     
>>> Dave
>>>
>>> -----Original Message-----
>>> From: Rafiu Fakunle [mailto:[EMAIL PROTECTED] Sent: Monday, 27 
>>> November 2006 1:15 p.m.
>>> To: Dave Watkins
>>> Cc: [email protected]
>>> Subject: Re: [OF-users] iSCSI bug?
>>>
>>> OK, and UP without trunking?
>>>
>>> R.
>>>
>>> Dave Watkins wrote:
>>>  
>>>       
>>>> With or without trunking seem to generate the same problem
>>>>
>>>> Without trunking I got
>>>> BUG: soft lockup detected on CPU#0!
>>>>
>>>> Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
>>>>        <ffffffff80289151>{update_process_times+66}
>>>> <ffffffff802713fe>{smp_local_timer_interrupt+35}
>>>>        <ffffffff80271463>{smp_apic_timer_interrupt+65}
>>>> <ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
>>>>        <ffffffff80224b87>{tcp_sendmsg+0}
>>>> <ffffffff80413bba>{inet_ioctl+0}
>>>>        <ffffffff88141216>{:iscsi_trgt:is_data_available+62}
>>>>        <ffffffff881419e7>{:iscsi_trgt:istd+1460}
>>>> <ffffffff80403ea6>{tcp_sendpage+0}
>>>>        <ffffffff8027fef6>{__wake_up_common+67}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>        <ffffffff88141433>{:iscsi_trgt:istd+0}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>        <ffffffff80231a7d>{kthread+200}
>>>>         
> <ffffffff8025f8a2>{child_rip+8}
>   
>>>>        <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>>        <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>>        <ffffffff802319b5>{kthread+0}
<ffffffff8025f89a>{child_rip+0}
>>>> BUG: soft lockup detected on CPU#0!
>>>>
>>>> Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
>>>>        <ffffffff80289151>{update_process_times+66}
>>>> <ffffffff802713fe>{smp_local_timer_interrupt+35}
>>>>        <ffffffff80271463>{smp_apic_timer_interrupt+65}
>>>> <ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
>>>>        <ffffffff80224b87>{tcp_sendmsg+0}
>>>> <ffffffff881411c0>{:iscsi_trgt:nthread_wakeup+35}
>>>>        <ffffffff881411b3>{:iscsi_trgt:nthread_wakeup+22}
>>>> <ffffffff8814219a>{:iscsi_trgt:istd+3431}
>>>>        <ffffffff80403ea6>{tcp_sendpage+0}
>>>> <ffffffff8027fef6>{__wake_up_common+67}
>>>>        <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff88141433>{:iscsi_trgt:istd+0}
>>>>        <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff80231a7d>{kthread+200}
>>>>        <ffffffff8025f8a2>{child_rip+8}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>        <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>>        <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff802319b5>{kthread+0}
>>>>        <ffffffff8025f89a>{child_rip+0}
>>>>
>>>> Re-enabling trunking again and I get
>>>> BUG: soft lockup detected on CPU#0!
>>>>
>>>> Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
>>>>        <ffffffff80289151>{update_process_times+66}
>>>> <ffffffff802713fe>{smp_local_timer_interrupt+35}
>>>>        <ffffffff80271463>{smp_apic_timer_interrupt+65}
>>>> <ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
>>>>        <ffffffff80254356>{tcp_ioctl+0}
>>>> <ffffffff8020af50>{__might_sleep+30}
>>>>        <ffffffff802326d7>{lock_sock+28}
>>>> <ffffffff80263257>{_spin_lock_bh+9}
>>>>        <ffffffff8022fd23>{release_sock+15}
>>>> <ffffffff802543a2>{tcp_ioctl+76}
>>>>        <ffffffff80413c44>{inet_ioctl+138}
>>>> <ffffffff88141216>{:iscsi_trgt:is_data_available+62}
>>>>        <ffffffff8814125a>{:iscsi_trgt:do_recv+41}
>>>> <ffffffff8023081f>{qdisc_restart+24}
>>>>        <ffffffff8022eaa6>{dev_queue_xmit+510}
>>>> <ffffffff8807c266>{:bonding:bond_dev_queue_xmit+489}
>>>>        <ffffffff8023277e>{lock_sock+195}
>>>> <ffffffff8807fd96>{:bonding:bond_xmit_roundrobin+154}
>>>>        <ffffffff80232136>{__tcp_push_pending_frames+1367}
>>>> <ffffffff8022fd23>{release_sock+15}
>>>>        <ffffffff80225551>{tcp_sendmsg+2506}
>>>> <ffffffff80236f84>{do_sock_write+199}
>>>>        <ffffffff803dbac1>{sock_writev+220}
>>>> <ffffffff8025db21>{cache_alloc_refill+237}
>>>>        <ffffffff80220d80>{tcp_transmit_skb+1579}
>>>> <ffffffff80408067>{tcp_retransmit_skb+1352}
>>>>        <ffffffff80254356>{tcp_ioctl+0}
>>>> <ffffffff8024f5a4>{finish_wait+52}
>>>>        <ffffffff803e0d10>{sk_stream_wait_memory+458}
>>>> <ffffffff80291608>{autoremove_wake_function+0}
>>>>        <ffffffff80291608>{autoremove_wake_function+0}
>>>> <ffffffff8022fd23>{release_sock+15}
>>>>        <ffffffff80246a25>{try_to_wake_up+955}
>>>> <ffffffff88141609>{:iscsi_trgt:istd+470}
>>>>        <ffffffff80403ea6>{tcp_sendpage+0}
>>>> <ffffffff8027fef6>{__wake_up_common+67}
>>>>        <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff88141433>{:iscsi_trgt:istd+0}
>>>>        <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff80231a7d>{kthread+200}
>>>>        <ffffffff8025f8a2>{child_rip+8}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>        <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>>        <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff802319b5>{kthread+0}
>>>>        <ffffffff8025f89a>{child_rip+0}
>>>>
>>>> Without trunking though the write performance after this doesn't
>>>>         
> seem
>   
>>>>     
>>>>         
>>> to
>>>  
>>>       
>>>> be affected (still at about 80-90MB rather than down at less than
>>>>     
>>>>         
>>> 10MB)
>>>  
>>>       
>>>> -----Original Message-----
>>>> From: Rafiu Fakunle [mailto:[EMAIL PROTECTED] Sent: Monday, 27 
>>>> November 2006 12:27 p.m.
>>>> To: Dave Watkins
>>>> Cc: [email protected]
>>>> Subject: Re: [OF-users] iSCSI bug?
>>>>
>>>> Dave Watkins wrote:
>>>>      
>>>>         
>>>>> Sorry about that, I remembered as soon as I sent it that I hadn't
>>>>> included version. It's x86_64 version 2.2 (did a conary updateall
>>>>>       
>>>>>           
>>> from
>>>  
>>>       
>>>>> 2.1 beta. Uname -r gives 2.6.17.14-0.3.smp.x86_64.
>>>>>
>>>>> I'll try with a UP kernel although it will take some time as I
have
>>>>>       
>>>>>           
>>> to
>>>  
>>>       
>>>>> rebuild the e1000 module from the UP kernel sources.           
>>>>>           
>>>> Try without the network trunking anyway in the meantime. Would be
an
>>>>         
>
>   
>>>> interesting test.
>>>>
>>>> R.
>>>>
>>>>
>>>>      
>>>>         
>>>>> I'll let you know
>>>>> if I can reproduce on the UP kernel.
>>>>>
>>>>> I don't think it's related to that ticket as they are all writes
>>>>>           
>>>>>           
>>>> anyway
>>>>      
>>>>         
>>>>> and they only see the problem on large files.
>>>>>
>>>>> Dave
>>>>>
>>>>> -----Original Message-----
>>>>> From: Rafiu Fakunle [mailto:[EMAIL PROTECTED] Sent: Monday, 27 
>>>>> November 2006 11:40 a.m.
>>>>> To: Dave Watkins
>>>>> Cc: [email protected]
>>>>> Subject: Re: [OF-users] iSCSI bug?
>>>>>
>>>>> Hi Dave,
>>>>>
>>>>> Excellent test and bug report.
>>>>>
>>>>> I wonder whether it may be related to this:
>>>>>
>>>>> https://project.openfiler.com/tracker/ticket/435
>>>>>
>>>>> Can you try to reproduce with a UP kernel pls.
>>>>>
>>>>> Also I need the output of `uname -r`
>>>>>
>>>>> Thx,
>>>>>
>>>>> R.
>>>>>
>>>>> FTR: this is running r58 from IET svn
>>>>>
>>>>>
>>>>> Dave Watkins wrote:
>>>>>            
>>>>>           
>>>>>> Hi All
>>>>>>
>>>>>> I think I've found a bug in the iscsi target software in my
>>>>>> benchmarking/testing.
>>>>>>
>>>>>> Some background on the hardware first in case it may be related.
>>>>>> Dual core/dual opteron with 2GB of ram
>>>>>> 3ware 8006 2 port raid card for OS drives
>>>>>> 3ware 9550SX card for data drives
>>>>>> Dual GB Broadcom on-board NIC's teamed into bond0 (management)
>>>>>> Quad port Intel PCI-E GB NIC with all 4 ports teamed into bond1
>>>>>>         
>>>>>>             
>>> (main
>>>  
>>>       
>>>>>> iscsi data network)
>>>>>> 4 x 250GB WD SATA HDD's in RAID5
>>>>>>
>>>>>> Of note here is that I have had to replace the e1000 driver with
>>>>>>             
> the
>   
>>>>>> latest from Intel to support the quad port card
>>>>>>
>>>>>> I have made some volumes and mounted them on various windows
>>>>>>             
> servers
>   
>>>>>>                   
>>>>>>             
>>>>> and
>>>>>            
>>>>>           
>>>>>> have been using iobench to tune performance of the system. When
>>>>>>         
>>>>>>             
>>> using
>>>  
>>>       
>>>>>>                   
>>>>>>             
>>>>> a
>>>>>            
>>>>>           
>>>>>> read only test pattern I see this
>>>>>>
>>>>>> BUG: soft lockup detected on CPU#0!
>>>>>>
>>>>>> Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
>>>>>>        <ffffffff80289151>{update_process_times+66}
>>>>>> <ffffffff802713fe>{smp_local_timer_interrupt+35}
>>>>>>        <ffffffff80271463>{smp_apic_timer_interrupt+65}
>>>>>> <ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
>>>>>>        <ffffffff88141486>{:iscsi_trgt:istd+83}
>>>>>> <ffffffff88141476>{:iscsi_trgt:istd+67}
>>>>>>        <ffffffff80403ea6>{tcp_sendpage+0}
>>>>>> <ffffffff8027fef6>{__wake_up_common+67}
>>>>>>        <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>> <ffffffff88141433>{:iscsi_trgt:istd+0}
>>>>>>        <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>> <ffffffff80231a7d>{kthread+200}
>>>>>>        <ffffffff8025f8a2>{child_rip+8}
>>>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>>        <ffffffff802319b5>{kthread+0}
>>>>>>             
> <ffffffff8025f89a>{child_rip+0}
>   
>>>>>> BUG: soft lockup detected on CPU#0!
>>>>>>
>>>>>> Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
>>>>>>        <ffffffff80289151>{update_process_times+66}
>>>>>> <ffffffff802713fe>{smp_local_timer_interrupt+35}
>>>>>>        <ffffffff80271463>{smp_apic_timer_interrupt+65}
>>>>>> <ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
>>>>>>        <ffffffff802631ec>{_spin_unlock_irqrestore+8}
>>>>>> <ffffffff80246a25>{try_to_wake_up+955}
>>>>>>        <ffffffff881411cc>{:iscsi_trgt:nthread_wakeup+47}
>>>>>> <ffffffff8814219a>{:iscsi_trgt:istd+3431}
>>>>>>        <ffffffff80403ea6>{tcp_sendpage+0}
>>>>>> <ffffffff8027fef6>{__wake_up_common+67}
>>>>>>        <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>> <ffffffff88141433>{:iscsi_trgt:istd+0}
>>>>>>        <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>> <ffffffff80231a7d>{kthread+200}
>>>>>>        <ffffffff8025f8a2>{child_rip+8}
>>>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>>        <ffffffff802319b5>{kthread+0}
>>>>>>             
> <ffffffff8025f89a>{child_rip+0}
>   
>>>>>> Doing write only based patterns this doesn't come up. After this
>>>>>> performance of the system dives (from about 110MB/sec of iscsi
>>>>>> performance to about 10MB/sec).
>>>>>>
>>>>>> This is fairly reproducible here so if you need anymore
>>>>>>             
> information
>   
>>>>>>                   
>>>>>>             
>>>>> just
>>>>>            
>>>>>           
>>>>>> ask.
>>>>>>
>>>>>> Dave
>>>>>>
>>>>>>  
>>>>>>                   
>>>>>>             
>
------------------------------------------------------------------------
>   
>>>  
>>>       
>>>>      
>>>>         
>>>>>            
>>>>>           
>>>>>> _______________________________________________
>>>>>> Openfiler-users mailing list
>>>>>> [email protected]
>>>>>> https://lists.openfiler.com/mailman/listinfo/openfiler-users
>>>>>>                     
>>>>>>             
>>>>>             
>>>>>           
>>>>       
>>>>         
>>>   
>>>       
>> _______________________________________________
>> Openfiler-users mailing list
>> [email protected]
>> https://lists.openfiler.com/mailman/listinfo/openfiler-users
>>     
>
>   

_______________________________________________
Openfiler-users mailing list
[email protected]
https://lists.openfiler.com/mailman/listinfo/openfiler-users

Reply via email to