Same again unfortunately
This is with nosoftlockup and both packages updated
BUG: soft lockup detected on CPU#3!
Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
<ffffffff80289151>{update_process_times+66}
<ffffffff802713fe>{smp_local_timer_interrupt+35}
<ffffffff80271463>{smp_apic_timer_interrupt+65}
<ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
<ffffffff80224b87>{tcp_sendmsg+0}
<ffffffff803e0b9f>{sk_stream_wait_memory+89}
<ffffffff80291608>{autoremove_wake_function+0}
<ffffffff80232136>{__tcp_push_pending_frames+1367}
<ffffffff80291608>{autoremove_wake_function+0}
<ffffffff804043cb>{tcp_sendpage+1317}
<ffffffff8812ed7f>{:iscsi_trgt:istd+2343}
<ffffffff80403ea6>{tcp_sendpage+0}
<ffffffff8027fef6>{__wake_up_common+67}
<ffffffff8029131c>{keventd_create_kthread+0}
<ffffffff8812e458>{:iscsi_trgt:istd+0}
<ffffffff8029131c>{keventd_create_kthread+0}
<ffffffff80231a7d>{kthread+200} <ffffffff8025f8a2>{child_rip+8}
<ffffffff8029131c>{keventd_create_kthread+0}
<ffffffff802319b5>{kthread+0}
<ffffffff8025f89a>{child_rip+0}
It is on a different CPU now though
I also noticed when updating iscsi_trgt-kernel that there are 2 warnings
displayed, they are
..../scsi/stex.ko needs unknown symbol map mem_map
And the second is
..../block/kvblade.ko needs unknows symbol map mem_map
As far a sI know I'm not using either of those so they shouldn't be a
problem but I thought I'd mention it
-----Original Message-----
From: Rafiu Fakunle [mailto:[EMAIL PROTECTED]
Sent: Tuesday, 28 November 2006 1:30 a.m.
To: Dave Watkins
Cc: [email protected]
Subject: Re: [OF-users] iSCSI bug?
Dave Watkins wrote:
> Still the same, both with bonding enabled and disabled unfortunatly
>
First:
try adding "nosoftlockup" to the grub boot options and then run the
benchmarks again.
Next:
http://www.openfiler.com/download/PACKAGES/iscsi_trgt-kernel-r78.ccs
http://www.openfiler.com/download/PACKAGES/iscsi_trgt-r78.ccs
(kernel and userland)
same as before (--replace-files)
Finally:
Also a bit more detail about your test set-up (components, parameters,
triggers etc) would be great.
R.
> -----Original Message-----
> From: Rafiu Fakunle [mailto:[EMAIL PROTECTED]
> Sent: Monday, 27 November 2006 2:52 p.m.
> To: Rafiu Fakunle
> Cc: Dave Watkins; [email protected]
> Subject: Re: [OF-users] iSCSI bug?
>
> Rafiu Fakunle wrote:
>
>> Dave Watkins wrote:
>>
>>> Ok, UP is fine. To be sure it wasn't the e1000 driver I also tried
>>>
> using
>
>>> only the Broadcom NIC's as well. Under UP there is no error, under
>>>
> SMP
>
>>> the error reoccurs even with e1000 not loaded and no bonding.
>>>
>>> Hope this helps
>>>
>>>
>> Immensely. I'm just doing up a changeset for you.
>>
>
>
http://www.openfiler.com/download/PACKAGES/iscsi_trgt-kernel-0.4.14.ccs
>
> conary update iscsi_trgt-kernel-0.4.14.ccs --replace-files
>
> Then test again with 2.6.17.14-0.3.smp.x86_64 (with and without
bonding)
>
>
>
>
> Thx,
>
> R.
>
>> R.
>>
>>
>>> Dave
>>>
>>> -----Original Message-----
>>> From: Rafiu Fakunle [mailto:[EMAIL PROTECTED] Sent: Monday, 27
>>> November 2006 1:15 p.m.
>>> To: Dave Watkins
>>> Cc: [email protected]
>>> Subject: Re: [OF-users] iSCSI bug?
>>>
>>> OK, and UP without trunking?
>>>
>>> R.
>>>
>>> Dave Watkins wrote:
>>>
>>>
>>>> With or without trunking seem to generate the same problem
>>>>
>>>> Without trunking I got
>>>> BUG: soft lockup detected on CPU#0!
>>>>
>>>> Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
>>>> <ffffffff80289151>{update_process_times+66}
>>>> <ffffffff802713fe>{smp_local_timer_interrupt+35}
>>>> <ffffffff80271463>{smp_apic_timer_interrupt+65}
>>>> <ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
>>>> <ffffffff80224b87>{tcp_sendmsg+0}
>>>> <ffffffff80413bba>{inet_ioctl+0}
>>>> <ffffffff88141216>{:iscsi_trgt:is_data_available+62}
>>>> <ffffffff881419e7>{:iscsi_trgt:istd+1460}
>>>> <ffffffff80403ea6>{tcp_sendpage+0}
>>>> <ffffffff8027fef6>{__wake_up_common+67}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff88141433>{:iscsi_trgt:istd+0}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff80231a7d>{kthread+200}
>>>>
> <ffffffff8025f8a2>{child_rip+8}
>
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff802319b5>{kthread+0}
<ffffffff8025f89a>{child_rip+0}
>>>> BUG: soft lockup detected on CPU#0!
>>>>
>>>> Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
>>>> <ffffffff80289151>{update_process_times+66}
>>>> <ffffffff802713fe>{smp_local_timer_interrupt+35}
>>>> <ffffffff80271463>{smp_apic_timer_interrupt+65}
>>>> <ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
>>>> <ffffffff80224b87>{tcp_sendmsg+0}
>>>> <ffffffff881411c0>{:iscsi_trgt:nthread_wakeup+35}
>>>> <ffffffff881411b3>{:iscsi_trgt:nthread_wakeup+22}
>>>> <ffffffff8814219a>{:iscsi_trgt:istd+3431}
>>>> <ffffffff80403ea6>{tcp_sendpage+0}
>>>> <ffffffff8027fef6>{__wake_up_common+67}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff88141433>{:iscsi_trgt:istd+0}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff80231a7d>{kthread+200}
>>>> <ffffffff8025f8a2>{child_rip+8}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff802319b5>{kthread+0}
>>>> <ffffffff8025f89a>{child_rip+0}
>>>>
>>>> Re-enabling trunking again and I get
>>>> BUG: soft lockup detected on CPU#0!
>>>>
>>>> Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
>>>> <ffffffff80289151>{update_process_times+66}
>>>> <ffffffff802713fe>{smp_local_timer_interrupt+35}
>>>> <ffffffff80271463>{smp_apic_timer_interrupt+65}
>>>> <ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
>>>> <ffffffff80254356>{tcp_ioctl+0}
>>>> <ffffffff8020af50>{__might_sleep+30}
>>>> <ffffffff802326d7>{lock_sock+28}
>>>> <ffffffff80263257>{_spin_lock_bh+9}
>>>> <ffffffff8022fd23>{release_sock+15}
>>>> <ffffffff802543a2>{tcp_ioctl+76}
>>>> <ffffffff80413c44>{inet_ioctl+138}
>>>> <ffffffff88141216>{:iscsi_trgt:is_data_available+62}
>>>> <ffffffff8814125a>{:iscsi_trgt:do_recv+41}
>>>> <ffffffff8023081f>{qdisc_restart+24}
>>>> <ffffffff8022eaa6>{dev_queue_xmit+510}
>>>> <ffffffff8807c266>{:bonding:bond_dev_queue_xmit+489}
>>>> <ffffffff8023277e>{lock_sock+195}
>>>> <ffffffff8807fd96>{:bonding:bond_xmit_roundrobin+154}
>>>> <ffffffff80232136>{__tcp_push_pending_frames+1367}
>>>> <ffffffff8022fd23>{release_sock+15}
>>>> <ffffffff80225551>{tcp_sendmsg+2506}
>>>> <ffffffff80236f84>{do_sock_write+199}
>>>> <ffffffff803dbac1>{sock_writev+220}
>>>> <ffffffff8025db21>{cache_alloc_refill+237}
>>>> <ffffffff80220d80>{tcp_transmit_skb+1579}
>>>> <ffffffff80408067>{tcp_retransmit_skb+1352}
>>>> <ffffffff80254356>{tcp_ioctl+0}
>>>> <ffffffff8024f5a4>{finish_wait+52}
>>>> <ffffffff803e0d10>{sk_stream_wait_memory+458}
>>>> <ffffffff80291608>{autoremove_wake_function+0}
>>>> <ffffffff80291608>{autoremove_wake_function+0}
>>>> <ffffffff8022fd23>{release_sock+15}
>>>> <ffffffff80246a25>{try_to_wake_up+955}
>>>> <ffffffff88141609>{:iscsi_trgt:istd+470}
>>>> <ffffffff80403ea6>{tcp_sendpage+0}
>>>> <ffffffff8027fef6>{__wake_up_common+67}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff88141433>{:iscsi_trgt:istd+0}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff80231a7d>{kthread+200}
>>>> <ffffffff8025f8a2>{child_rip+8}
>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff8027308f>{flat_send_IPI_mask+0}
>>>> <ffffffff802319b5>{kthread+0}
>>>> <ffffffff8025f89a>{child_rip+0}
>>>>
>>>> Without trunking though the write performance after this doesn't
>>>>
> seem
>
>>>>
>>>>
>>> to
>>>
>>>
>>>> be affected (still at about 80-90MB rather than down at less than
>>>>
>>>>
>>> 10MB)
>>>
>>>
>>>> -----Original Message-----
>>>> From: Rafiu Fakunle [mailto:[EMAIL PROTECTED] Sent: Monday, 27
>>>> November 2006 12:27 p.m.
>>>> To: Dave Watkins
>>>> Cc: [email protected]
>>>> Subject: Re: [OF-users] iSCSI bug?
>>>>
>>>> Dave Watkins wrote:
>>>>
>>>>
>>>>> Sorry about that, I remembered as soon as I sent it that I hadn't
>>>>> included version. It's x86_64 version 2.2 (did a conary updateall
>>>>>
>>>>>
>>> from
>>>
>>>
>>>>> 2.1 beta. Uname -r gives 2.6.17.14-0.3.smp.x86_64.
>>>>>
>>>>> I'll try with a UP kernel although it will take some time as I
have
>>>>>
>>>>>
>>> to
>>>
>>>
>>>>> rebuild the e1000 module from the UP kernel sources.
>>>>>
>>>> Try without the network trunking anyway in the meantime. Would be
an
>>>>
>
>
>>>> interesting test.
>>>>
>>>> R.
>>>>
>>>>
>>>>
>>>>
>>>>> I'll let you know
>>>>> if I can reproduce on the UP kernel.
>>>>>
>>>>> I don't think it's related to that ticket as they are all writes
>>>>>
>>>>>
>>>> anyway
>>>>
>>>>
>>>>> and they only see the problem on large files.
>>>>>
>>>>> Dave
>>>>>
>>>>> -----Original Message-----
>>>>> From: Rafiu Fakunle [mailto:[EMAIL PROTECTED] Sent: Monday, 27
>>>>> November 2006 11:40 a.m.
>>>>> To: Dave Watkins
>>>>> Cc: [email protected]
>>>>> Subject: Re: [OF-users] iSCSI bug?
>>>>>
>>>>> Hi Dave,
>>>>>
>>>>> Excellent test and bug report.
>>>>>
>>>>> I wonder whether it may be related to this:
>>>>>
>>>>> https://project.openfiler.com/tracker/ticket/435
>>>>>
>>>>> Can you try to reproduce with a UP kernel pls.
>>>>>
>>>>> Also I need the output of `uname -r`
>>>>>
>>>>> Thx,
>>>>>
>>>>> R.
>>>>>
>>>>> FTR: this is running r58 from IET svn
>>>>>
>>>>>
>>>>> Dave Watkins wrote:
>>>>>
>>>>>
>>>>>> Hi All
>>>>>>
>>>>>> I think I've found a bug in the iscsi target software in my
>>>>>> benchmarking/testing.
>>>>>>
>>>>>> Some background on the hardware first in case it may be related.
>>>>>> Dual core/dual opteron with 2GB of ram
>>>>>> 3ware 8006 2 port raid card for OS drives
>>>>>> 3ware 9550SX card for data drives
>>>>>> Dual GB Broadcom on-board NIC's teamed into bond0 (management)
>>>>>> Quad port Intel PCI-E GB NIC with all 4 ports teamed into bond1
>>>>>>
>>>>>>
>>> (main
>>>
>>>
>>>>>> iscsi data network)
>>>>>> 4 x 250GB WD SATA HDD's in RAID5
>>>>>>
>>>>>> Of note here is that I have had to replace the e1000 driver with
>>>>>>
> the
>
>>>>>> latest from Intel to support the quad port card
>>>>>>
>>>>>> I have made some volumes and mounted them on various windows
>>>>>>
> servers
>
>>>>>>
>>>>>>
>>>>> and
>>>>>
>>>>>
>>>>>> have been using iobench to tune performance of the system. When
>>>>>>
>>>>>>
>>> using
>>>
>>>
>>>>>>
>>>>>>
>>>>> a
>>>>>
>>>>>
>>>>>> read only test pattern I see this
>>>>>>
>>>>>> BUG: soft lockup detected on CPU#0!
>>>>>>
>>>>>> Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
>>>>>> <ffffffff80289151>{update_process_times+66}
>>>>>> <ffffffff802713fe>{smp_local_timer_interrupt+35}
>>>>>> <ffffffff80271463>{smp_apic_timer_interrupt+65}
>>>>>> <ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
>>>>>> <ffffffff88141486>{:iscsi_trgt:istd+83}
>>>>>> <ffffffff88141476>{:iscsi_trgt:istd+67}
>>>>>> <ffffffff80403ea6>{tcp_sendpage+0}
>>>>>> <ffffffff8027fef6>{__wake_up_common+67}
>>>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>> <ffffffff88141433>{:iscsi_trgt:istd+0}
>>>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>> <ffffffff80231a7d>{kthread+200}
>>>>>> <ffffffff8025f8a2>{child_rip+8}
>>>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>> <ffffffff802319b5>{kthread+0}
>>>>>>
> <ffffffff8025f89a>{child_rip+0}
>
>>>>>> BUG: soft lockup detected on CPU#0!
>>>>>>
>>>>>> Call Trace: <IRQ> <ffffffff8029f73c>{softlockup_tick+210}
>>>>>> <ffffffff80289151>{update_process_times+66}
>>>>>> <ffffffff802713fe>{smp_local_timer_interrupt+35}
>>>>>> <ffffffff80271463>{smp_apic_timer_interrupt+65}
>>>>>> <ffffffff8025f54c>{apic_timer_interrupt+132} <EOI>
>>>>>> <ffffffff802631ec>{_spin_unlock_irqrestore+8}
>>>>>> <ffffffff80246a25>{try_to_wake_up+955}
>>>>>> <ffffffff881411cc>{:iscsi_trgt:nthread_wakeup+47}
>>>>>> <ffffffff8814219a>{:iscsi_trgt:istd+3431}
>>>>>> <ffffffff80403ea6>{tcp_sendpage+0}
>>>>>> <ffffffff8027fef6>{__wake_up_common+67}
>>>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>> <ffffffff88141433>{:iscsi_trgt:istd+0}
>>>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>> <ffffffff80231a7d>{kthread+200}
>>>>>> <ffffffff8025f8a2>{child_rip+8}
>>>>>> <ffffffff8029131c>{keventd_create_kthread+0}
>>>>>> <ffffffff802319b5>{kthread+0}
>>>>>>
> <ffffffff8025f89a>{child_rip+0}
>
>>>>>> Doing write only based patterns this doesn't come up. After this
>>>>>> performance of the system dives (from about 110MB/sec of iscsi
>>>>>> performance to about 10MB/sec).
>>>>>>
>>>>>> This is fairly reproducible here so if you need anymore
>>>>>>
> information
>
>>>>>>
>>>>>>
>>>>> just
>>>>>
>>>>>
>>>>>> ask.
>>>>>>
>>>>>> Dave
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>
------------------------------------------------------------------------
>
>>>
>>>
>>>>
>>>>
>>>>>
>>>>>
>>>>>> _______________________________________________
>>>>>> Openfiler-users mailing list
>>>>>> [email protected]
>>>>>> https://lists.openfiler.com/mailman/listinfo/openfiler-users
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>
>>>
>> _______________________________________________
>> Openfiler-users mailing list
>> [email protected]
>> https://lists.openfiler.com/mailman/listinfo/openfiler-users
>>
>
>
_______________________________________________
Openfiler-users mailing list
[email protected]
https://lists.openfiler.com/mailman/listinfo/openfiler-users