Hi all,
I'm having stability problems with my setup.
Here's my setup
Two 24 bay supermicro servers (x8dth-6f) full with 24x Western Digital
2TB SATA RE4-GP disks.
The OS on both nodes is Debian Lenny with the backports
linux-image-2.6.32-bpo.5-amd64 kernel, the OS resides on a SSD.
I created 4x 8TB software raid5 sets (md0/1/2/3) containing 5 disks
each. The four remaining disks are hotspares
Then i created 4x DRBD volumes of the raid sets with the other server.
Using pacemaker and the iSCSITarget/iSCSILogicalUnit RA's I created 4
targets/lun/failover ips on top of the 4 DRBD volumes.
Each DRBD target/lun has it's own subnet and a dedicated 1gbit ethernet link.
I compiled the stable version v1.4.20.2 of IETD from the tarball from
the stable ietd website.
After a while of operation (sometimes a few days, sometimes a week)
the system freezes and I have to hard reset the system to get it back
online.
Pacemaker does a failover to the second node and then that node
freezes also after a while, sometimes instantly, sometimes it takes
longer.
I see these call traces in the syslog:
Jul 10 13:14:15 node01 kernel: [ 477.389011] istd1: page allocation
failure. order:0, mode:0x4020
Jul 10 13:14:17 node01 kernel: [ 477.389242] Pid: 4213, comm: istd1
Not tainted 2.6.32-bpo.5-amd64 #1
Jul 10 13:14:17 node01 kernel: [ 477.389486] Call Trace:
Jul 10 13:14:17 node01 kernel: [ 477.389675] <IRQ>
[<ffffffff810ba4b1>] ? __alloc_pages_nodemask+0x592/0x5f5
Jul 10 13:14:17 node01 kernel: [ 477.390014] [<ffffffff810e67c2>] ?
new_slab+0x5b/0x1ca
Jul 10 13:14:17 node01 kernel: [ 477.390244] [<ffffffff810e6b21>] ?
__slab_alloc+0x1f0/0x39b
Jul 10 13:14:17 node01 kernel: [ 477.390480] [<ffffffff812497b8>] ?
__netdev_alloc_skb+0x29/0x45
Jul 10 13:14:17 node01 kernel: [ 477.390789] [<ffffffff812487d1>] ?
__alloc_skb+0x3e/0x15a
Jul 10 13:14:17 node01 kernel: [ 477.391024] [<ffffffff810e7553>] ?
__kmalloc_node_track_caller+0xbb/0x11b
Jul 10 13:14:17 node01 kernel: [ 477.391276] [<ffffffff812497b8>] ?
__netdev_alloc_skb+0x29/0x45
Jul 10 13:14:17 node01 kernel: [ 477.391542] [<ffffffff812487fc>] ?
__alloc_skb+0x69/0x15a
Jul 10 13:14:17 node01 kernel: [ 477.392045] [<ffffffff812497b8>] ?
__netdev_alloc_skb+0x29/0x45
Jul 10 13:14:17 node01 kernel: [ 477.392321] [<ffffffffa02542d0>] ?
e1000_alloc_rx_buffers+0x94/0x344 [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.392555] istd1: page allocation
failure. order:0, mode:0x4020
Jul 10 13:14:17 node01 kernel: [ 477.392558] Pid: 4213, comm: istd1
Not tainted 2.6.32-bpo.5-amd64 #1
Jul 10 13:14:17 node01 kernel: [ 477.392559] Call Trace:
Jul 10 13:14:17 node01 kernel: [ 477.392560] <IRQ>
[<ffffffff810ba4b1>] ? __alloc_pages_nodemask+0x592/0x5f5
Jul 10 13:14:17 node01 kernel: [ 477.392568] [<ffffffff810e67c2>] ?
new_slab+0x5b/0x1ca
Jul 10 13:14:17 node01 kernel: [ 477.392571] [<ffffffff810e6b21>] ?
__slab_alloc+0x1f0/0x39b
Jul 10 13:14:17 node01 kernel: [ 477.392575] [<ffffffff8125cef2>] ?
find_skb+0x30/0x83
Jul 10 13:14:17 node01 kernel: [ 477.392578] [<ffffffff810e7553>] ?
__kmalloc_node_track_caller+0xbb/0x11b
Jul 10 13:14:17 node01 kernel: [ 477.392581] [<ffffffff8125cef2>] ?
find_skb+0x30/0x83
Jul 10 13:14:17 node01 kernel: [ 477.392583] [<ffffffff812487fc>] ?
__alloc_skb+0x69/0x15a
Jul 10 13:14:17 node01 kernel: [ 477.392586] [<ffffffff8125cef2>] ?
find_skb+0x30/0x83
Jul 10 13:14:17 node01 kernel: [ 477.392589] [<ffffffff8125d177>] ?
netpoll_send_udp+0x2b/0x1fc
Jul 10 13:14:17 node01 kernel: [ 477.392593] [<ffffffffa0344222>] ?
write_msg+0x90/0xeb [netconsole]
Jul 10 13:14:17 node01 kernel: [ 477.392597] [<ffffffff8104e095>] ?
__call_console_drivers+0x64/0x75
Jul 10 13:14:17 node01 kernel: [ 477.392600] [<ffffffff8104e49c>] ?
release_console_sem+0x10f/0x1af
Jul 10 13:14:17 node01 kernel: [ 477.392604] [<ffffffff8101654f>] ?
sched_clock+0x5/0x8
Jul 10 13:14:17 node01 kernel: [ 477.392607] [<ffffffff8104ea80>] ?
vprintk+0x315/0x364
Jul 10 13:14:17 node01 kernel: [ 477.392609] [<ffffffff8104e50b>] ?
release_console_sem+0x17e/0x1af
Jul 10 13:14:17 node01 kernel: [ 477.392614] [<ffffffffa02542d0>] ?
e1000_alloc_rx_buffers+0x94/0x344 [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.392618] [<ffffffffa02542d0>] ?
e1000_alloc_rx_buffers+0x94/0x344 [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.392622] [<ffffffffa02542d0>] ?
e1000_alloc_rx_buffers+0x94/0x344 [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.392626] [<ffffffff812fa4f5>] ?
printk+0x4e/0x59
Jul 10 13:14:17 node01 kernel: [ 477.392629] [<ffffffffa0251000>] ?
pci_alloc_consistent+0x0/0x94 [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.392633] [<ffffffffa02542d0>] ?
e1000_alloc_rx_buffers+0x94/0x344 [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.392835] [<ffffffffa02542d0>] ?
e1000_alloc_rx_buffers+0x94/0x344 [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.392842] [<ffffffffa02542d0>] ?
e1000_alloc_rx_buffers+0x94/0x344 [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.392846] [<ffffffff81014309>] ?
print_trace_address+0x1d/0x4b
Jul 10 13:14:17 node01 kernel: [ 477.392848] [<ffffffff810143df>] ?
print_context_stack+0x76/0xbf
Jul 10 13:14:17 node01 kernel: [ 477.392851] [<ffffffff81013926>] ?
dump_trace+0x1fb/0x24d
Jul 10 13:14:17 node01 kernel: [ 477.392854] [<ffffffff812fa35e>] ?
dump_stack+0x69/0x6f
Jul 10 13:14:17 node01 kernel: [ 477.392858] [<ffffffff81192175>] ?
__ratelimit+0xb5/0xc0
Jul 10 13:14:17 node01 kernel: [ 477.392861] [<ffffffff810ba4b1>] ?
__alloc_pages_nodemask+0x592/0x5f5
Jul 10 13:14:17 node01 kernel: [ 477.392870] [<ffffffff810e67c2>] ?
new_slab+0x5b/0x1ca
Jul 10 13:14:17 node01 kernel: [ 477.392873] [<ffffffff810e6b21>] ?
__slab_alloc+0x1f0/0x39b
Jul 10 13:14:17 node01 kernel: [ 477.392876] [<ffffffff812497b8>] ?
__netdev_alloc_skb+0x29/0x45
Jul 10 13:14:17 node01 kernel: [ 477.392878] [<ffffffff812487d1>] ?
__alloc_skb+0x3e/0x15a
Jul 10 13:14:17 node01 kernel: [ 477.392881] [<ffffffff810e7553>] ?
__kmalloc_node_track_caller+0xbb/0x11b
Jul 10 13:14:17 node01 kernel: [ 477.392884] [<ffffffff812497b8>] ?
__netdev_alloc_skb+0x29/0x45
Jul 10 13:14:17 node01 kernel: [ 477.392886] [<ffffffff812487fc>] ?
__alloc_skb+0x69/0x15a
Jul 10 13:14:17 node01 kernel: [ 477.392889] [<ffffffff812497b8>] ?
__netdev_alloc_skb+0x29/0x45
Jul 10 13:14:17 node01 kernel: [ 477.392893] [<ffffffffa02542d0>] ?
e1000_alloc_rx_buffers+0x94/0x344 [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.392898] [<ffffffffa0252b28>] ?
e1000_clean_rx_irq+0x35f/0x3fb [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.392900] [<ffffffff81012922>] ?
do_IRQ+0xa0/0xb6
Jul 10 13:14:17 node01 kernel: [ 477.392905] [<ffffffffa025687b>] ?
e1000_clean+0x2fa/0x49f [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.392909] [<ffffffff8104a40f>] ?
try_to_wake_up+0x289/0x29b
Jul 10 13:14:17 node01 kernel: [ 477.392914] [<ffffffff8105a638>] ?
run_timer_softirq+0x222/0x268
Jul 10 13:14:17 node01 kernel: [ 477.392919] [<ffffffff8124f903>] ?
net_rx_action+0xae/0x1c9
Jul 10 13:14:17 node01 kernel: [ 477.392922] [<ffffffff81053c8a>] ?
__do_softirq+0xdd/0x1a7
Jul 10 13:14:17 node01 kernel: [ 477.392925] [<ffffffff81011cac>] ?
call_softirq+0x1c/0x30
Jul 10 13:14:17 node01 kernel: [ 477.392926] <EOI>
[<ffffffff8101322b>] ? do_softirq+0x3f/0x7c
Jul 10 13:14:17 node01 kernel: [ 477.392931] [<ffffffff8105383b>] ?
_local_bh_enable_ip+0x7d/0x8f
Jul 10 13:14:17 node01 kernel: [ 477.392934] [<ffffffff81250406>] ?
dev_queue_xmit+0x35b/0x38d
Jul 10 13:14:17 node01 kernel: [ 477.392937] [<ffffffff812775cb>] ?
ip_queue_xmit+0x311/0x386
Jul 10 13:14:17 node01 kernel: [ 477.392940] [<ffffffff812864a1>] ?
tcp_rcv_established+0x57d/0x6d9
Jul 10 13:14:17 node01 kernel: [ 477.392943] [<ffffffff8128809d>] ?
tcp_send_ack+0x23/0xf4
Jul 10 13:14:17 node01 kernel: [ 477.392946] [<ffffffff81287f73>] ?
tcp_transmit_skb+0x648/0x687
Jul 10 13:14:17 node01 kernel: [ 477.392949] [<ffffffff8127ea0b>] ?
tcp_recvmsg+0x983/0xa9e
Jul 10 13:14:17 node01 kernel: [ 477.392951] [<ffffffff81191d40>] ?
radix_tree_delete+0xbf/0x1ba
Jul 10 13:14:17 node01 kernel: [ 477.392955] [<ffffffff810e56a7>] ?
__slab_free+0x7f/0x27a
Jul 10 13:14:17 node01 kernel: [ 477.392958] [<ffffffff81242b1a>] ?
sock_common_recvmsg+0x30/0x45
Jul 10 13:14:17 node01 kernel: [ 477.392961] [<ffffffff81241069>] ?
sock_recvmsg+0xa6/0xbe
Jul 10 13:14:17 node01 kernel: [ 477.392964] [<ffffffff810b844a>] ?
zone_watermark_ok+0x20/0xb1
Jul 10 13:14:17 node01 kernel: [ 477.392966] [<ffffffff81064e86>] ?
autoremove_wake_function+0x0/0x2e
Jul 10 13:14:17 node01 kernel: [ 477.392970] [<ffffffff810c758c>] ?
zone_statistics+0x3c/0x5d
Jul 10 13:14:17 node01 kernel: [ 477.392973] [<ffffffff8124382f>] ?
lock_sock_nested+0xa0/0xab
Jul 10 13:14:17 node01 kernel: [ 477.392976] [<ffffffff812fc2f1>] ?
_spin_lock_bh+0x9/0x25
Jul 10 13:14:17 node01 kernel: [ 477.392979] [<ffffffff81243702>] ?
release_sock+0x13/0xa0
Jul 10 13:14:17 node01 kernel: [ 477.392981] [<ffffffff8127d129>] ?
tcp_ioctl+0x128/0x134
Jul 10 13:14:17 node01 kernel: [ 477.392986] [<ffffffffa038ca98>] ?
do_recv+0x102/0x1e1 [iscsi_trgt]
Jul 10 13:14:17 node01 kernel: [ 477.392989] [<ffffffff810e6a04>] ?
__slab_alloc+0xd3/0x39b
Jul 10 13:14:17 node01 kernel: [ 477.392991] [<ffffffff8127eeed>] ?
sk_stream_alloc_skb+0x2f/0xd5
Jul 10 13:14:17 node01 kernel: [ 477.392994] [<ffffffff810114ce>] ?
common_interrupt+0xe/0x13
Jul 10 13:14:17 node01 kernel: [ 477.392999] [<ffffffffa0255ffb>] ?
e1000_xmit_frame+0x9b4/0xa8b [e1000]
Jul 10 13:14:17 node01 kernel: [ 477.393002] [<ffffffff8124fee0>] ?
dev_hard_start_xmit+0x211/0x2db
Jul 10 13:14:17 node01 kernel: [ 477.393005] [<ffffffff810c758c>] ?
zone_statistics+0x3c/0x5d
Jul 10 13:14:17 node01 kernel: [ 477.393009] [<ffffffff81262ea3>] ?
sch_direct_xmit+0x7f/0x14c
Jul 10 13:14:17 node01 kernel: [ 477.393012] [<ffffffff8105383b>] ?
_local_bh_enable_ip+0x7d/0x8f
Jul 10 13:14:17 node01 kernel: [ 477.393015] [<ffffffff81250406>] ?
dev_queue_xmit+0x35b/0x38d
Jul 10 13:14:17 node01 kernel: [ 477.393017] [<ffffffff812775cb>] ?
ip_queue_xmit+0x311/0x386
Jul 10 13:14:17 node01 kernel: [ 477.393021] [<ffffffff8100f5e7>] ?
__switch_to+0xd0/0x297
Jul 10 13:14:17 node01 kernel: [ 477.393024] [<ffffffff81016512>] ?
native_sched_clock+0x2e/0x66
Jul 10 13:14:17 node01 kernel: [ 477.393027] [<ffffffff8101654f>] ?
sched_clock+0x5/0x8
Jul 10 13:14:17 node01 kernel: [ 477.393030] [<ffffffff810493d2>] ?
update_rq_clock+0xf/0x28
Jul 10 13:14:17 node01 kernel: [ 477.393033] [<ffffffff8104a40f>] ?
try_to_wake_up+0x289/0x29b
Jul 10 13:14:17 node01 kernel: [ 477.393035] [<ffffffff81040420>] ?
set_next_entity+0x34/0x56
Jul 10 13:14:17 node01 kernel: [ 477.393040] [<ffffffffa038c8e8>] ?
nthread_wakeup+0x34/0x40 [iscsi_trgt]
Jul 10 13:14:17 node01 kernel: [ 477.393044] [<ffffffffa038ee76>] ?
iet_data_ready+0x22/0x37 [iscsi_trgt]
Jul 10 13:14:17 node01 kernel: [ 477.393047] [<ffffffff81243b2f>] ?
sock_def_readable+0x10/0x62
Jul 10 13:14:17 node01 kernel: [ 477.393050] [<ffffffff812864a1>] ?
tcp_rcv_established+0x57d/0x6d9
Jul 10 13:14:17 node01 kernel: [ 477.393052] [<ffffffff8100f5e7>] ?
__switch_to+0xd0/0x297
Jul 10 13:14:17 node01 kernel: [ 477.393055] [<ffffffff8103fc7c>] ?
update_curr+0xa6/0x147
Jul 10 13:14:17 node01 kernel: [ 477.393057] [<ffffffff81040420>] ?
set_next_entity+0x34/0x56
Jul 10 13:14:17 node01 kernel: [ 477.393060] [<ffffffff81041aba>] ?
pick_next_task_fair+0xca/0xd6
Jul 10 13:14:17 node01 kernel: [ 477.393063] [<ffffffff81048242>] ?
finish_task_switch+0x3a/0xaf
Jul 10 13:14:17 node01 kernel: [ 477.393066] [<ffffffff810e4627>] ?
get_partial_node+0x15/0x85
Jul 10 13:14:17 node01 kernel: [ 477.393069] [<ffffffff810e6a04>] ?
__slab_alloc+0xd3/0x39b
Jul 10 13:14:17 node01 kernel: [ 477.393073] [<ffffffffa038ab08>] ?
cmnd_alloc+0x20/0xe4 [iscsi_trgt]
Jul 10 13:14:17 node01 kernel: [ 477.393076] [<ffffffff81016512>] ?
native_sched_clock+0x2e/0x66
Jul 10 13:14:17 node01 kernel: [ 477.393078] [<ffffffff8101654f>] ?
sched_clock+0x5/0x8
Jul 10 13:14:17 node01 kernel: [ 477.393081] [<ffffffff810493d2>] ?
update_rq_clock+0xf/0x28
Jul 10 13:14:17 node01 kernel: [ 477.393085] [<ffffffffa038cd79>] ?
istd+0x202/0x1159 [iscsi_trgt]
Jul 10 13:14:17 node01 kernel: [ 477.393088] [<ffffffff8103fc7c>] ?
update_curr+0xa6/0x147
Jul 10 13:14:17 node01 kernel: [ 477.393091] [<ffffffff8127f818>] ?
tcp_sendpage+0x0/0x45d
Jul 10 13:14:17 node01 kernel: [ 477.393094] [<ffffffff8103aa46>] ?
__wake_up_common+0x44/0x72
Jul 10 13:14:17 node01 kernel: [ 477.393098] [<ffffffffa038cb77>] ?
istd+0x0/0x1159 [iscsi_trgt]
Jul 10 13:14:17 node01 kernel: [ 477.393100] [<ffffffff81064bb9>] ?
kthread+0x79/0x81
Jul 10 13:14:17 node01 kernel: [ 477.393103] [<ffffffff81011baa>] ?
child_rip+0xa/0x20
Jul 10 13:14:17 node01 kernel: [ 477.393105] [<ffffffff81064b40>] ?
kthread+0x0/0x81
Jul 10 13:14:17 node01 kernel: [ 477.393107] [<ffffffff81011ba0>] ?
child_rip+0x0/0x20
Jul 10 13:14:17 node01 kernel: [ 477.393109] Mem-Info:
Jul 10 13:14:17 node01 kernel: [ 477.393111] Node 0 DMA per-cpu:
Jul 10 13:14:17 node01 kernel: [ 477.393113] CPU 0: hi: 0,
btch: 1 usd: 0
Jul 10 13:14:17 node01 kernel: [ 477.393114] Node 0 DMA32 per-cpu:
Jul 10 13:14:17 node01 kernel: [ 477.393116] CPU 0: hi: 186,
btch: 31 usd: 30
Jul 10 13:14:17 node01 kernel: [ 477.393121] active_anon:28148
inactive_anon:28740 isolated_anon:0
Jul 10 13:14:17 node01 kernel: [ 477.393122] active_file:14658
inactive_file:24493 isolated_file:0
Jul 10 13:14:17 node01 kernel: [ 477.393123] unevictable:4547
dirty:382 writeback:19817 unstable:0
Jul 10 13:14:17 node01 kernel: [ 477.393124] free:751
slab_reclaimable:2325 slab_unreclaimable:6997
Jul 10 13:14:17 node01 kernel: [ 477.393125] mapped:3650 shmem:152
pagetables:986 bounce:0
Jul 10 13:14:17 node01 kernel: [ 477.393126] Node 0 DMA free:1984kB
min:84kB low:104kB high:124kB active_anon:0kB inactive_anon:12kB
active_file:7884kB inactive_file:5152kB unevictable:0kB
isolated(anon):0kB isolated(file):0kB present:15312kB mlocked:0kB
dirty:8kB writeback:808kB mapped:0kB shmem:0kB slab_reclaimable:392kB
slab_unreclaimable:352kB kernel_stack:0kB pagetables:0kB unstable:0kB
bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Jul 10 13:14:17 node01 kernel: [ 477.393136] lowmem_reserve[]: 0 489 489 489
Jul 10 13:14:17 node01 kernel: [ 477.393138] Node 0 DMA32 free:1020kB
min:2784kB low:3480kB high:4176kB active_anon:112592kB
inactive_anon:114948kB active_file:50748kB inactive_file:92820kB
unevictable:18188kB isolated(anon):0kB isolated(file):0kB
present:500896kB mlocked:18188kB dirty:1520kB writeback:78460kB
mapped:14600kB shmem:608kB slab_reclaimable:8908kB
slab_unreclaimable:27636kB kernel_stack:1032kB pagetables:3944kB
unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:448
all_unreclaimable? no
Jul 10 13:14:17 node01 kernel: [ 477.393148] lowmem_reserve[]: 0 0 0 0
Jul 10 13:14:17 node01 kernel: [ 477.393150] Node 0 DMA: 2*4kB 1*8kB
1*16kB 1*32kB 0*64kB 1*128kB 1*256kB 1*512kB 1*1024kB 0*2048kB
0*4096kB = 1984kB
Jul 10 13:14:17 node01 kernel: [ 477.393156] Node 0 DMA32: 187*4kB
0*8kB 1*16kB 0*32kB 2*64kB 1*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB
0*4096kB = 1020kB
Jul 10 13:14:17 node01 kernel: [ 477.393162] 41122 total pagecache pages
Jul 10 13:14:17 node01 kernel: [ 477.393163] 39 pages in swap cache
Jul 10 13:14:17 node01 kernel: [ 477.393165] Swap cache stats: add
39, delete 0, find 36/36
Jul 10 13:14:17 node01 kernel: [ 477.393166] Free swap = 2128416kB
Jul 10 13:14:17 node01 kernel: [ 477.393167] Total swap = 2128572kB
Jul 10 13:14:17 node01 kernel: [ 477.395112] 131056 pages RAM
Jul 10 13:14:17 node01 kernel: [ 477.395114] 3844 pages reserved
Jul 10 13:14:17 node01 kernel: [ 477.395115] 51112 pages shared
Jul 10 13:14:17 node01 kernel: [ 477.395116] 90961 pages non-shared
Jul 10 13:14:17 node01 kernel: [ 477.395119] SLUB: Unable to allocate
memory on node -1 (gfp=0x20)
Jul 10 13:14:17 node01 kernel: [ 477.395121] cache: kmalloc-1024,
object size: 1024, buffer size: 1024, default order: 1, min order: 0
Jul 10 13:14:17 node01 kernel: [ 477.395124] node 0: slabs: 90,
objs: 720, free: 0
I see messages about the e1000 driver and iscsi_trgt so that's why i
sent this to these mailing lists.
Sometimes I get swapper: page allocation failure. order:0, mode:0x402
It only arises when there is heavy network IO to the system.
What is causing these freezes? Is it some kind of memory leak because
it only happens after a while and not instantly?
Is this a known problem?
I tried the following:
1) Put in more RAM, the system had 4GB RAM and I upgraded to 16GB on
both nodes. This doesn't seem to have any effect.
Ps. the log above is from a virtual (vmware) instance of the same
OS image and the same issue arises in the virtual machine (512MB
memory).
2) I reverted to the lenny stable kernel linux-image-2.6.26-2-amd64
kernel and then there are no freezes, but with this kernel the
performance is much lower
then with the 2.6.32 kernel, and the system load seem to get much
higher on heavy load. I'd like to know the cause of this and keep
using the 2.6.32 kernel.
3) I tried using a different NIC (Intel Pro/1000 PT Quad port server
adpater) using the e1000e driver and I have the same issue with that
NIC.
4) I will try using the latest stable linux intel e1000e drivers
(v1.3.17) from the intel site, i already compiled them but didn't have
enough time to get results (freezes).
5) Could it be possible that ethernet flow control could solve this
issue? Maybe the storage can't handle the IO's and now I don't have
flow control enabled on the switch. I am just guessing here.
6) Googling around I found that other users have these issues with
different NIC drivers/kernels and that it is not only on debian.
Here's a link to a ubuntu bug report which doesn't provide a solution
or cause: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/164018
If more info is needed I'm willing to provide.
Maybe other mailing lists need to be informed?
Kind regards,
Caspar Smit
------------------------------------------------------------------------------
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security
threats, fraudulent activity, and more. Splunk takes this data and makes
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2
_______________________________________________
E1000-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel® Ethernet, visit
http://communities.intel.com/community/wired