trigon3 driver memory leak -> crash
hi all it's my first post in this list, so please point me to another source of inspiration if needed. i'm using a dell poweredge 2650 with four nic (two of them tg3). this server works as transparent bridge. (bridge code and netfilter). currently i am running linux 2.6.11.6. i observe a steady decrease of free memory and at the same time an increase of active memory /proc/meminfo. i stopped almost all services and unloaded most of the iptables-modules. and still there was this phenomenon. after a while the server crashes with help of netconsole i was able to fetch the latest messages (postet below). there was no unusual traffic or something wired in my logfiles. btw, the decrease doesn't not reache the bottom, i.e., there was still some memory free befor crash. the server was running perfectly with kernel 2.6.6. is there an issue with trigon3 network cards? regards moritz here the messages capturated by netcat from netconsole: NETDEV WATCHDOG: eth2: transmit timed out tg3: eth2: transmit timed out, resetting tg3: tg3_stop_block timed out, ofs=2c00 enable_bit=2 tg3: tg3_stop_block timed out, ofs=3400 enable_bit=2 tg3: tg3_stop_block timed out, ofs=2400 enable_bit=2 tg3: tg3_stop_block timed out, ofs=1800 enable_bit=2 tg3: tg3_stop_block timed out, ofs=4800 enable_bit=2 tg3: eth2: Link is down. br0: port 2(eth2) entering disabled state tg3: eth2: Link is up at 100 Mbps, half duplex. tg3: eth2: Flow control is off for TX and off for RX. br0: port 2(eth2) entering learning state br0: topology change detected, propagating br0: port 2(eth2) entering forwarding state Ebtables v2.0 registered HTB: quantum of class 10033 is small. Consider r2q change. oom-killer: gfp_mask=0xd0 DMA per-cpu: cpu 0 hot: low 2, high 6, batch 1 cpu 0 cold: low 0, high 2, batch 1 cpu 1 hot: low 2, high 6, batch 1 cpu 1 cold: low 0, high 2, batch 1 cpu 2 hot: low 2, high 6, batch 1 cpu 2 cold: low 0, high 2, batch 1 cpu 3 hot: low 2, high 6, batch 1 cpu 3 cold: low 0, high 2, batch 1 Normal per-cpu: cpu 0 hot: low 32, high 96, batch 16 cpu 0 cold: low 0, high 32, batch 16 cpu 1 hot: low 32, high 96, batch 16 cpu 1 cold: low 0, high 32, batch 16 cpu 2 hot: low 32, high 96, batch 16 cpu 2 cold: low 0, high 32, batch 16 cpu 3 hot: low 32, high 96, batch 16 cpu 3 cold: low 0, high 32, batch 16 HighMem per-cpu: cpu 0 hot: low 32, high 96, batch 16 cpu 0 cold: low 0, high 32, batch 16 cpu 1 hot: low 32, high 96, batch 16 cpu 1 cold: low 0, high 32, batch 16 cpu 2 hot: low 32, high 96, batch 16 cpu 2 cold: low 0, high 32, batch 16 cpu 3 hot: low 32, high 96, batch 16 cpu 3 cold: low 0, high 32, batch 16 Free pages: 433920kB (420416kB HighMem) Active:186580 inactive:1496 dirty:38836 writeback:0 unstable:0 free:108480 slab:218645 mapped:4893 pagetables:131 DMA free:3696kB min:176kB low:220kB high:264kB active:0kB inactive:32kB present:16384kB pages_scanned:399 all_unreclaimable? yes lowmem_reserve[]: 0 880 2031 Normal free:9808kB min:9820kB low:12272kB high:14728kB active:4kB inactive:108kB present:901120kB pages_scanned:14744 all_unreclaimable? yes lowmem_reserve[]: 0 0 9215 HighMem free:420416kB min:512kB low:640kB high:768kB active:746316kB inactive:5844kB present:1179520kB pages_scanned:0 all_unreclaimable? no lowmem_reserve[]: 0 0 0 DMA: 0*4kB 0*8kB 1*16kB 1*32kB 1*64kB 0*128kB 0*256kB 1*512kB 1*1024kB 1*2048kB 0*4096kB = 3696kB Normal: 0*4kB 0*8kB 1*16kB 0*32kB 1*64kB 0*128kB 0*256kB 1*512kB 1*1024kB 0*2048kB 2*4096kB = 9808kB HighMem: 0*4kB 18*8kB 63*16kB 6*32kB 4*64kB 2*128kB 1*256kB 1*512kB 2*1024kB 21*2048kB 91*4096kB = 420416kB Swap cache: add 0, delete 0, find 0/0, race 0+0 Free swap = 979924kB Total swap = 979924kB Out of Memory: Killed process 13539 (apache). oom-killer: gfp_mask=0xd0 DMA per-cpu: cpu 0 hot: low 2, high 6, batch 1 cpu 0 cold: low 0, high 2, batch 1 cpu 1 hot: low 2, high 6, batch 1 cpu 1 cold: low 0, high 2, batch 1 cpu 2 hot: low 2, high 6, batch 1 cpu 2 cold: low 0, high 2, batch 1 cpu 3 hot: low 2, high 6, batch 1 cpu 3 cold: low 0, high 2, batch 1 Normal per-cpu: cpu 0 hot: low 32, high 96, batch 16 cpu 0 cold: low 0, high 32, batch 16 cpu 1 hot: low 32, high 96, batch 16 cpu 1 cold: low 0, high 32, batch 16 cpu 2 hot: low 32, high 96, batch 16 cpu 2 cold: low 0, high 32, batch 16 cpu 3 hot: low 32, high 96, batch 16 cpu 3 cold: low 0, high 32, batch 16 HighMem per-cpu: cpu 0 hot: low 32, high 96, batch 16 cpu 0 cold: low 0, high 32, batch 16 cpu 1 hot: low 32, high 96, batch 16 cpu 1 cold: low 0, high 32, batch 16 cpu 2 hot: low 32, high 96, batch 16 cpu 2 cold: low 0, high 32, batch 16 cpu 3 hot: low 32, high 96, batch 16 cpu 3 cold: low 0, high 32, batch 16 Free pages: 433920kB (420480kB HighMem) Active:186531 inactive:1495 dirty:38835 writeback:0 unstable:0 free:108480 slab:218665 mapped:4837 pagetables:125 DMA free:3696kB min:176kB low:220kB high:264kB active:0kB inactive:24kB present:16384kB pages_scanned:2265 all_unreclaimable? yes lowmem_reserve[]: 0 880 2031 Normal
trigon3 driver memory leak - crash
hi all it's my first post in this list, so please point me to another source of inspiration if needed. i'm using a dell poweredge 2650 with four nic (two of them tg3). this server works as transparent bridge. (bridge code and netfilter). currently i am running linux 2.6.11.6. i observe a steady decrease of free memory and at the same time an increase of active memory /proc/meminfo. i stopped almost all services and unloaded most of the iptables-modules. and still there was this phenomenon. after a while the server crashes with help of netconsole i was able to fetch the latest messages (postet below). there was no unusual traffic or something wired in my logfiles. btw, the decrease doesn't not reache the bottom, i.e., there was still some memory free befor crash. the server was running perfectly with kernel 2.6.6. is there an issue with trigon3 network cards? regards moritz here the messages capturated by netcat from netconsole: NETDEV WATCHDOG: eth2: transmit timed out tg3: eth2: transmit timed out, resetting tg3: tg3_stop_block timed out, ofs=2c00 enable_bit=2 tg3: tg3_stop_block timed out, ofs=3400 enable_bit=2 tg3: tg3_stop_block timed out, ofs=2400 enable_bit=2 tg3: tg3_stop_block timed out, ofs=1800 enable_bit=2 tg3: tg3_stop_block timed out, ofs=4800 enable_bit=2 tg3: eth2: Link is down. br0: port 2(eth2) entering disabled state tg3: eth2: Link is up at 100 Mbps, half duplex. tg3: eth2: Flow control is off for TX and off for RX. br0: port 2(eth2) entering learning state br0: topology change detected, propagating br0: port 2(eth2) entering forwarding state Ebtables v2.0 registered HTB: quantum of class 10033 is small. Consider r2q change. oom-killer: gfp_mask=0xd0 DMA per-cpu: cpu 0 hot: low 2, high 6, batch 1 cpu 0 cold: low 0, high 2, batch 1 cpu 1 hot: low 2, high 6, batch 1 cpu 1 cold: low 0, high 2, batch 1 cpu 2 hot: low 2, high 6, batch 1 cpu 2 cold: low 0, high 2, batch 1 cpu 3 hot: low 2, high 6, batch 1 cpu 3 cold: low 0, high 2, batch 1 Normal per-cpu: cpu 0 hot: low 32, high 96, batch 16 cpu 0 cold: low 0, high 32, batch 16 cpu 1 hot: low 32, high 96, batch 16 cpu 1 cold: low 0, high 32, batch 16 cpu 2 hot: low 32, high 96, batch 16 cpu 2 cold: low 0, high 32, batch 16 cpu 3 hot: low 32, high 96, batch 16 cpu 3 cold: low 0, high 32, batch 16 HighMem per-cpu: cpu 0 hot: low 32, high 96, batch 16 cpu 0 cold: low 0, high 32, batch 16 cpu 1 hot: low 32, high 96, batch 16 cpu 1 cold: low 0, high 32, batch 16 cpu 2 hot: low 32, high 96, batch 16 cpu 2 cold: low 0, high 32, batch 16 cpu 3 hot: low 32, high 96, batch 16 cpu 3 cold: low 0, high 32, batch 16 Free pages: 433920kB (420416kB HighMem) Active:186580 inactive:1496 dirty:38836 writeback:0 unstable:0 free:108480 slab:218645 mapped:4893 pagetables:131 DMA free:3696kB min:176kB low:220kB high:264kB active:0kB inactive:32kB present:16384kB pages_scanned:399 all_unreclaimable? yes lowmem_reserve[]: 0 880 2031 Normal free:9808kB min:9820kB low:12272kB high:14728kB active:4kB inactive:108kB present:901120kB pages_scanned:14744 all_unreclaimable? yes lowmem_reserve[]: 0 0 9215 HighMem free:420416kB min:512kB low:640kB high:768kB active:746316kB inactive:5844kB present:1179520kB pages_scanned:0 all_unreclaimable? no lowmem_reserve[]: 0 0 0 DMA: 0*4kB 0*8kB 1*16kB 1*32kB 1*64kB 0*128kB 0*256kB 1*512kB 1*1024kB 1*2048kB 0*4096kB = 3696kB Normal: 0*4kB 0*8kB 1*16kB 0*32kB 1*64kB 0*128kB 0*256kB 1*512kB 1*1024kB 0*2048kB 2*4096kB = 9808kB HighMem: 0*4kB 18*8kB 63*16kB 6*32kB 4*64kB 2*128kB 1*256kB 1*512kB 2*1024kB 21*2048kB 91*4096kB = 420416kB Swap cache: add 0, delete 0, find 0/0, race 0+0 Free swap = 979924kB Total swap = 979924kB Out of Memory: Killed process 13539 (apache). oom-killer: gfp_mask=0xd0 DMA per-cpu: cpu 0 hot: low 2, high 6, batch 1 cpu 0 cold: low 0, high 2, batch 1 cpu 1 hot: low 2, high 6, batch 1 cpu 1 cold: low 0, high 2, batch 1 cpu 2 hot: low 2, high 6, batch 1 cpu 2 cold: low 0, high 2, batch 1 cpu 3 hot: low 2, high 6, batch 1 cpu 3 cold: low 0, high 2, batch 1 Normal per-cpu: cpu 0 hot: low 32, high 96, batch 16 cpu 0 cold: low 0, high 32, batch 16 cpu 1 hot: low 32, high 96, batch 16 cpu 1 cold: low 0, high 32, batch 16 cpu 2 hot: low 32, high 96, batch 16 cpu 2 cold: low 0, high 32, batch 16 cpu 3 hot: low 32, high 96, batch 16 cpu 3 cold: low 0, high 32, batch 16 HighMem per-cpu: cpu 0 hot: low 32, high 96, batch 16 cpu 0 cold: low 0, high 32, batch 16 cpu 1 hot: low 32, high 96, batch 16 cpu 1 cold: low 0, high 32, batch 16 cpu 2 hot: low 32, high 96, batch 16 cpu 2 cold: low 0, high 32, batch 16 cpu 3 hot: low 32, high 96, batch 16 cpu 3 cold: low 0, high 32, batch 16 Free pages: 433920kB (420480kB HighMem) Active:186531 inactive:1495 dirty:38835 writeback:0 unstable:0 free:108480 slab:218665 mapped:4837 pagetables:125 DMA free:3696kB min:176kB low:220kB high:264kB active:0kB inactive:24kB present:16384kB pages_scanned:2265 all_unreclaimable? yes lowmem_reserve[]: 0 880 2031 Normal