Re: [E1000-devel] Problem with Linux 2.6.32 and e1000e
Hi Emil, It appears as if updating to the latest BIOS fixed it (the previous BIOS was almost a couple of years old, there just hadn't been a reason to update it). The events thread is no longer stealing the cpu with any of the e1000e-1.x drivers. Thanks for the suggestion. I have attached the output of ethtool anyway. There were no specific patterns with the issue, it would start as soon I loaded the module and brought up eth0. I still haven't tried with the link connected, and if that happens to cause problems when do I get a chance to test it, I will let you know. Thanks, Aditya On 01/05/2010 05:25 PM, Tantilov, Emil S wrote: Hi Aditya, Thanks for the information. Could you please provide the full dmesg output from boot to the time you see those spikes. I am curious about the error messages you are seeing. Do you know if you are running the latest BIOS? Maybe check for updates. Do you have AMT enabled? Maybe try enabling/disabling it and see if it would make a difference. It is enabled on my system. Also if you could provide the output of ethtool -e eth0 that would be great. I will continue to play with my system and see if I can get a repro. Have you noticed any patterns - for example does it matter if the system is running on battery vs. power. Also you mentioned that you don't have cable connected - do you see the same spikes when cable is present? Thanks, Emil -Original Message- From: Aditya Rajgarhia [mailto:adity...@gmail.com] Sent: Tuesday, January 05, 2010 4:10 PM To: Tantilov, Emil S Cc: Brandeburg, Jesse; e1000-devel@lists.sourceforge.net; Allan, Bruce W Subject: Re: [E1000-devel] Problem with Linux 2.6.32 and e1000e Emil, that is strange. I wonder if mine being a T61p has some different hardware (although the main difference is supposed to be the video card). Please check the output of lspci that I had attached if needed. Since it is proving difficult to determine what in the kernel is causing me the problem, I decided to run the problematic 2.6.30.2 kernel (same configuration that I attached earlier) with different versions of the e1000e driver (from http://sourceforge.net/projects/e1000) and here is what I found (after booting into each a few times): Kernel-2.6.30.2 with e1000e-1.1.2 (the module shipped with the kernel) causes CPU spikes of ~50% as I reported earlier. Kernel-2.6.30.2 with e1000e-1.0.15 causes CPU spikes of ~50%. Kernel-2.6.30.2 with e1000e-1.0.2.5 causes CPU spikes of ~10%. Kernel-2.6.30.2 with e1000e-0.5.18.3 uses CPU spikes of ~5% but also causes bad irq messages while booting, so I am guessing it won't work (as I said, I don't have an ethernet connection available unless necessary to test something). With the e1000e-1.x drivers I see a message :00:19.0: :00:19.0: Hardware Error while booting. The rest of the driver related messages are the same as with the kernel that works. I did use the ethernet even a few days ago with 2.6.30.4 and it worked, so I don't think the hardware is corrupted. Without the e1000e module loaded, there is no CPU spikes caused by the events thread. I am curious to know why there would be any, even negligible, CPU usage caused by an ethernet driver with no link connected. For the time being, I can use e1000e-1.0.2.5 if I need ethernet access, but given that this version may not compile with future kernels, hopefully the issue can be resolved. Perhaps the above numbers will help to locate the cause. Let me know if anything else is needed or if I should try something different Thanks, Aditya On 01/05/2010 02:05 PM, Tantilov, Emil S wrote: Aditya, I installed vanilla 2.6.32.2 kernel using the config file from your first email on my T61 laptop and at least off the bat did not see the issue you are describing. I tested on the docking station and also with just a battery. Restarted the OS couple of times, but the CPU usage is in line (9x% idle). Just an update on where we are atm with respect to this issue. Thanks, Emil -Original Message- From: Aditya Rajgarhia [mailto:adity...@gmail.com] Sent: Monday, January 04, 2010 3:49 PM To: Brandeburg, Jesse Cc: e1000-devel@lists.sourceforge.net; Allan, Bruce W Subject: Re: [E1000-devel] Problem with Linux 2.6.32 and e1000e It's indeed a ThinkPad T61p running Arch Linux 2009.08. I just tried with 2.6.32.2 from kernel.org and the problem is still there. I used make oldconfig with the config file from my 2.6.30.4 kernel (which does not have this problem) so it doesn't seem to be some config option that the Arch Linux kernel enabled that is causing the problem. I should have mentioned that this happens when there is no link/cable present so there is no traffic at all. However, problem is not there with 2.6.30.4 under the same circumstances. I have not tested it with an ethernet connection since I only have wireless access at the moment, but could perhaps test with an ethernet connection if needed (but I guess that will not help). Also
Re: [E1000-devel] Problem with Linux 2.6.32 and e1000e
Emil, that is strange. I wonder if mine being a T61p has some different hardware (although the main difference is supposed to be the video card). Please check the output of lspci that I had attached if needed. Since it is proving difficult to determine what in the kernel is causing me the problem, I decided to run the problematic 2.6.30.2 kernel (same configuration that I attached earlier) with different versions of the e1000e driver (from http://sourceforge.net/projects/e1000) and here is what I found (after booting into each a few times): Kernel-2.6.30.2 with e1000e-1.1.2 (the module shipped with the kernel) causes CPU spikes of ~50% as I reported earlier. Kernel-2.6.30.2 with e1000e-1.0.15 causes CPU spikes of ~50%. Kernel-2.6.30.2 with e1000e-1.0.2.5 causes CPU spikes of ~10%. Kernel-2.6.30.2 with e1000e-0.5.18.3 uses CPU spikes of ~5% but also causes bad irq messages while booting, so I am guessing it won't work (as I said, I don't have an ethernet connection available unless necessary to test something). With the e1000e-1.x drivers I see a message :00:19.0: :00:19.0: Hardware Error while booting. The rest of the driver related messages are the same as with the kernel that works. I did use the ethernet even a few days ago with 2.6.30.4 and it worked, so I don't think the hardware is corrupted. Without the e1000e module loaded, there is no CPU spikes caused by the events thread. I am curious to know why there would be any, even negligible, CPU usage caused by an ethernet driver with no link connected. For the time being, I can use e1000e-1.0.2.5 if I need ethernet access, but given that this version may not compile with future kernels, hopefully the issue can be resolved. Perhaps the above numbers will help to locate the cause. Let me know if anything else is needed or if I should try something different Thanks, Aditya On 01/05/2010 02:05 PM, Tantilov, Emil S wrote: Aditya, I installed vanilla 2.6.32.2 kernel using the config file from your first email on my T61 laptop and at least off the bat did not see the issue you are describing. I tested on the docking station and also with just a battery. Restarted the OS couple of times, but the CPU usage is in line (9x% idle). Just an update on where we are atm with respect to this issue. Thanks, Emil -Original Message- From: Aditya Rajgarhia [mailto:adity...@gmail.com] Sent: Monday, January 04, 2010 3:49 PM To: Brandeburg, Jesse Cc: e1000-devel@lists.sourceforge.net; Allan, Bruce W Subject: Re: [E1000-devel] Problem with Linux 2.6.32 and e1000e It's indeed a ThinkPad T61p running Arch Linux 2009.08. I just tried with 2.6.32.2 from kernel.org and the problem is still there. I used make oldconfig with the config file from my 2.6.30.4 kernel (which does not have this problem) so it doesn't seem to be some config option that the Arch Linux kernel enabled that is causing the problem. I should have mentioned that this happens when there is no link/cable present so there is no traffic at all. However, problem is not there with 2.6.30.4 under the same circumstances. I have not tested it with an ethernet connection since I only have wireless access at the moment, but could perhaps test with an ethernet connection if needed (but I guess that will not help). Also, I do not see any strange output or errors in the system log upon loading or unloading the module. Thanks, Aditya On 01/04/2010 02:39 PM, Brandeburg, Jesse wrote: Also, do you have a continuous stream of receive traffic arriving? The events/0 thread freaking out is typically due to IPv6 setup, but under normal traffic conditions it only takes a few seconds to complete. -Original Message- From: Allan, Bruce W [mailto:bruce.w.al...@intel.com] Sent: Monday, January 04, 2010 12:09 PM To: Aditya Rajgarhia; e1000-devel@lists.sourceforge.net Subject: Re: [E1000-devel] Problem with Linux 2.6.32 and e1000e Apparently this is on a ThinkPad T61, correct? What version of Arch Linux are you running? 2009.08 or other? Have you tried a kernel.org version of the 2.6.32? -Original Message- From: Aditya Rajgarhia [mailto:adity...@gmail.com] Sent: Saturday, January 02, 2010 1:09 PM To: e1000-devel@lists.sourceforge.net Subject: [E1000-devel] Problem with Linux 2.6.32 and e1000e Hi, I upgraded from 2.6.30.4 to 2.6.32.2 and am experiencing problems with e1000e. The thread events/0 uses half the CPU (or all of one core I guess) and the keyboard doesn't work properly (only some of the keys get typed). I have isolated this problem to e1000e by disabling each hardware independently (the issue appears if I load the e1000e module and the eth0 interface, and disappears if I do the opposite). I used to compile my own kernel but this kernel which is giving problems is the stock kernel from Arch Linux with tuxice, so I'm not sure if there is something enabled in this kernel which
Re: [E1000-devel] Problem with Linux 2.6.32 and e1000e
It's indeed a ThinkPad T61p running Arch Linux 2009.08. I just tried with 2.6.32.2 from kernel.org and the problem is still there. I used make oldconfig with the config file from my 2.6.30.4 kernel (which does not have this problem) so it doesn't seem to be some config option that the Arch Linux kernel enabled that is causing the problem. I should have mentioned that this happens when there is no link/cable present so there is no traffic at all. However, problem is not there with 2.6.30.4 under the same circumstances. I have not tested it with an ethernet connection since I only have wireless access at the moment, but could perhaps test with an ethernet connection if needed (but I guess that will not help). Also, I do not see any strange output or errors in the system log upon loading or unloading the module. Thanks, Aditya On 01/04/2010 02:39 PM, Brandeburg, Jesse wrote: Also, do you have a continuous stream of receive traffic arriving? The events/0 thread freaking out is typically due to IPv6 setup, but under normal traffic conditions it only takes a few seconds to complete. -Original Message- From: Allan, Bruce W [mailto:bruce.w.al...@intel.com] Sent: Monday, January 04, 2010 12:09 PM To: Aditya Rajgarhia; e1000-devel@lists.sourceforge.net Subject: Re: [E1000-devel] Problem with Linux 2.6.32 and e1000e Apparently this is on a ThinkPad T61, correct? What version of Arch Linux are you running? 2009.08 or other? Have you tried a kernel.org version of the 2.6.32? -Original Message- From: Aditya Rajgarhia [mailto:adity...@gmail.com] Sent: Saturday, January 02, 2010 1:09 PM To: e1000-devel@lists.sourceforge.net Subject: [E1000-devel] Problem with Linux 2.6.32 and e1000e Hi, I upgraded from 2.6.30.4 to 2.6.32.2 and am experiencing problems with e1000e. The thread events/0 uses half the CPU (or all of one core I guess) and the keyboard doesn't work properly (only some of the keys get typed). I have isolated this problem to e1000e by disabling each hardware independently (the issue appears if I load the e1000e module and the eth0 interface, and disappears if I do the opposite). I used to compile my own kernel but this kernel which is giving problems is the stock kernel from Arch Linux with tuxice, so I'm not sure if there is something enabled in this kernel which causes the issue. Anyway, it would be best if we can resolve it. Attached is the .config, and the outputs of lspci -vvv and /proc/interrupts. Please let me know if you need any other information. Thanks, Aditya -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ E1000-devel mailing list E1000-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/e1000-devel -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ E1000-devel mailing list E1000-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/e1000-devel
Re: [E1000-devel] e1000e not working on 2.6.29.1
Jesse, Before I got your reply, I had recompiled my kernel a couple of times in the hope of getting wireless to work at least. However, I also removed some other stuff from the kernel that I didn't need, and now e1000e works and I don't see anything unusual in dmesg. Unfortunately, I can't pinpoint which feature's addition/removal caused it to work. I mostly changed features for getting the iwl3945 driver to run. Looking at the kernel config again, the only thing I can think of that I may have removed is CONFIG_X86_REROUTE_FOR_BROKEN_BOOT_IRQS but again, there may have been something else. Apologies for the confusion, and perhaps I'll try again with the above option and see if it reproduces the problem. Thanks, Aditya Brandeburg, Jesse wrote: Aditya Rajgarhia wrote: Hi, I updated to kernel 2.6.29.1 (from 2.6.22) yesterday and my Intel 82566MM fails to work now (eth0 is not recognized). First, I noticed that the new kernel uses e1000e for this controller rather than e1000, but I get the following in dmesg: e1000e: Intel(R) PRO/1000 Network Driver - 0.3.3.3-k6 e1000e: Copyright (c) 1999-2008 Intel Corporation. e1000e :00:19.0: PCI INT A - GSI 20 (level, low) - IRQ 20 e1000e :00:19.0: setting latency timer to 64 :00:19.0: :00:19.0: Failed to initialize MSI interrupts. Falling back to legacy interrupts. :00:19.0: :00:19.0: Invalid MAC Address: 00:00:00:00:00:00 e1000e :00:19.0: PCI INT A disabled e1000e: probe of :00:19.0 failed with error -5 please send your .config as config.txt and full output of lspci -vvv as lspci.txt when the driver has failed to load. I searched for this issue, and came up with similar problems with e1000e but they involved checksum errors etc. I couldn't find any solution for this seems like a new issue. the above. I tried with CONFIG_PCI_MSI enabled but then I get the following messages and eth0 is still not recognized. e1000e: Unknown symbol pci_disable_msi e1000e: Unknown symbol pci_enable_msi e1000e: Unknown symbol pci_enable_msix e1000e: Unknown symbol pci_disable_msi that's just a compile error, you have to rebuild both kernel and driver with make -j2 all; make modules_install install snip Note that I can boot into the old kernel (2.6.22) and the ethernet works fine as before. Please let me know if there is a fix for this, or if you need more information. in the old kernel please do ethtool -e eth0 and get the dump of the eeprom and send it to us as ethtool.txt -- Stay on top of everything new and different, both inside and around Java (TM) technology - register by April 22, and save $200 on the JavaOne (SM) conference, June 2-5, 2009, San Francisco. 300 plus technical and hands-on sessions. Register today. Use priority code J9JMT32. http://p.sf.net/sfu/p ___ E1000-devel mailing list E1000-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/e1000-devel
[E1000-devel] e1000e not working on 2.6.29.1
Hi, I updated to kernel 2.6.29.1 (from 2.6.22) yesterday and my Intel 82566MM fails to work now (eth0 is not recognized). First, I noticed that the new kernel uses e1000e for this controller rather than e1000, but I get the following in dmesg: e1000e: Intel(R) PRO/1000 Network Driver - 0.3.3.3-k6 e1000e: Copyright (c) 1999-2008 Intel Corporation. e1000e :00:19.0: PCI INT A - GSI 20 (level, low) - IRQ 20 e1000e :00:19.0: setting latency timer to 64 :00:19.0: :00:19.0: Failed to initialize MSI interrupts. Falling back to legacy interrupts. :00:19.0: :00:19.0: Invalid MAC Address: 00:00:00:00:00:00 e1000e :00:19.0: PCI INT A disabled e1000e: probe of :00:19.0 failed with error -5 I searched for this issue, and came up with similar problems with e1000e but they involved checksum errors etc. I couldn't find any solution for the above. I tried with CONFIG_PCI_MSI enabled but then I get the following messages and eth0 is still not recognized. e1000e: Unknown symbol pci_disable_msi e1000e: Unknown symbol pci_enable_msi e1000e: Unknown symbol pci_enable_msix e1000e: Unknown symbol pci_disable_msi I'm using a Thinkpad T61p, and following is the ouput from lspci: 00:00.0 Host bridge: Intel Corporation Mobile Memory Controller Hub (rev 0c) 00:01.0 PCI bridge: Intel Corporation Mobile PCI Express Root Port (rev 0c) 00:19.0 Ethernet controller: Intel Corporation 82566MM Gigabit Network Connection (rev 03) 00:1a.0 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Contoller #4 (rev 03) 00:1a.1 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #5 (rev 03) 00:1a.7 USB Controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #2 (rev 03) 00:1b.0 Audio device: Intel Corporation 82801H (ICH8 Family) HD Audio Controller (rev 03) 00:1c.0 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 1 (rev 03) 00:1c.1 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 2 (rev 03) 00:1c.2 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 3 (rev 03) 00:1c.3 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 4 (rev 03) 00:1c.4 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 5 (rev 03) 00:1d.0 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #1 (rev 03) 00:1d.1 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #2 (rev 03) 00:1d.2 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #3 (rev 03) 00:1d.7 USB Controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #1 (rev 03) 00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev f3) 00:1f.0 ISA bridge: Intel Corporation Mobile LPC Interface Controller (rev 03) 00:1f.1 IDE interface: Intel Corporation Mobile IDE Controller (rev 03) 00:1f.2 SATA controller: Intel Corporation Mobile SATA AHCI Controller (rev 03) 00:1f.3 SMBus: Intel Corporation 82801H (ICH8 Family) SMBus Controller (rev 03) 01:00.0 VGA compatible controller: nVidia Corporation Unknown device 040c (rev a1) 03:00.0 Network controller: Intel Corporation PRO/Wireless 3945ABG Network Connection (rev 02) 05:00.0 Mass storage controller: Silicon Image, Inc. SiI 3132 Serial ATA Raid II Controller (rev 01) 15:00.0 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev b6) Note that I can boot into the old kernel (2.6.22) and the ethernet works fine as before. Please let me know if there is a fix for this, or if you need more information. Thanks, Aditya -- Stay on top of everything new and different, both inside and around Java (TM) technology - register by April 22, and save $200 on the JavaOne (SM) conference, June 2-5, 2009, San Francisco. 300 plus technical and hands-on sessions. Register today. Use priority code J9JMT32. http://p.sf.net/sfu/p ___ E1000-devel mailing list E1000-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/e1000-devel