Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-15 Thread Peter Stuge
Larry Finger wrote: merely triggered by some interaction with ACPI and/or the BIOS. From what I found in looking back through the DMA error reports, most (if not all) people with the problem have netbook computers with Intel ATOM processors. Gábor Stefanik wrote: Linus has also reported this

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-14 Thread Rafał Miłecki
2009/11/14 Matthew Garrett mj...@srcf.ucam.org: On Sat, Nov 14, 2009 at 12:41:49AM +, Andrew Benton wrote: And it seems to be working well. No errors so far. I've just downloaded a kernel, browsed slashdot a bit. I'll test it some more tomorrow but this is a BIG step in the right

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-14 Thread Andrew Benton
On 14/11/09 07:29, Matthew Garrett wrote: I should emphasise that this patch works by effectively disabling deep C states on your CPU, which in turn will increase your power consumption. It's very much either a workaround for broken hardware or something that covers up a more subtle bug

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-14 Thread Chris Vine
On Sat, 14 Nov 2009 09:52:15 + Andrew Benton b3n...@gmail.com wrote: It was working fine last night but I can't get that kernel to connect today. Nothing has changed, I can see no reason why it was working and isn't working now. It feels like a hardware problem. I'm recompiling with a

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-14 Thread Andrew Benton
On 14/11/09 11:24, Chris Vine wrote: Be aware that if you have been using the proprietary wl driver to send on your bug reports, you must do a cold boot before testing b43, as if you warm boot after having initialised the wireless device with the wl driver then the DMA bug disappears. That

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-14 Thread William Bourque
Larry Finger wrote: On 11/13/2009 06:15 PM, William Bourque wrote: Larry Finger wrote: Based on a suggestion by Matthew Garrett, please try the patch below. Thanks, Larry = Index: wireless-testing/drivers/net/wireless/b43/main.c

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-14 Thread Larry Finger
On 11/14/2009 12:51 PM, William Bourque wrote: Ok, I tried with value of 150, 100 already and I'm recompiling to see the result with 175 and 125. 150us seems to give me the best result. As with 200 I could use the wireless for several minutes and insert/remove the module. However it

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Michael Buesch
On Friday 13 November 2009 01:02:44 Larry Finger wrote: On 11/12/2009 05:57 PM, Michael Buesch wrote: On Friday 13 November 2009 00:23:59 Larry Finger wrote: No, then was a 14e4:4311. I have now installed that same card and it seems to be working without the workaround. When I had that

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Andrew Benton
-phy0: Loading firmware version 410.2160 (2007-05-26 15:32:10) Nov 13 12:14:14 doughnut kernel: b43-phy0: Controller restarted Nov 13 12:14:14 doughnut kernel: b43-phy0 ERROR: Fatal DMA error: 0x0400, 0x, 0x, 0x, 0x, 0x Nov 13 12:14:14 doughnut kernel: b43

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Andrew Benton
Nov 13 13:21:28 doughnut kernel: b43-phy0 ERROR: Fatal DMA error: 0x0400, 0x, 0x, 0x, 0x, 0x Nov 13 13:21:28 doughnut kernel: b43-phy0: Controller RESET (DMA error) ... Nov 13 13:21:28 doughnut kernel: b43-phy0: Loading firmware version 410.2160 (2007

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Larry Finger
On 11/13/2009 05:16 AM, Michael Buesch wrote: Ok, so my guess is that the DMA allocator simply returned high memory that was unusable to the device. My new code explicitly checks for that (and a few other things) and retries with GFP_DMA in case the address has illegal bits set. That's the

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Michael Buesch
On Friday 13 November 2009 17:05:30 Larry Finger wrote: (3) When a DMA descriptor error occurs, a dump of the descriptors does not reveal any obvious problems. I was going to write a patch that dumps the whole affected ring. But I think we don't see something suspicious there, either. So I

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Gábor Stefanik
On Fri, Nov 13, 2009 at 5:05 PM, Larry Finger larry.fin...@lwfinger.net wrote: On 11/13/2009 05:16 AM, Michael Buesch wrote: Ok, so my guess is that the DMA allocator simply returned high memory that was unusable to the device. My new code explicitly checks for that (and a few other things)

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Michael Buesch
Please test the following patch. It changes more stuff related to the descriptor ring handling (remove the old patch first before applying this one). http://bu3sch.de/patches/wireless-testing/20091113-1834/patches/001-b43-rewrite-dma-ring-alloc.patch -- Greetings, Michael.

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Andrew Benton
On 13/11/09 17:36, Michael Buesch wrote: Please test the following patch. It changes more stuff related to the descriptor ring handling (remove the old patch first before applying this one). http://bu3sch.de/patches/wireless-testing/20091113-1834/patches/001-b43-rewrite-dma-ring-alloc.patch

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread William Bourque
[ 134.520799] b43-phy0 debug: QoS enabled [ 134.563410] b43-phy0 debug: Wireless interface started [ 134.563424] b43-phy0: Controller restarted [ 134.583034] b43-phy0 ERROR: Fatal DMA error: 0x0400, 0x, 0x, 0x, 0x, 0x #2 : [7.834359] b43-phy0: Broadcom

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Larry Finger
On 11/13/2009 11:36 AM, Michael Buesch wrote: Please test the following patch. It changes more stuff related to the descriptor ring handling (remove the old patch first before applying this one).

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Larry Finger
Based on a suggestion by Matthew Garrett, please try the patch below. Thanks, Larry = Index: wireless-testing/drivers/net/wireless/b43/main.c === --- wireless-testing.orig/drivers/net/wireless/b43/main.c +++

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread William Bourque
Larry Finger wrote: Based on a suggestion by Matthew Garrett, please try the patch below. Thanks, Larry = Index: wireless-testing/drivers/net/wireless/b43/main.c === ---

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Andrew Benton
On 13/11/09 21:38, Larry Finger wrote: Based on a suggestion by Matthew Garrett, please try the patch below. I've only been using it for a few minutes but this looks very good. I compile the kernel with lots of ACPI CONFIG_ACPI=y CONFIG_ACPI_SYSFS_POWER=y CONFIG_ACPI_AC=y CONFIG_ACPI_BATTERY=y

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Larry Finger
On 11/13/2009 06:15 PM, William Bourque wrote: Larry Finger wrote: Based on a suggestion by Matthew Garrett, please try the patch below. Thanks, Larry = Index: wireless-testing/drivers/net/wireless/b43/main.c

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-13 Thread Matthew Garrett
On Sat, Nov 14, 2009 at 12:41:49AM +, Andrew Benton wrote: And it seems to be working well. No errors so far. I've just downloaded a kernel, browsed slashdot a bit. I'll test it some more tomorrow but this is a BIG step in the right direction. This is the first kernel that's worked for

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
On Thursday 12 November 2009 12:42:47 Andrew Benton wrote: On 11/11/09 19:12, Larry Finger wrote: Such an error in SLUB handling could be arising from a DMA problem in b43, but it could also arise from a memory error. Please run memtest86+ for an extended period so that a hardware

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Andrew Benton
On 11/11/09 19:12, Larry Finger wrote: Such an error in SLUB handling could be arising from a DMA problem in b43, but it could also arise from a memory error. Please run memtest86+ for an extended period so that a hardware error can be ruled out. A 24 hour run would be good. If that is not

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Andrew Benton
On 12/11/09 00:37, Larry Finger wrote: Andy, Please try the patch below to see what we can learn from the DMA descriptor errors. Some of this code is temporary, but there are also some statements that will probably become permanent. Please post any messages that result. The patch failed

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
From: Larry Finger larry.fin...@lwfinger.net User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.4pre) Gecko/20090915 SUSE/3.0b4-3.6 Thunderbird/3.0b4 MIME-Version: 1.0 To: Andrew Benton b3n...@gmail.com Subject: Re: b43-phy0 ERROR: Fatal DMA error: 0x0400 References

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Larry Finger
On 11/12/2009 06:16 AM, Andrew Benton wrote: On 12/11/09 00:37, Larry Finger wrote: Andy, Please try the patch below to see what we can learn from the DMA descriptor errors. Some of this code is temporary, but there are also some statements that will probably become permanent. Please

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Andrew Benton
On 12/11/09 15:50, Larry Finger wrote: Sorry about the kernel mismatch. I developed that patch while offline and waiting at an auto repair place and forgot to refresh my sources before sending it. The revised version that Michael sent should work. If not, please let me know and I will send

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
On Thursday 12 November 2009 17:42:07 Andrew Benton wrote: On 12/11/09 15:50, Larry Finger wrote: Sorry about the kernel mismatch. I developed that patch while offline and waiting at an auto repair place and forgot to refresh my sources before sending it. The revised version that Michael

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Larry Finger
On 11/12/2009 10:53 AM, Michael Buesch wrote: On Thursday 12 November 2009 17:42:07 Andrew Benton wrote: On 12/11/09 15:50, Larry Finger wrote: Sorry about the kernel mismatch. I developed that patch while offline and waiting at an auto repair place and forgot to refresh my sources before

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Andrew Benton
(2007-05-26 15:32:10) Nov 12 18:40:49 doughnut kernel: b43-phy0: Controller restarted Nov 12 18:40:49 doughnut kernel: b43-phy0 ERROR: Fatal DMA error: 0x0400, 0x, 0x, 0x, 0x, 0x Nov 12 18:40:49 doughnut kernel: b43-phy0: Controller RESET (DMA error

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
On Thursday 12 November 2009 19:45:45 Andrew Benton wrote: On 12/11/09 17:14, Larry Finger wrote: I guess I'm a failure at writing diagnostic patches. Until there is a DMA error, the only effect of the patch is to add a little extra time to the routine that fills in the descriptor

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Larry Finger
On 11/12/2009 12:45 PM, Andrew Benton wrote: On 12/11/09 17:14, Larry Finger wrote: I guess I'm a failure at writing diagnostic patches. Until there is a DMA error, the only effect of the patch is to add a little extra time to the routine that fills in the descriptor structure, and it adds

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread William Bourque
Michael Buesch wrote: On Thursday 12 November 2009 19:45:45 Andrew Benton wrote: On 12/11/09 17:14, Larry Finger wrote: I guess I'm a failure at writing diagnostic patches. Until there is a DMA error, the only effect of the patch is to add a little extra time to the routine that fills in

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
On Thursday 12 November 2009 20:33:54 Larry Finger wrote: Nov 12 18:40:43 doughnut kernel: b43: Descr. 0: 0x0 0x930 0x364BD020 0x8000 Nov 12 18:40:43 doughnut kernel: b43: Descr. 1: 0x0 0x930 0x364BF020 0x8000 Nov 12 18:40:43 doughnut kernel: b43: Descr. 2: 0x0 0x930

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
On Thursday 12 November 2009 20:32:32 William Bourque wrote: Sorry for the late reply... I seem to have the exact same bug here. Do you need more people to run the diagnostic patch? Well, it doesn't hurt. -- Greetings, Michael. ___ Bcm43xx-dev

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Larry Finger
On 11/12/2009 01:48 PM, Michael Buesch wrote: Now we have some progress. You will note the difference in the control words (first 2 columns) for descriptors 8 9. They are wrong. What do you think is wrong here? I think the control words are OK. At the point where I captured them, I didn't

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
On Thursday 12 November 2009 21:10:59 Larry Finger wrote: Do the address_low values for 8 and 9 look right? They should be aligned on a 4K boundary. Is this really a requirement? I think the 4k alignment is only required for the descriptor memory. We never guaranteed any alignment for the skbs.

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
On Thursday 12 November 2009 20:33:54 Larry Finger wrote: While Michael is coming up with a test patch, Here you go: http://bu3sch.de/patches/wireless-testing/20091112-2213/patches/001-b43-rewrite-dma-ring-alloc.patch Please test this patch (also on 64bit-DMA devices that currently work). It

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread William Bourque
] b43-phy0 debug: 64-bit DMA initialized [ 109.213169] b43-phy0 debug: QoS enabled [ 109.253562] b43-phy0 debug: Wireless interface started [ 109.272964] b43-phy0 ERROR: Fatal DMA error: 0x0400, 0x, 0x, 0x, 0x, 0x [ 109.272990] b43-phy0: Controller

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
On Thursday 12 November 2009 22:34:00 William Bourque wrote: Michael Buesch wrote: On Thursday 12 November 2009 20:32:32 William Bourque wrote: Sorry for the late reply... I seem to have the exact same bug here. Do you need more people to run the diagnostic patch? Well, it doesn't

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread William Bourque
debug: 64-bit DMA initialized [ 181.210719] b43-phy0 debug: QoS enabled [ 181.251102] b43-phy0 debug: Wireless interface started [ 181.251113] b43-phy0 debug: Adding Interface type 2 [ 181.270285] b43-phy0 ERROR: Fatal DMA error: 0x0400, 0x, 0x, 0x, 0x

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
On Thursday 12 November 2009 22:59:26 William Bourque wrote: Michael Buesch wrote: On Thursday 12 November 2009 22:34:00 William Bourque wrote: Michael Buesch wrote: On Thursday 12 November 2009 20:32:32 William Bourque wrote: Sorry for the late reply... I seem to have the exact same bug

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Larry Finger
On 11/12/2009 03:16 PM, Michael Buesch wrote: On Thursday 12 November 2009 20:33:54 Larry Finger wrote: While Michael is coming up with a test patch, Here you go: http://bu3sch.de/patches/wireless-testing/20091112-2213/patches/001-b43-rewrite-dma-ring-alloc.patch Please test this patch

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
On Friday 13 November 2009 00:04:50 Larry Finger wrote: On 11/12/2009 03:16 PM, Michael Buesch wrote: On Thursday 12 November 2009 20:33:54 Larry Finger wrote: While Michael is coming up with a test patch, Here you go:

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Larry Finger
On 11/12/2009 05:08 PM, Michael Buesch wrote: On Friday 13 November 2009 00:04:50 Larry Finger wrote: On 11/12/2009 03:16 PM, Michael Buesch wrote: On Thursday 12 November 2009 20:33:54 Larry Finger wrote: While Michael is coming up with a test patch, Here you go:

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Michael Buesch
On Friday 13 November 2009 00:23:59 Larry Finger wrote: No, then was a 14e4:4311. I have now installed that same card and it seems to be working without the workaround. When I had that problem, I had a different laptop than I do now, thus it is not possible to reduplicate the setup. I am

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-12 Thread Larry Finger
On 11/12/2009 05:57 PM, Michael Buesch wrote: On Friday 13 November 2009 00:23:59 Larry Finger wrote: No, then was a 14e4:4311. I have now installed that same card and it seems to be working without the workaround. When I had that problem, I had a different laptop than I do now, thus it is

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-11 Thread Andrew Benton
On 11/11/09 02:58, Larry Finger wrote: On 11/10/2009 06:48 PM, Andrew Benton wrote: Nov 11 00:04:37 doughnut kernel: b43-phy0 ERROR: Fatal DMA error: 0x0400, 0x, 0x, 0x, 0x, 0x Nov 11 00:04:37 doughnut kernel: b43-phy0: Controller RESET (DMA error

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-11 Thread Andrew Benton
I recompiled my kernel today (current wireless-testing) and disabled ACPI entirely. It worked fine for 10 minutes and then the internet connection died and left this in /var/log/sys.log Nov 11 14:31:31 doughnut ntpd[398]: kernel time sync status change 2001 Nov 11 14:36:57 doughnut ntpd[398]:

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-11 Thread Larry Finger
On 11/11/2009 08:52 AM, Andrew Benton wrote: I recompiled my kernel today (current wireless-testing) and disabled ACPI entirely. It worked fine for 10 minutes and then the internet connection died and left this in /var/log/sys.log Nov 11 14:31:31 doughnut ntpd[398]: kernel time sync

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-11 Thread Michael Buesch
On Wednesday 11 November 2009 20:12:59 Larry Finger wrote: On 11/11/2009 08:52 AM, Andrew Benton wrote: I recompiled my kernel today (current wireless-testing) and disabled ACPI entirely. It worked fine for 10 minutes and then the internet connection died and left this in

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-11 Thread Larry Finger
Andy, Please try the patch below to see what we can learn from the DMA descriptor errors. Some of this code is temporary, but there are also some statements that will probably become permanent. Please post any messages that result. Larry Index: wireless-testing/drivers/net/wireless/b43/dma.c

b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-10 Thread Andrew Benton
: Controller RESET (DMA error) ... Nov 11 00:04:31 doughnut kernel: b43-phy0: Loading firmware version 410.2160 (2007-05-26 15:32:10) Nov 11 00:04:37 doughnut kernel: b43-phy0: Controller restarted Nov 11 00:04:37 doughnut kernel: b43-phy0 ERROR: Fatal DMA error: 0x0400, 0x, 0x

Re: b43-phy0 ERROR: Fatal DMA error: 0x00000400

2009-11-10 Thread Larry Finger
-phy0 ERROR: Fatal DMA error: 0x0400, 0x, 0x, 0x, 0x, 0x Nov 11 00:04:37 doughnut kernel: b43-phy0: Controller RESET (DMA error) ... The 0x0800 indicates a descriptor problem. Why it should happen after 15 minutes is perplexing. If I write a diagnostic