Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2007-03-22 Thread Dan Halbert
I'd like to mention what might be a new twist on this problem. We are seeing the same kind of 4k-block data corruption on multiple Tyan dual-Opteron boards (S3870) with a ServerWorks chipset, not Nvidia. I wonder if it really an Nvidia-specific issue. The Nvidia boards are a lot more popular, s

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2007-03-22 Thread Christoph Anton Mitterer
Hi folks. 1) Are there any new developments in this issue? Does someone know if AMD and Nvidia is still investigating? 2) Steve Langasek from Debian sent me a patch that disables the hw-iommu per default on Nvidia boards. I've attached it in the kernel bugzilla and asked for inclusion in the kern

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-03-12 Thread Jeff Garzik
Andi Kleen wrote: in Linux. Apparently in some cases sata_nv does DMA on an already freed and then reused mapping. Any data or additional info on that? Did you discover this by tracking the DMA API software routines, or something lower level (like a bus analyzer)? libata handles all the DM

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-03-12 Thread Andi Kleen
> Andi, have you had a look at this? I'm a bit surprised at the lack of > reaction to this find.. FYI the problem is still being analysed behind the scenes. Chip's patch didn't fix it in all cases unfortunately -- it just changed the timing enough to make it happen less often. The latest evi

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-03-04 Thread Robert Hancock
Chip Coldwell wrote: On Wed, 17 Jan 2007, Andi Kleen wrote: On Wednesday 17 January 2007 07:31, Chris Wedgwood wrote: On Tue, Jan 16, 2007 at 08:52:32PM +0100, Christoph Anton Mitterer wrote: I agree,... it seems drastic, but this is the only really secure solution. I'd like to here from And

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-02-21 Thread Chip Coldwell
On Wed, 17 Jan 2007, Andi Kleen wrote: > On Wednesday 17 January 2007 07:31, Chris Wedgwood wrote: > > On Tue, Jan 16, 2007 at 08:52:32PM +0100, Christoph Anton Mitterer wrote: > > > I agree,... it seems drastic, but this is the only really secure > > > solution. > > > > I'd like to here from Andi

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-18 Thread Andi Kleen
On Thursday 18 January 2007 22:00, Erik Andersen wrote: > I just tried again and while using iommu=soft does avoid the > corruption problem, as with previous kernels with 2.6.20-rc5 > using iommu=soft still makes my pcHDTV HD5500 DVB cards not work. This must be some separate bug and needs to be

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-18 Thread Andi Kleen
On Friday 19 January 2007 08:57, Chip Coldwell wrote: > But it still might be a reasonable thing to do to test the theory that > the problem is cache coherency across the graphics aperture, even if > it isn't a long-term solution for the problem. I suspect it would disturb timing so badly that it

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-18 Thread Chip Coldwell
On Thu, 18 Jan 2007, Andi Kleen wrote: The Northbridge guarantees coherency over the aperture, but only if the caching attributes match. That's interesting. Makes sense, I suppose. You would need to change_page_attr() every kernel address that is mapped into the IOMMU to use an uncached ap

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-18 Thread Chris Wedgwood
On Thu, Jan 18, 2007 at 10:29:14AM +0100, joachim wrote: > Not only has it only been on Nvidia chipsets but we have only seen > reports on the Nvidia CK804 SATA controller. People have reported problems with other controllers. I have one here I can test given a day or so. I don't think it's SAT

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-18 Thread Chris Wedgwood
On Thu, Jan 18, 2007 at 04:00:28AM -0700, Erik Andersen wrote: > I just tried again and while using iommu=soft does avoid the > corruption problem, as with previous kernels with 2.6.20-rc5 using > iommu=soft still makes my pcHDTV HD5500 DVB cards not work. i would file a separate bug about that,

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-18 Thread Christoph Anton Mitterer
Erik Andersen wrote: > I just tried again and while using iommu=soft does avoid the > corruption problem, as with previous kernels with 2.6.20-rc5 > using iommu=soft still makes my pcHDTV HD5500 DVB cards not work. > I still have to disable memhole and lose 1 GB. :-( Please add this to the bugrep

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-18 Thread Christoph Anton Mitterer
joachim wrote: > Not only has it only been on Nvidia chipsets but we have only seen > reports on the Nvidia CK804 SATA controller. Please write in or add > yourself to the bugzilla entry [1] and tell us which hardware you have > if you get 4kB pagesize corruption and it goes away with "iommu=soft"

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-18 Thread Erik Andersen
On Wed Jan 17, 2007 at 08:29:53AM +1100, Andi Kleen wrote: > AMD is looking at the issue. Only Nvidia chipsets seem to be affected, > although there were similar problems on VIA in the past too. > Unless a good workaround comes around soon I'll probably default > to iommu=soft on Nvidia. I just tr

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-18 Thread joachim
Andi Kleen <[EMAIL PROTECTED]> wrote on 22:29 16/01/2007 +0100 : > AMD is looking at the issue. Only Nvidia chipsets seem to be affected, > although there were similar problems on VIA in the past too. > Unless a good workaround comes around soon I'll probably default > to iommu=soft on Nvidia. > >

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-17 Thread Andi Kleen
> We've just verified that configuring the graphics aperture to be > write-combining instead of write-back using an MTRR also solves the > problem. It appears to be a cache incoherency issue in the graphics > aperture. Interesting. Unfortunately it is also not correct. It was intentional to m

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-17 Thread Chip Coldwell
On Wed, 17 Jan 2007, Chip Coldwell wrote: On Wed, 17 Jan 2007, Andi Kleen wrote: On Wednesday 17 January 2007 07:31, Chris Wedgwood wrote: On Tue, Jan 16, 2007 at 08:52:32PM +0100, Christoph Anton Mitterer wrote: I agree,... it seems drastic, but this is the only really secure solution. I'

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-17 Thread Chip Coldwell
On Wed, 17 Jan 2007, Andi Kleen wrote: On Wednesday 17 January 2007 07:31, Chris Wedgwood wrote: On Tue, Jan 16, 2007 at 08:52:32PM +0100, Christoph Anton Mitterer wrote: I agree,... it seems drastic, but this is the only really secure solution. I'd like to here from Andi how he feels about

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-16 Thread Christoph Anton Mitterer
Andi Kleen wrote: > AMD is looking at the issue. Only Nvidia chipsets seem to be affected, > although there were similar problems on VIA in the past too. > Unless a good workaround comes around soon I'll probably default > to iommu=soft on Nvidia. I've just read the posts about AMDs and NVIDIAs eff

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-16 Thread Christoph Anton Mitterer
Chris Wedgwood wrote: > I'd like to here from Andi how he feels about this? It seems like a > somewhat drastic solution in some ways given a lot of hardware doesn't > seem to be affected (or maybe in those cases it's just really hard to > hit, I don't know). > Yes this might be true,.. those wh

RE: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-16 Thread Allen Martin
> I'd like to here from Andi how he feels about this? It seems like a > somewhat drastic solution in some ways given a lot of hardware doesn't > seem to be affected (or maybe in those cases it's just really hard to > hit, I don't know). > > > Well we can hope that Nvidia will find out more (thoug

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-16 Thread Andi Kleen
On Wednesday 17 January 2007 07:31, Chris Wedgwood wrote: > On Tue, Jan 16, 2007 at 08:52:32PM +0100, Christoph Anton Mitterer wrote: > > I agree,... it seems drastic, but this is the only really secure > > solution. > > I'd like to here from Andi how he feels about this? It seems like a > somewha

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-16 Thread Chris Wedgwood
On Tue, Jan 16, 2007 at 09:31:31PM +0100, Krzysztof Halasa wrote: > Do you (someone) have (maintain) a list of affected systems, > including motherboard type and possibly version, BIOS version and > CPU type? A similar list of unaffected systems with 4GB+ RAM could > be useful, too. All I know is

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-16 Thread Chris Wedgwood
On Tue, Jan 16, 2007 at 08:52:32PM +0100, Christoph Anton Mitterer wrote: > I agree,... it seems drastic, but this is the only really secure > solution. I'd like to here from Andi how he feels about this? It seems like a somewhat drastic solution in some ways given a lot of hardware doesn't seem

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-16 Thread Krzysztof Halasa
Chris Wedgwood <[EMAIL PROTECTED]> writes: > right now i'm thinking if we can't figure out which cpu/bios > combinations are safe we might almost be better off doing iommu=soft > for *all* k8 stuff except for those that are whitelisted; though this > seems extremely drastic Do you (someone) have

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-16 Thread Christoph Anton Mitterer
Arkadiusz Miskiewicz wrote: > FYI it seems that I was also hit by this bug with qlogic fc card + adaptec > taro raid controller on Thunder K8SRE S2891 mainboard with nvidia chipset on > it. > > http://groups.google.com/group/fa.linux.kernel/browse_thread/thread/b8bdbde9721f7d35/45701994c95fe2cf?l

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-16 Thread Arkadiusz Miskiewicz
On Tuesday 16 January 2007 19:01, Chris Wedgwood wrote: > On Tue, Jan 16, 2007 at 08:26:05AM -0600, Robert Hancock wrote: > > >If one use iommu=soft the sata_nv will continue to use the new code > > >for the ADMA, right? > > > > Right, that shouldn't affect it. > > right now i'm thinking if we can'

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-16 Thread Christoph Anton Mitterer
Chris Wedgwood wrote: > right now i'm thinking if we can't figure out which cpu/bios > combinations are safe we might almost be better off doing iommu=soft > for *all* k8 stuff except for those that are whitelisted; though this > seems extremely drastic > I agree,... it seems drastic, but this i

Re: data corruption with nvidia chipsets and IDE/SATA drives (k8 cpu errata needed?)

2007-01-16 Thread Chris Wedgwood
On Tue, Jan 16, 2007 at 08:26:05AM -0600, Robert Hancock wrote: > >If one use iommu=soft the sata_nv will continue to use the new code > >for the ADMA, right? > > Right, that shouldn't affect it. right now i'm thinking if we can't figure out which cpu/bios combinations are safe we might almost be

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2007-01-16 Thread Robert Hancock
Christoph Anton Mitterer wrote: Ok,.. that sounds reasonable,.. so the whole thing might (!) actually be a hardware design error,... but we just don't use that hardware any longer when accessing devices via sata_nv. So this doesn't solve our problem with PATA drives or other devices (although we

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2007-01-16 Thread Christoph Anton Mitterer
Robert Hancock wrote: >> What is that GART thing exactly? Is this the hardware IOMMU? I've always >> thought GART was something graphics card related,.. but if so,.. how >> could this solve our problem (that seems to occur mainly on harddisks)? >> > The GART built into the Athlon 64/Opteron CP

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2007-01-15 Thread Robert Hancock
Christoph Anton Mitterer wrote: Sorry, as always I've forgot some things... *g* Robert Hancock wrote: If this is related to some problem with using the GART IOMMU with memory hole remapping enabled What is that GART thing exactly? Is this the hardware IOMMU? I've always thought GART was some

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2007-01-15 Thread Christoph Anton Mitterer
Sorry, as always I've forgot some things... *g* Robert Hancock wrote: > If this is related to some problem with using the GART IOMMU with memory > hole remapping enabled What is that GART thing exactly? Is this the hardware IOMMU? I've always thought GART was something graphics card related,..

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2007-01-15 Thread Christoph Anton Mitterer
Hi everybody. Sorry again for my late reply... Robert gave us the following interesting information some days ago: Robert Hancock wrote: > If this is related to some problem with using the GART IOMMU with memory > hole remapping enabled, then 2.6.20-rc kernels may avoid this problem on > nForc

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2007-01-15 Thread Christoph Anton Mitterer
Hi. Some days ago I received the following message from "Sunny Days". I think he did not send it lkml so I forward it now: Sunny Days wrote: > hello, > > i have done some extensive testing on this. > > various opterons, always single socket > various dimms 1 and 2gb modules > and hitachi+seagate

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2007-01-04 Thread Christoph Anton Mitterer
Hi. Just for you information: I've put the issue into the kernel.org bugzilla. http://bugzilla.kernel.org/show_bug.cgi?id=7768 Chris. begin:vcard fn:Mitterer, Christoph Anton n:Mitterer;Christoph Anton email;internet:[EMAIL PROTECTED] x-mozilla-html:TRUE version:2.1 end:vcard

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2007-01-03 Thread Robert Hancock
Christoph Anton Mitterer wrote: Hi. Perhaps some of you have read my older two threads: http://marc.theaimsgroup.com/?t=11631244001&r=1&w=2 and the even older http://marc.theaimsgroup.com/?t=11629131451&r=1&w=2 The issue was basically the following: I found a severe bug mainly by fortun

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2007-01-03 Thread Christoph Anton Mitterer
Hi everybody. After my last mails to this issue (btw: anything new in the meantime? I received no replys..) I wrote again to nvidia and AMD... This time with some more success. Below is the answer from Mr. Friedman to my mail. He says that he wasn't able to reproduce the problem and asks for a te

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-22 Thread Christoph Anton Mitterer
John A Chaves wrote: > I didn't need to run a specific test for this. The normal workload of the > machine approximates a continuous selftest for almost the last year. > > Large files (4-12GB is typical) are being continuously packed and unpacked > with gzip and bzip2. Statistical analysis of the

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-22 Thread John A Chaves
On Friday 22 December 2006 20:04, Christoph Anton Mitterer wrote: > This brings me to: > Chris Wedgwood wrote: > > Does anyone have an amd64 with an nforce4 chipset and >4GB that does > > NOT have this problem? If so it might be worth chasing the BIOS > > vendors to see what errata they are dealing

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-22 Thread Christoph Anton Mitterer
Hi my friends It became a little bit silent about this issue... any new ideas or results? Karsten Weiss wrote: > BTW: Did someone already open an official bug at > http://bugzilla.kernel.org ? Karsten, did you already file a bug? I told the whole issue to the Debian people which are abou

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-15 Thread Paul Slootman
[EMAIL PROTECTED] wrote: >On Wed, Dec 13, 2006 at 09:11:29PM +0100, Christoph Anton Mitterer wrote: > >> - error in the Opteron (memory controller) >> - error in the Nvidia chipsets >> - error in the kernel > >My guess without further information would be that some, but not all >BIOSes are doing so

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Dax Kelson
On Sat, 2006-12-02 at 01:56 +0100, Christoph Anton Mitterer wrote: > Hi. > > Perhaps some of you have read my older two threads: > http://marc.theaimsgroup.com/?t=11631244001&r=1&w=2 and the even > older http://marc.theaimsgroup.com/?t=11629131451&r=1&w=2 > > The issue was basically the f

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Christoph Anton Mitterer
Muli Ben-Yehuda wrote: >> 4) >> And does someone know if the nforce/opteron iommu requires IBM Calgary >> IOMMU support? >> > It doesn't, Calgary isn't found in machine with Opteron CPUs or NForce > chipsets (AFAIK). However, compiling Calgary in should make no > difference, as we detect in ru

Re: [PATCH 2nd try] Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Muli Ben-Yehuda
On Thu, Dec 14, 2006 at 02:16:31PM +0100, Karsten Weiss wrote: > On Thu, 14 Dec 2006, Muli Ben-Yehuda wrote: > > > The rest looks good. Please resend and I'll add my Acked-by. > > Thanks a lot for your comments and suggestions. Here's my 2nd try: > > === > > From: Karsten Weiss <[EMAIL PROTECTE

[PATCH 2nd try] Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Karsten Weiss
On Thu, 14 Dec 2006, Muli Ben-Yehuda wrote: > The rest looks good. Please resend and I'll add my Acked-by. Thanks a lot for your comments and suggestions. Here's my 2nd try: === From: Karsten Weiss <[EMAIL PROTECTED]> $ diffstat ~/iommu-patch_v2.patch Documentation/kernel-parameters.txt |

Re: [PATCH] Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Muli Ben-Yehuda
On Thu, Dec 14, 2006 at 12:38:08PM +0100, Karsten Weiss wrote: > On Thu, 14 Dec 2006, Muli Ben-Yehuda wrote: > > > On Wed, Dec 13, 2006 at 09:34:16PM +0100, Karsten Weiss wrote: > > > > > BTW: It would be really great if this area of the kernel would get some > > > more and better documentation.

[PATCH] Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Karsten Weiss
On Thu, 14 Dec 2006, Muli Ben-Yehuda wrote: > On Wed, Dec 13, 2006 at 09:34:16PM +0100, Karsten Weiss wrote: > > > BTW: It would be really great if this area of the kernel would get some > > more and better documentation. The information at > > linux-2.6/Documentation/x86_64/boot_options.txt is

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Muli Ben-Yehuda
On Thu, Dec 14, 2006 at 02:52:35AM -0700, Erik Andersen wrote: > On Thu Dec 14, 2006 at 11:23:11AM +0200, Muli Ben-Yehuda wrote: > > > I just realized that booting with "iommu=soft" makes my pcHDTV > > > HD5500 DVB cards not work. Time to go back to disabling the > > > memhole and losing 1 GB. :-

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Erik Andersen
On Thu Dec 14, 2006 at 11:23:11AM +0200, Muli Ben-Yehuda wrote: > > I just realized that booting with "iommu=soft" makes my pcHDTV > > HD5500 DVB cards not work. Time to go back to disabling the > > memhole and losing 1 GB. :-( > > That points to a bug in the driver (likely) or swiotlb (unlikely

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Chris Wedgwood
On Wed, Dec 13, 2006 at 09:11:29PM +0100, Christoph Anton Mitterer wrote: > - error in the Opteron (memory controller) > - error in the Nvidia chipsets > - error in the kernel My guess without further information would be that some, but not all BIOSes are doing some work to avoid this. Does anyo

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Muli Ben-Yehuda
On Thu, Dec 14, 2006 at 12:33:23AM +0100, Christoph Anton Mitterer wrote: > 4) > And does someone know if the nforce/opteron iommu requires IBM Calgary > IOMMU support? It doesn't, Calgary isn't found in machine with Opteron CPUs or NForce chipsets (AFAIK). However, compiling Calgary in should ma

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Muli Ben-Yehuda
On Wed, Dec 13, 2006 at 01:29:25PM -0700, Erik Andersen wrote: > On Mon Dec 11, 2006 at 10:24:02AM +0100, Karsten Weiss wrote: > > We could not reproduce the data corruption anymore if we boot > > the machines with the kernel parameter "iommu=soft" i.e. if we > > use software bounce buffering inste

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-14 Thread Muli Ben-Yehuda
On Wed, Dec 13, 2006 at 09:34:16PM +0100, Karsten Weiss wrote: > FWIW: As far as I understand the linux kernel code (I am no kernel > developer so please correct me if I am wrong) the PCI dma mapping code is > abstracted by struct dma_mapping_ops. I.e. there are currently four > possible implem

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Christoph Anton Mitterer
Hi. I've just looked for some kernel config options that might relate to our issue: 1) Old style AMD Opteron NUMA detection (CONFIG_K8_NUMA) Enable K8 NUMA node topology detection. You should say Y here if you have a multi processor AMD K8 system. This uses an old method to read the NUMA config

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Lennart Sorensen
On Wed, Dec 13, 2006 at 08:57:23PM +0100, Christoph Anton Mitterer wrote: > Don't understand me wrong,.. I don't use Windows (expect for upgrading > my Plextor firmware and EAC ;) )... but I ask because the more > information we get (even if it's not Linux specific) the more steps we > can take ;)

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Christoph Anton Mitterer
Lennart Sorensen wrote: > I upgrade my plextor firmware using linux. pxupdate for most devices, > and pxfw for new drivers (like the PX760). Works perfectly for me. It > is one of the reasons I buy plextors. Yes I know about it,.. although never tested it,... anyway the main reason for Windows i

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Karsten Weiss
On Wed, 13 Dec 2006, Chris Wedgwood wrote: > > Any ideas why iommu=disabled in the bios does not solve the issue? > > The kernel will still use the IOMMU if the BIOS doesn't set it up if > it can, check your dmesg for IOMMU strings, there might be something > printed to this effect. FWIW: As far

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Christoph Anton Mitterer
Erik Andersen wrote: > I just realized that booting with "iommu=soft" makes my pcHDTV > HD5500 DVB cards not work. Time to go back to disabling the > memhole and losing 1 GB. :-( Crazy,... I have a Hauppauge Nova-T 500 DualDVB-T card,... I'll check it later if I have the same problem and will inf

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Erik Andersen
On Mon Dec 11, 2006 at 10:24:02AM +0100, Karsten Weiss wrote: > We could not reproduce the data corruption anymore if we boot > the machines with the kernel parameter "iommu=soft" i.e. if we > use software bounce buffering instead of the hw-iommu. I just realized that booting with "iommu=soft" mak

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Karsten Weiss
On Wed, 13 Dec 2006, Erik Andersen wrote: > On Mon Dec 11, 2006 at 10:24:02AM +0100, Karsten Weiss wrote: > > Last week we did some more testing with the following result: > > > > We could not reproduce the data corruption anymore if we boot the machines > > with the kernel parameter "iommu=soft

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Erik Andersen
On Mon Dec 11, 2006 at 10:24:02AM +0100, Karsten Weiss wrote: > Last week we did some more testing with the following result: > > We could not reproduce the data corruption anymore if we boot the machines > with the kernel parameter "iommu=soft" i.e. if we use software bounce > buffering instead

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Christoph Anton Mitterer
Chris Wedgwood wrote: >> Did anyone made any test under Windows? I cannot set there >> iommu=soft, can I? >> > Windows never uses the hardware iommu, so it's always doing the > equivalent on iommu=soft > That would mean that I'm not able to reproduce the issue unter windows, right? Does tha

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Christoph Anton Mitterer
Karsten Weiss wrote: > "Memory hole mapping" was set to "hardware". With "disabled" we only > see 3 of our 4 GB memory. > That sounds reasonable,... I even only see 2,5 GB,.. as my memhole takes 1536 MB (don't ask me which PCI device needs that much address space ;) ) begin:vcard fn:Mitterer, Ch

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Christoph Anton Mitterer
Karsten Weiss wrote: > Of course, the big question "Why does the hardware iommu *not* > work on those machines?" still remains. > I'm going to check AMDs errata docs these days,.. perhaps I find something that relates. But I'd ask you to do the same as I don't consider myself as an expert in t

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Karsten Weiss
On Wed, 13 Dec 2006, Christoph Anton Mitterer wrote: Christoph, I will carefully re-read your entire posting and the included links on Monday and will also try the memory hole setting. And did you get out anything new? As I already mentioned the kernel parameter "iommu=soft" fixes the data c

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Chris Wedgwood
On Wed, Dec 13, 2006 at 08:18:21PM +0100, Christoph Anton Mitterer wrote: > booting with iommu=soft => works fine > booting with iommu=noagp => DOESN'T solve the error > booting with iommu=off => the system doesn't even boot and panics > When I set IOMMU to disabled in the BIOS the error is not s

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Chris Wedgwood
On Wed, Dec 13, 2006 at 08:20:59PM +0100, Christoph Anton Mitterer wrote: > Did anyone made any test under Windows? I cannot set there > iommu=soft, can I? Windows never uses the hardware iommu, so it's always doing the equivalent on iommu=soft - To unsubscribe from this list: send the line "unsu

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Christoph Anton Mitterer
Ah and I forgot,... Did anyone made any test under Windows? I cannot set there iommu=soft, can I? Chris. begin:vcard fn:Mitterer, Christoph Anton n:Mitterer;Christoph Anton email;internet:[EMAIL PROTECTED] x-mozilla-html:TRUE version:2.1 end:vcard

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Christoph Anton Mitterer
Karsten Weiss wrote: > Last week we did some more testing with the following result: > > We could not reproduce the data corruption anymore if we boot the machines > with the kernel parameter "iommu=soft" i.e. if we use software bounce > buffering instead of the hw-iommu. (As mentioned before, bo

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-13 Thread Christoph Anton Mitterer
Karsten Weiss wrote: > Here's a diff of a corrupted and a good file written during our > testcase: > > ("-" == corrupted file, "+" == good file) > ... > 009f2ff0 67 2a 4c c4 6d 9d 34 44 ad e6 3c 45 05 9a 4d c4 > |g*L.m.4D.. -009f3000 39 60 e6 44 20 ab 46 44 56 aa 46 44 c2 35 e6 44 |9.D .FD

amd64 iommu causing corruption? (was Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!)

2006-12-11 Thread Chris Wedgwood
On Mon, Dec 11, 2006 at 10:24:02AM +0100, Karsten Weiss wrote: > We could not reproduce the data corruption anymore if we boot the > machines with the kernel parameter "iommu=soft" i.e. if we use > software bounce buffering instead of the hw-iommu. (As mentioned > before, booting with mem=2g works

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-11 Thread Karsten Weiss
On Sat, 2 Dec 2006, Karsten Weiss wrote: > On Sat, 2 Dec 2006, Christoph Anton Mitterer wrote: > > > I found a severe bug mainly by fortune because it occurs very rarely. > > My test looks like the following: I have about 30GB of testing data on > > This sounds very familiar! One of the Linux co

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-07 Thread Christoph Anton Mitterer
Ville Herva wrote: > I saw something very similar with Via KT133 years ago. Then the culprit was > botched PCI implementation that sometimes corrupted PCI transfers when there > was heavy PCI I/O going on. Usually than meant running two disk transfers at > the same time. Doing heavy network I/O at

Re: data corruption with nvidia chipsets and IDE/SATA drives

2006-12-06 Thread Chris Wedgwood
On Wed, Dec 06, 2006 at 12:11:38PM +0100, Christian wrote: > I copied a massive amount of data (more than 500GB) several times > between the HDDs and ran md5sum each time, but it found no errors. It might be a known problem that your BIOS addresses already, or maybe it's restricted to some revisi

Re: data corruption with nvidia chipsets and IDE/SATA drives

2006-12-06 Thread Christian
Am Sonntag, 3. Dezember 2006 02:17 schrieb Kurtis D. Rader: > On Sat, 2006-12-02 01:56:06, Christoph Anton Mitterer wrote: > > The issue was basically the following: I found a severe bug mainly by > > fortune because it occurs very rarely. My test looks like the following: > > I have about 30GB of

Re: data corruption with nvidia chipsets and IDE/SATA drives

2006-12-04 Thread Kurtis D. Rader
On Sat, 2006-12-02 17:17:37, Kurtis D. Rader wrote: > I'm also experiencing silent data corruption on writes to SATA disks > connected to a Nvidia controller (nForce 4 chipset). The problem is > 100% reproducible. Details of my configuration (mainboard model, lspci, > etc.) are near the bottom of t

Re: data corruption with nvidia chipsets and IDE/SATA drives

2006-12-03 Thread Steffen Moser
Hi! * On Sat, Dec 02, 2006 at 05:17 PM (-0800), Kurtis D. Rader wrote: > On Sat, 2006-12-02 01:56:06, Christoph Anton Mitterer wrote: > > The issue was basically the following: I found a severe bug mainly by > > fortune because it occurs very rarely. My test looks like the following: > > I have

Re: data corruption with nvidia chipsets and IDE/SATA drives

2006-12-02 Thread Kurtis D. Rader
On Sat, 2006-12-02 17:17:37, Kurtis D. Rader wrote: > The same disks attached to a Promise TX2 SATA controller (in the same > system) experience no corruption. I spoke too soon. Corruption is occurring with the disks attached to the Promise TX2 SATA controller but much less frequently. With the dr

Re: data corruption with nvidia chipsets and IDE/SATA drives

2006-12-02 Thread Kurtis D. Rader
On Sat, 2006-12-02 01:56:06, Christoph Anton Mitterer wrote: > The issue was basically the following: I found a severe bug mainly by > fortune because it occurs very rarely. My test looks like the following: > I have about 30GB of testing data on my harddisk,... I repeat verifying > sha512 sums on

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-02 Thread Christoph Anton Mitterer
Chris Wedgwood wrote: > Heh, I see this also with an Tyan S2866 (nforce4 chipset). I've been > aware something is a miss for a while because if I transfer about 40GB > of data from one machine to another there are checksum mismatches and > some files have to be transfered again. > It seems that

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-02 Thread Christoph Anton Mitterer
Alan wrote: > See the thread http://lkml.org/lkml/2006/8/16/305 > Hi Alan. Thanks for your reply. I've read this thread already some weeks ago but from my limited knowledge I understood, that this was an issue related to a SCSI adapter or so. Or did I understand this wrong. And as soon as

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-02 Thread Alan
On Sat, 2 Dec 2006 12:00:36 +0100 (CET) Karsten Weiss <[EMAIL PROTECTED]> wrote: > Hello Christoph! > > On Sat, 2 Dec 2006, Christoph Anton Mitterer wrote: > > > I found a severe bug mainly by fortune because it occurs very rarely. > > My test looks like the following: I have about 30GB of testi

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-02 Thread Karsten Weiss
Hello Christoph! On Sat, 2 Dec 2006, Christoph Anton Mitterer wrote: I found a severe bug mainly by fortune because it occurs very rarely. My test looks like the following: I have about 30GB of testing data on This sounds very familiar! One of the Linux compute clusters I administer at work i

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-01 Thread Chris Wedgwood
On Sat, Dec 02, 2006 at 01:56:06AM +0100, Christoph Anton Mitterer wrote: > I found a severe bug mainly by fortune because it occurs very > rarely. My test looks like the following: I have about 30GB of > testing data on my harddisk,... I repeat verifying sha512 sums on > these files and check if

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-01 Thread Christoph Anton Mitterer
Erik Andersen wrote: > Doh! I have a Tyan S2895 in my system, and I've been pulling my > hair out trying to track down the cause of a similar somewhat > rare failure for the pre-computer sha1 of a block of data to > actually match the calculated sha1. I'd been hunting in vain the > past few days

Re: data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-01 Thread Erik Andersen
On Sat Dec 02, 2006 at 01:56:06AM +0100, Christoph Anton Mitterer wrote: > The issue was basically the following: > I found a severe bug mainly by fortune because it occurs very rarely. > My test looks like the following: I have about 30GB of testing data on > my harddisk,... I repeat verifying sha

data corruption with nvidia chipsets and IDE/SATA drives // memory hole mapping related bug?!

2006-12-01 Thread Christoph Anton Mitterer
Hi. Perhaps some of you have read my older two threads: http://marc.theaimsgroup.com/?t=11631244001&r=1&w=2 and the even older http://marc.theaimsgroup.com/?t=11629131451&r=1&w=2 The issue was basically the following: I found a severe bug mainly by fortune because it occurs very rarely. M