Re: disabling sata_nv ADMA for 2.6.24

2008-01-09 Thread Tejun Heo
Robert Hancock wrote: > However, how about putting in a printk in nv_adma_interrupt handler here: > > /* freeze if hotplugged or controller error */ > if (unlikely(status & (NV_ADMA_STAT_HOTPLUG | >NV_ADMA_STAT_HOTUNPLUG | >NV_ADMA_STAT_TIMEOUT | >NV

Re: disabling sata_nv ADMA for 2.6.24

2008-01-09 Thread Robert Hancock
Tejun Heo wrote: How about putting a bunch of printks inside the interrupt handler? That would tell us if it's even reaching the interrupt handler.. If you give me a patch, I'll apply it and cause lock up and report the result. Just shoot the patches my way. But maybe reproducing the lock up

Re: disabling sata_nv ADMA for 2.6.24

2008-01-08 Thread Tejun Heo
Robert Hancock wrote: > Tejun Heo wrote: >> Tejun Heo wrote: >>> Robert Hancock wrote: Tejun Heo wrote: > Tejun Heo wrote: >>> [ 34.466899] testing NMI watchdog ... <4>WARNING: CPU#0: NMI >>> appears >>> to be stuck (0->0)! >>> [ 34.555056] WARNING: CPU#1: NMI appears t

Re: disabling sata_nv ADMA for 2.6.24

2008-01-08 Thread Robert Hancock
Tejun Heo wrote: Tejun Heo wrote: Robert Hancock wrote: Tejun Heo wrote: Tejun Heo wrote: [ 34.466899] testing NMI watchdog ... <4>WARNING: CPU#0: NMI appears to be stuck (0->0)! [ 34.555056] WARNING: CPU#1: NMI appears to be stuck (0->0)! Oops, missed that. I'll see whether there's IRQ

Re: disabling sata_nv ADMA for 2.6.24

2008-01-08 Thread Tejun Heo
Tejun Heo wrote: > Robert Hancock wrote: >> Tejun Heo wrote: >>> Tejun Heo wrote: > [ 34.466899] testing NMI watchdog ... <4>WARNING: CPU#0: NMI appears > to be stuck (0->0)! > [ 34.555056] WARNING: CPU#1: NMI appears to be stuck (0->0)! Oops, missed that. I'll see whether the

Re: disabling sata_nv ADMA for 2.6.24

2008-01-08 Thread Tejun Heo
Robert Hancock wrote: > Tejun Heo wrote: >> Tejun Heo wrote: [ 34.466899] testing NMI watchdog ... <4>WARNING: CPU#0: NMI appears to be stuck (0->0)! [ 34.555056] WARNING: CPU#1: NMI appears to be stuck (0->0)! >>> Oops, missed that. I'll see whether there's IRQ storm going on.

Re: disabling sata_nv ADMA for 2.6.24

2008-01-08 Thread Robert Hancock
Tejun Heo wrote: Tejun Heo wrote: [ 34.466899] testing NMI watchdog ... <4>WARNING: CPU#0: NMI appears to be stuck (0->0)! [ 34.555056] WARNING: CPU#1: NMI appears to be stuck (0->0)! Oops, missed that. I'll see whether there's IRQ storm going on. I made the nv irq handler to print messa

Re: disabling sata_nv ADMA for 2.6.24

2008-01-08 Thread Tejun Heo
Tejun Heo wrote: >> [ 34.466899] testing NMI watchdog ... <4>WARNING: CPU#0: NMI appears >> to be stuck (0->0)! >> [ 34.555056] WARNING: CPU#1: NMI appears to be stuck (0->0)! > > Oops, missed that. I'll see whether there's IRQ storm going on. I made the nv irq handler to print message every

Re: disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Tejun Heo
Robert Hancock wrote: > Tejun Heo wrote: >> Tejun Heo wrote: >>> Robert Hancock wrote: > Okay, just succeeded on the current #upstream-fixes, attaching the > log. > The machine is a brick after the crash. I assume the cable got reconnected at 325 seconds? It looks like that w

Re: disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Robert Hancock
Tejun Heo wrote: Tejun Heo wrote: Robert Hancock wrote: Okay, just succeeded on the current #upstream-fixes, attaching the log. The machine is a brick after the crash. I assume the cable got reconnected at 325 seconds? It looks like that was during error handling for the previous unplug? I d

Re: disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Tejun Heo
Tejun Heo wrote: > Robert Hancock wrote: >>> Okay, just succeeded on the current #upstream-fixes, attaching the log. >>> The machine is a brick after the crash. >> I assume the cable got reconnected at 325 seconds? It looks like that >> was during error handling for the previous unplug? > > I don

Re: disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Tejun Heo
Robert Hancock wrote: >> Okay, just succeeded on the current #upstream-fixes, attaching the log. >> The machine is a brick after the crash. > > I assume the cable got reconnected at 325 seconds? It looks like that > was during error handling for the previous unplug? I don't remember too well (th

Re: disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Robert Hancock
Tejun Heo wrote: Tejun Heo wrote: Robert Hancock wrote: This has only been reported on one person's MSI board. Apparently another revision of the same board is reported to work, and I can't duplicate the problem on my Asus board, so it could just be some hardware problem on that motherboard. I

Re: disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Tejun Heo
Tejun Heo wrote: > Robert Hancock wrote: This has only been reported on one person's MSI board. Apparently another revision of the same board is reported to work, and I can't duplicate the problem on my Asus board, so it could just be some hardware problem on that motherboard. >

Re: disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Tejun Heo
Robert Hancock wrote: >>> This has only been reported on one person's MSI board. Apparently >>> another revision of the same board is reported to work, and I can't >>> duplicate the problem on my Asus board, so it could just be some >>> hardware problem on that motherboard. >> >> IIRC, I have two f

Re: disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Robert Hancock
Tejun Heo wrote: Robert Hancock wrote: Mark Lord wrote: Tejun Heo wrote: Hello, guys. We still have three problems with ADMA. * hard lockup during resume * occasional hard lockup after hotplug or other errors (probably related to the above?) This has only been reported on one person's MSI boa

Re: disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Tejun Heo
Robert Hancock wrote: > Mark Lord wrote: >> Tejun Heo wrote: >>> Hello, guys. >>> >>> We still have three problems with ADMA. >>> >>> * hard lockup during resume * occasional hard lockup after >>> hotplug or other errors (probably related to the above?) > > This has only been reported on one pers

Re: disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Robert Hancock
Mark Lord wrote: Tejun Heo wrote: Hello, guys. We still have three problems with ADMA. * hard lockup during resume * occasional hard lockup after hotplug or other errors (probably related to the above?) This has only been reported on one person's MSI board. Apparently another revision of the

Re: disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Mark Lord
Tejun Heo wrote: Hello, guys. We still have three problems with ADMA. * hard lockup during resume * occasional hard lockup after hotplug or other errors (probably related to the above?) * occasional timeout of FLUSH after NCQ writes I think we should disable ADMA for 2.6.24 and -stable for now.

disabling sata_nv ADMA for 2.6.24

2008-01-07 Thread Tejun Heo
Hello, guys. We still have three problems with ADMA.
* hard lockup during resume
* occasional hard lockup after hotplug or other errors (probably related to the above?)
* occasional timeout of FLUSH after NCQ writes
I think we should disable ADMA for 2.6.24 and -stable for now. What do you guys