Re: excess timeouts accessing pci-sata card both ports
[EMAIL PROTECTED] wrote: > Removed 3 drives. Both data and power. > Left only port1 connected > It's the drive and power cable that in the first test was port2 failing > and in second test was port1 and ok > both cards plugged in and initialized. > third test it's the port1 alone sda working ok > fourth test it's the port2 alone sda failing. > > any idea what's wrong as this does not seem to be power related problem. > > vanilla 2.6.23.14 > sata_via module... details in the inlined emails. > > (4th test config) > ata1: SATA link down (SStatus 0 SControl 310) > ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310) > ata4: SATA link down (SStatus 0 SControl 310) > ata5: SATA link down (SStatus 0 SControl 310) > > Faulty card? Both cards? Driver bug? Or design bug/incompatibility on second > port? It seems like you're experiencing transmission problems on reads. SError value of 0x3000400 is Protocol Error, Link Sequence Error and Transport State Transition Error. Hardresetting the link should usually recover from such conditions but error handling on via has never been reliable. How easily can you trigger the problem? Thanks. -- tejun - To unsubscribe from this list: send the line "unsubscribe linux-ide" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: excess timeouts accessing pci-sata card both ports
Hmm. Removed 3 drives. Both data and power. Left only port1 connected It's the drive and power cable that in the first test was port2 failing and in second test was port1 and ok both cards plugged in and initialized. third test it's the port1 alone sda working ok fourth test it's the port2 alone sda failing. any idea what's wrong as this does not seem to be power related problem. vanilla 2.6.23.14 sata_via module... details in the inlined emails. (4th test config) ata1: SATA link down (SStatus 0 SControl 310) ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310) ata4: SATA link down (SStatus 0 SControl 310) ata5: SATA link down (SStatus 0 SControl 310) Faulty card? Both cards? Driver bug? Or design bug/incompatibility on second port? __ > Od: [EMAIL PROTECTED] > Komu: > Datum: 24.01.2008 23:59 > Předmět: Re: excess timeouts accessing pci-sata card both ports > >Hi. > >More through test revealed that only the second port of each card is failing. > >Accessing (cat /dev/sdX > /dev/null) just the sdb or sdd alone = one drive at time produces the error message. > >Switched data cables on the card. ata1 port for ata2 port. This should eliminate defective power cable to sdb. >And also switched ata4 port for ata5 port data cable. This sould eliminate defective power cable to sdd. >(ata3 and 6 are not connected PATA ports on each card) > >Failing is again port 2 and port 5 which is now originally flawlessly working drives now failing. > > >__ >> Od: [EMAIL PROTECTED] >> Komu: >> Datum: 24.01.2008 09:27 >> Předmět: excess timeouts accessing pci-sata card both ports >> >>Hello. >> >>The scenario: >>PCI bus with 2 cards (sata_via) >>Each card has 2 SATA ports connected. >>Each disk does 40MB/s in hdparm -t read test. >> >>First card has the sda and sdb >>Second card has the sdc and sdd >>verified by watching the led blinking on the card >> >>ok - accessing two ports at same time but each port on different card: >>cat /dev/sda > /dev/null & cat /dev/sdc > /dev/null >> >>problem - accessing two ports at same time and both ports on same card: >>cat /dev/sda > /dev/null & cat /dev/sdb > /dev/null >> >>side effect of the problem: >>due to heavy traffic on one port the reset of the other port fails with >timeout and port is disabled >> >>logged error command at the initial problem (is it DMA read ext and DMA >write log?): >>ata2.00: exception Emask 0x2 SAct 0x0 SErr 0x3000400 action 0x2 frozen >>ata2.00: cmd 25/00:68:b8:83:13/00:01:00:00:00/e0 tag 0 cdb 0x0 data >184320 in >> res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x6 (timeout) >>ata2: port is slow to respond, please be patient (Status 0xd0) >> >>ata2.00: exception Emask 0x2 SAct 0x0 SErr 0x3000400 action 0x2 frozen >>ata2.00: cmd c8/00:60:70:50:08/00:00:00:00:00/e0 tag 0 cdb 0x0 data 49152 >in >> res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x6 (timeout) >>ata2: port is slow to respond, please be patient (Status 0xd0) >> >>logged EH sequence failure: >>ata2: port is slow to respond, please be patient (Status 0xd0) >>ata2: device not ready (errno=-16), forcing hardreset >>ata2: hard resetting port >>ata2: port is slow to respond, please be patient (Status 0x80) >>ata2: COMRESET failed (errno=-16) >>ata2: hard resetting port >>ata2: port is slow to respond, please be patient (Status 0x80) >>ata2: COMRESET failed (errno=-16) >>ata2: hard resetting port >>ata2: port is slow to respond, please be patient (Status 0x80) >>ata2: COMRESET failed (errno=-16) >>ata2: hard resetting port >>ata2: COMRESET failed (errno=-16) >>ata2: reset failed, giving up >>ata2.00: disabled >>ata2: EH complete >>sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00 >> >>tried does not work: >>- multiply the ATA_TMOUT values by 10 and later by 100 against vanilla >>- increase retry counters >> >>some lines from dmesg below >>libata version 2.21 loaded. >>sata_via :00:0d.0: version 2.3 >>ACPI: PCI Interrupt :00:0d.0[A] -> Link [LNKA] -> GSI 11 (level, low) >-> IRQ 11 >>sata_via :00:0d.0: routed to hard irq line 11 >>scsi0 : sata_via >>scsi1 : sata_via >>scsi2 : sata_via >>ata1: SATA max UDMA/133 cmd 0x0001b800 ctl 0x0001b80a bmdma 0x0001a400 >irq 11 >>ata2: SATA max UDMA/133 cmd 0x0001b400 ctl 0x0001b40a bmdma 0x0001a408 >irq 11 >>ata3: PATA max UDMA/133 cmd 0x0001b000 ctl 0x0001b00a bmdma 0x0001a
Re: excess timeouts accessing pci-sata card both ports
Hi. More through test revealed that only the second port of each card is failing. Accessing (cat /dev/sdX > /dev/null) just the sdb or sdd alone = one drive at time produces the error message. Switched data cables on the card. ata1 port for ata2 port. This should eliminate defective power cable to sdb. And also switched ata4 port for ata5 port data cable. This sould eliminate defective power cable to sdd. (ata3 and 6 are not connected PATA ports on each card) Failing is again port 2 and port 5 which is now originally flawlessly working drives now failing. __ > Od: [EMAIL PROTECTED] > Komu: > Datum: 24.01.2008 09:27 > Předmět: excess timeouts accessing pci-sata card both ports > >Hello. > >The scenario: >PCI bus with 2 cards (sata_via) >Each card has 2 SATA ports connected. >Each disk does 40MB/s in hdparm -t read test. > >First card has the sda and sdb >Second card has the sdc and sdd >verified by watching the led blinking on the card > >ok - accessing two ports at same time but each port on different card: >cat /dev/sda > /dev/null & cat /dev/sdc > /dev/null > >problem - accessing two ports at same time and both ports on same card: >cat /dev/sda > /dev/null & cat /dev/sdb > /dev/null > >side effect of the problem: >due to heavy traffic on one port the reset of the other port fails with timeout and port is disabled > >logged error command at the initial problem (is it DMA read ext and DMA write log?): >ata2.00: exception Emask 0x2 SAct 0x0 SErr 0x3000400 action 0x2 frozen >ata2.00: cmd 25/00:68:b8:83:13/00:01:00:00:00/e0 tag 0 cdb 0x0 data 184320 in > res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x6 (timeout) >ata2: port is slow to respond, please be patient (Status 0xd0) > >ata2.00: exception Emask 0x2 SAct 0x0 SErr 0x3000400 action 0x2 frozen >ata2.00: cmd c8/00:60:70:50:08/00:00:00:00:00/e0 tag 0 cdb 0x0 data 49152 in > res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x6 (timeout) >ata2: port is slow to respond, please be patient (Status 0xd0) > >logged EH sequence failure: >ata2: port is slow to respond, please be patient (Status 0xd0) >ata2: device not ready (errno=-16), forcing hardreset >ata2: hard resetting port >ata2: port is slow to respond, please be patient (Status 0x80) >ata2: COMRESET failed (errno=-16) >ata2: hard resetting port >ata2: port is slow to respond, please be patient (Status 0x80) >ata2: COMRESET failed (errno=-16) >ata2: hard resetting port >ata2: port is slow to respond, please be patient (Status 0x80) >ata2: COMRESET failed (errno=-16) >ata2: hard resetting port >ata2: COMRESET failed (errno=-16) >ata2: reset failed, giving up >ata2.00: disabled >ata2: EH complete >sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00 > >tried does not work: >- multiply the ATA_TMOUT values by 10 and later by 100 against vanilla >- increase retry counters > >some lines from dmesg below >libata version 2.21 loaded. >sata_via :00:0d.0: version 2.3 >ACPI: PCI Interrupt :00:0d.0[A] -> Link [LNKA] -> GSI 11 (level, low) -> IRQ 11 >sata_via :00:0d.0: routed to hard irq line 11 >scsi0 : sata_via >scsi1 : sata_via >scsi2 : sata_via >ata1: SATA max UDMA/133 cmd 0x0001b800 ctl 0x0001b80a bmdma 0x0001a400 irq 11 >ata2: SATA max UDMA/133 cmd 0x0001b400 ctl 0x0001b40a bmdma 0x0001a408 irq 11 >ata3: PATA max UDMA/133 cmd 0x0001b000 ctl 0x0001b00a bmdma 0x0001a410 irq 11 >ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 310) >ata1.00: ATA-7: Maxtor 6L250S0, BANC1G10, max UDMA/133 >ata1.00: 490234752 sectors, multi 16: LBA48 NCQ (not used) >ata1.00: configured for UDMA/133 >ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310) >ata2.00: ATA-7: Maxtor 6L250S0, BANC1G10, max UDMA/133 >ata2.00: 490234752 sectors, multi 16: LBA48 NCQ (not used) >ata2.00: configured for UDMA/133 >scsi 0:0:0:0: Direct-Access ATA Maxtor 6L250S0 BANC PQ: 0 ANSI: 5 >sd 0:0:0:0: [sda] 490234752 512-byte hardware sectors (251000 MB) >sd 0:0:0:0: [sda] Write Protect is off >sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00 >sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA >sd 0:0:0:0: [sda] 490234752 512-byte hardware sectors (251000 MB) >sd 0:0:0:0: [sda] Write Protect is off >sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00 >sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA > sda: sda1 sda2 sda3 >sd 0:0:0:0: [sda] Attached SCSI disk >sd 0:0:0:0: Attached scsi generic sg0 type 0 >scsi 1:0:0:0: Direct-Access ATA Maxtor 6L250S0 BANC PQ: 0 ANSI: 5 >sd 1:0:0:0: [sdb] 490234752 512-byte hardware sectors (251000 MB) >sd 1:0:0:0: [sdb] Write Protect is off >sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00 >sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA >sd 1:0:0:0: [sdb] 490234752 512-byte hardware sectors (251000 MB) >sd 1:0:0:0: [sdb] Write Protect is off >sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00 >sd 1:0:0:0: [sdb] Write cac