Re: excess timeouts accessing pci-sata card both ports

2008-02-01 Thread Tejun Heo
[EMAIL PROTECTED] wrote:
> Removed 3 drives, both data and power cables.
> Left only port1 connected.
> It's the drive and power cable that was on port2 and failing in the first
> test, and on port1 and OK in the second test.
> Both cards are plugged in and initialized.
> Third test: port1 alone, sda working OK.
> Fourth test: port2 alone, sda failing.
> 
> Any idea what's wrong? This does not seem to be a power-related problem.
> 
> vanilla 2.6.23.14
> sata_via module... details in the inlined emails.
> 
> (4th test config)
> ata1: SATA link down (SStatus 0 SControl 310)
> ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
> ata4: SATA link down (SStatus 0 SControl 310)
> ata5: SATA link down (SStatus 0 SControl 310)
> 
> Faulty card? Both cards? Driver bug? Or design bug/incompatibility on second 
> port?

It seems like you're experiencing transmission problems on reads.
SError value of 0x3000400 is Protocol Error, Transport State Transition
Error and Unrecognized FIS Type.  Hardresetting the link should usually
recover from such conditions but error handling on via has never been
reliable.  How easily can you trigger the problem?
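
A quick way to double-check which SError bits are set is to decode the value
bit by bit. This is only a minimal sketch: the bit positions are the SERR_*
constants as I recall them from include/linux/ata.h, the descriptive names
are my own wording, and decode_serror() is a hypothetical helper, not
something libata provides.

```python
# Bit positions assumed from the SERR_* constants in include/linux/ata.h;
# verify them against your own kernel tree before relying on this.
SERR_BITS = {
    0: "recovered data error",
    1: "recovered communication error",
    8: "unrecovered data error",
    9: "persistent communication or data integrity error",
    10: "protocol error",
    11: "internal error",
    16: "PhyRdy change",
    17: "PHY internal error",
    18: "COMWAKE detected",
    19: "10b to 8b decode error",
    20: "disparity error",
    21: "CRC error",
    22: "handshake error",
    23: "link sequence error",
    24: "transport state transition error",
    25: "unrecognized FIS type",
    26: "device exchanged",
}


def decode_serror(value):
    """Return (bit, name) pairs for every bit set in an SError value."""
    return [(bit, SERR_BITS.get(bit, "reserved/unknown"))
            for bit in range(32) if value & (1 << bit)]


# SErr value taken from the exception lines in the original report.
for bit, name in decode_serror(0x3000400):
    print(f"bit {bit:2d}: {name}")
```

For 0x3000400 this lists bits 10, 24 and 25.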

Thanks.

-- 
tejun


Re: excess timeouts accessing pci-sata card both ports

2008-01-24 Thread drtyc
Hmm.

Removed 3 drives, both data and power cables.
Left only port1 connected.
It's the drive and power cable that was on port2 and failing in the first
test, and on port1 and OK in the second test.
Both cards are plugged in and initialized.
Third test: port1 alone, sda working OK.
Fourth test: port2 alone, sda failing.

Any idea what's wrong? This does not seem to be a power-related problem.

vanilla 2.6.23.14
sata_via module... details in the inlined emails.

(4th test config)
ata1: SATA link down (SStatus 0 SControl 310)
ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
ata4: SATA link down (SStatus 0 SControl 310)
ata5: SATA link down (SStatus 0 SControl 310)
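
For reference, the SStatus and SControl numbers in these lines are printed
in hex and pack three 4-bit fields (DET, SPD, IPM). A minimal sketch of
splitting them, assuming the field layout from the SATA specification as I
understand it; split_sata_reg() and describe_sstatus() are hypothetical
helper names used only for this illustration.

```python
def split_sata_reg(value):
    """Split an SStatus or SControl value into its DET, SPD and IPM nibbles."""
    return {
        "det": value & 0xF,          # bits 3:0
        "spd": (value >> 4) & 0xF,   # bits 7:4
        "ipm": (value >> 8) & 0xF,   # bits 11:8
    }


# SStatus meanings (assumed from the SATA spec; other values are reserved).
SSTATUS_DET = {
    0: "no device detected, PHY communication not established",
    1: "device detected, PHY communication not established",
    3: "device detected, PHY communication established",
    4: "PHY offline",
}
SSTATUS_SPD = {
    0: "no negotiated speed",
    1: "Gen1 (1.5 Gbps)",
    2: "Gen2 (3.0 Gbps)",
}


def describe_sstatus(hex_text):
    """Describe DET and SPD of an SStatus value as printed in the log."""
    fields = split_sata_reg(int(hex_text, 16))
    return (SSTATUS_DET.get(fields["det"], "reserved"),
            SSTATUS_SPD.get(fields["spd"], "reserved"))


# Values taken from the 4th test config above.
for port, sstatus in [("ata1", "0"), ("ata2", "113")]:
    print(port, describe_sstatus(sstatus))
print("SControl 310 ->", split_sata_reg(int("310", 16)))
```

SStatus 113 reads as DET=3, SPD=1, IPM=1, which matches the "link up 1.5
Gbps" text, so at the PHY level port 2 looks fine even in the failing test.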

Faulty card? Both cards? Driver bug? Or design bug/incompatibility on second 
port?
__
> From: [EMAIL PROTECTED]
> To: 
> Date: 24.01.2008 23:59
> Subject: Re: excess timeouts accessing pci-sata card both ports
>
>Hi.
>
>A more thorough test revealed that only the second port of each card is
>failing.
>
>Accessing just sdb or sdd alone (cat /dev/sdX > /dev/null), one drive at a
>time, produces the error message.
>
>Switched the data cables on the card: ata1 port for ata2 port. This should
>rule out a defective power cable to sdb.
>Also switched the ata4 and ata5 data cables. This should rule out a
>defective power cable to sdd.
>(ata3 and ata6 are the unconnected PATA ports on each card.)
>
>Ports 2 and 5 fail again, so the drives that originally worked flawlessly
>are now the ones failing.
>
>
>__
>> From: [EMAIL PROTECTED]
>> To: 
>> Date: 24.01.2008 09:27
>> Subject: excess timeouts accessing pci-sata card both ports
>>
>>Hello.
>>
>>The scenario:
>>PCI bus with 2 cards (sata_via)
>>Each card has 2 SATA ports connected.
>>Each disk does 40MB/s in hdparm -t read test.
>>
>>First card has the sda and sdb
>>Second card has the sdc and sdd
>>verified by watching the led blinking on the card
>>
>>ok - accessing two ports at same time but each port on different card:
>>cat /dev/sda > /dev/null & cat /dev/sdc > /dev/null
>>
>>problem - accessing two ports at same time and both ports on same card:
>>cat /dev/sda > /dev/null & cat /dev/sdb > /dev/null
>>
>>side effect of the problem:
>>due to heavy traffic on one port the reset of the other port fails with
>>timeout and port is disabled
>>
>>logged error command at the initial problem (is it DMA read ext and DMA
>>write log?):
>>ata2.00: exception Emask 0x2 SAct 0x0 SErr 0x3000400 action 0x2 frozen
>>ata2.00: cmd 25/00:68:b8:83:13/00:01:00:00:00/e0 tag 0 cdb 0x0 data 184320 in
>> res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x6 (timeout)
>>ata2: port is slow to respond, please be patient (Status 0xd0)
>>
>>ata2.00: exception Emask 0x2 SAct 0x0 SErr 0x3000400 action 0x2 frozen
>>ata2.00: cmd c8/00:60:70:50:08/00:00:00:00:00/e0 tag 0 cdb 0x0 data 49152 in
>> res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x6 (timeout)
>>ata2: port is slow to respond, please be patient (Status 0xd0)
>>
>>logged EH sequence failure:
>>ata2: port is slow to respond, please be patient (Status 0xd0)
>>ata2: device not ready (errno=-16), forcing hardreset
>>ata2: hard resetting port
>>ata2: port is slow to respond, please be patient (Status 0x80)
>>ata2: COMRESET failed (errno=-16)
>>ata2: hard resetting port
>>ata2: port is slow to respond, please be patient (Status 0x80)
>>ata2: COMRESET failed (errno=-16)
>>ata2: hard resetting port
>>ata2: port is slow to respond, please be patient (Status 0x80)
>>ata2: COMRESET failed (errno=-16)
>>ata2: hard resetting port
>>ata2: COMRESET failed (errno=-16)
>>ata2: reset failed, giving up
>>ata2.00: disabled
>>ata2: EH complete
>>sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
>>
>>tried, did not work:
>>- multiplying the ATA_TMOUT values by 10 and later by 100 against vanilla
>>- increasing the retry counters
>>
>>some lines from dmesg below
>>libata version 2.21 loaded.
>>sata_via :00:0d.0: version 2.3
>>ACPI: PCI Interrupt :00:0d.0[A] -> Link [LNKA] -> GSI 11 (level, low) -> IRQ 11
>>sata_via :00:0d.0: routed to hard irq line 11
>>scsi0 : sata_via
>>scsi1 : sata_via
>>scsi2 : sata_via
>>ata1: SATA max UDMA/133 cmd 0x0001b800 ctl 0x0001b80a bmdma 0x0001a400 irq 11
>>ata2: SATA max UDMA/133 cmd 0x0001b400 ctl 0x0001b40a bmdma 0x0001a408 irq 11
>>ata3: PATA max UDMA/133 cmd 0x0001b000 ctl 0x0001b00a bmdma 0x0001a

Re: excess timeouts accessing pci-sata card both ports

2008-01-24 Thread drtyc
Hi.

A more thorough test revealed that only the second port of each card is
failing.

Accessing just sdb or sdd alone (cat /dev/sdX > /dev/null), one drive at a
time, produces the error message.

Switched the data cables on the card: ata1 port for ata2 port. This should
rule out a defective power cable to sdb.
Also switched the ata4 and ata5 data cables. This should rule out a
defective power cable to sdd.
(ata3 and ata6 are the unconnected PATA ports on each card.)

Ports 2 and 5 fail again, so the drives that originally worked flawlessly
are now the ones failing.


__
> From: [EMAIL PROTECTED]
> To: 
> Date: 24.01.2008 09:27
> Subject: excess timeouts accessing pci-sata card both ports
>
>Hello.
>
>The scenario:
>PCI bus with 2 cards (sata_via)
>Each card has 2 SATA ports connected.
>Each disk does 40MB/s in hdparm -t read test.
>
>First card has the sda and sdb
>Second card has the sdc and sdd
>verified by watching the led blinking on the card
>
>ok - accessing two ports at same time but each port on different card:
>cat /dev/sda > /dev/null & cat /dev/sdc > /dev/null
>
>problem - accessing two ports at same time and both ports on same card:
>cat /dev/sda > /dev/null & cat /dev/sdb > /dev/null
>
>side effect of the problem:
>due to heavy traffic on one port the reset of the other port fails with
>timeout and port is disabled
>
>logged error command at the initial problem (is it DMA read ext and DMA
>write log?):
>ata2.00: exception Emask 0x2 SAct 0x0 SErr 0x3000400 action 0x2 frozen
>ata2.00: cmd 25/00:68:b8:83:13/00:01:00:00:00/e0 tag 0 cdb 0x0 data 184320 in
> res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x6 (timeout)
>ata2: port is slow to respond, please be patient (Status 0xd0)
>
>ata2.00: exception Emask 0x2 SAct 0x0 SErr 0x3000400 action 0x2 frozen
>ata2.00: cmd c8/00:60:70:50:08/00:00:00:00:00/e0 tag 0 cdb 0x0 data 49152 in
> res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x6 (timeout)
>ata2: port is slow to respond, please be patient (Status 0xd0)
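
On the "is it DMA read ext and DMA write log?" question: going by the ATA
opcodes, 0x25 is READ DMA EXT and 0xc8 is READ DMA, so both failed commands
are reads. Below is a minimal sketch that unpacks the "cmd" field. It
assumes the field order
command/feature:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device
(my reading of how libata prints the taskfile) and 512-byte sectors;
decode_cmd() is a hypothetical helper written for this mail.

```python
# Opcode names for the few commands of interest here (from the ATA spec).
CMD_NAMES = {
    0x25: "READ DMA EXT",
    0x35: "WRITE DMA EXT",
    0xC8: "READ DMA",
    0xCA: "WRITE DMA",
}
LBA48_CMDS = {0x25, 0x35}


def decode_cmd(text):
    """Decode a libata 'cmd' taskfile string into name, LBA, sectors, bytes."""
    groups = text.split("/")
    command = int(groups[0], 16)
    feature, nsect, lbal, lbam, lbah = (int(x, 16) for x in groups[1].split(":"))
    hob = [int(x, 16) for x in groups[2].split(":")]
    _hob_feature, hob_nsect, hob_lbal, hob_lbam, hob_lbah = hob
    device = int(groups[3], 16)

    low24 = (lbah << 16) | (lbam << 8) | lbal
    if command in LBA48_CMDS:
        # LBA48: the high-order ("hob") bytes extend count and address.
        sectors = ((hob_nsect << 8) | nsect) or 65536
        lba = (hob_lbah << 40) | (hob_lbam << 32) | (hob_lbal << 24) | low24
    else:
        # LBA28: bits 27:24 of the address live in the device register.
        sectors = nsect or 256
        lba = ((device & 0x0F) << 24) | low24

    name = CMD_NAMES.get(command, "unknown (0x%02x)" % command)
    return name, lba, sectors, sectors * 512  # assumes 512-byte sectors


# The two cmd fields from the exception lines quoted above.
for cmd in ("25/00:68:b8:83:13/00:01:00:00:00/e0",
            "c8/00:60:70:50:08/00:00:00:00:00/e0"):
    print(decode_cmd(cmd))
```

The byte counts come out as 184320 and 49152, the same as the "data ... in"
values in the log, which is some reassurance that the field order assumed
here is right.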
>
>logged EH sequence failure:
>ata2: port is slow to respond, please be patient (Status 0xd0)
>ata2: device not ready (errno=-16), forcing hardreset
>ata2: hard resetting port
>ata2: port is slow to respond, please be patient (Status 0x80)
>ata2: COMRESET failed (errno=-16)
>ata2: hard resetting port
>ata2: port is slow to respond, please be patient (Status 0x80)
>ata2: COMRESET failed (errno=-16)
>ata2: hard resetting port
>ata2: port is slow to respond, please be patient (Status 0x80)
>ata2: COMRESET failed (errno=-16)
>ata2: hard resetting port
>ata2: COMRESET failed (errno=-16)
>ata2: reset failed, giving up
>ata2.00: disabled
>ata2: EH complete
>sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
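
The Status values and the errno in this sequence can be read the same way.
A minimal sketch, assuming the classic ATA status register bit layout;
decode_status() is a hypothetical helper, and the errno lookup uses
Python's standard errno module.

```python
import errno

# ATA status register bits, most significant first (classic layout;
# CORR and IDX are obsolete in newer ATA revisions).
ATA_STATUS_BITS = [
    (0x80, "BSY"),
    (0x40, "DRDY"),
    (0x20, "DF"),
    (0x10, "DSC"),
    (0x08, "DRQ"),
    (0x04, "CORR"),
    (0x02, "IDX"),
    (0x01, "ERR"),
]


def decode_status(value):
    """Return the names of the bits set in an ATA status byte."""
    return [name for mask, name in ATA_STATUS_BITS if value & mask]


# Status values from the "port is slow to respond" lines above.
for status in (0xD0, 0x80):
    print("Status 0x%02x:" % status, decode_status(status))

# The kernel prints errno=-16; on Linux, errno 16 is EBUSY.
print("errno 16:", errno.errorcode.get(16))
```

So the port reports BSY the whole time (0xd0 before the hardreset, plain
0x80 after each COMRESET attempt), which fits the reset giving up with
-EBUSY.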
>
>tried, did not work:
>- multiplying the ATA_TMOUT values by 10 and later by 100 against vanilla
>- increasing the retry counters
>
>some lines from dmesg below
>libata version 2.21 loaded.
>sata_via :00:0d.0: version 2.3
>ACPI: PCI Interrupt :00:0d.0[A] -> Link [LNKA] -> GSI 11 (level, low) -> IRQ 11
>sata_via :00:0d.0: routed to hard irq line 11
>scsi0 : sata_via
>scsi1 : sata_via
>scsi2 : sata_via
>ata1: SATA max UDMA/133 cmd 0x0001b800 ctl 0x0001b80a bmdma 0x0001a400 irq 11
>ata2: SATA max UDMA/133 cmd 0x0001b400 ctl 0x0001b40a bmdma 0x0001a408 irq 11
>ata3: PATA max UDMA/133 cmd 0x0001b000 ctl 0x0001b00a bmdma 0x0001a410 irq 11
>ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
>ata1.00: ATA-7: Maxtor 6L250S0, BANC1G10, max UDMA/133
>ata1.00: 490234752 sectors, multi 16: LBA48 NCQ (not used)
>ata1.00: configured for UDMA/133
>ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
>ata2.00: ATA-7: Maxtor 6L250S0, BANC1G10, max UDMA/133
>ata2.00: 490234752 sectors, multi 16: LBA48 NCQ (not used)
>ata2.00: configured for UDMA/133
>scsi 0:0:0:0: Direct-Access ATA  Maxtor 6L250S0   BANC PQ: 0 ANSI: 5
>sd 0:0:0:0: [sda] 490234752 512-byte hardware sectors (251000 MB)
>sd 0:0:0:0: [sda] Write Protect is off
>sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
>sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
>sd 0:0:0:0: [sda] 490234752 512-byte hardware sectors (251000 MB)
>sd 0:0:0:0: [sda] Write Protect is off
>sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
>sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
> sda: sda1 sda2 sda3
>sd 0:0:0:0: [sda] Attached SCSI disk
>sd 0:0:0:0: Attached scsi generic sg0 type 0
>scsi 1:0:0:0: Direct-Access ATA  Maxtor 6L250S0   BANC PQ: 0 ANSI: 5
>sd 1:0:0:0: [sdb] 490234752 512-byte hardware sectors (251000 MB)
>sd 1:0:0:0: [sdb] Write Protect is off
>sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
>sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
>sd 1:0:0:0: [sdb] 490234752 512-byte hardware sectors (251000 MB)
>sd 1:0:0:0: [sdb] Write Protect is off
>sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
>sd 1:0:0:0: [sdb] Write cac