Re: excess timeouts accessing pci-sata card both ports

2008-02-01 Thread Tejun Heo
[EMAIL PROTECTED] wrote:
> Removed 3 drives, both data and power cables.
> Left only port1 connected.
> It's the drive and power cable that failed on port2 in the first test
> and worked OK on port1 in the second test.
> Both cards are plugged in and initialized.
> Third test: port1 alone (sda) works OK.
> Fourth test: port2 alone (sda) fails.
>
> Any idea what's wrong? This does not seem to be a power-related problem.
>
> vanilla 2.6.23.14
> sata_via module... details in the inlined emails.
>
> (4th test config)
> ata1: SATA link down (SStatus 0 SControl 310)
> ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
> ata4: SATA link down (SStatus 0 SControl 310)
> ata5: SATA link down (SStatus 0 SControl 310)
>
> Faulty card? Both cards? Driver bug? Or a design bug/incompatibility on the
> second port?

It seems like you're experiencing transmission problems on reads.
An SError value of 0x3000400 decodes to Protocol Error, Transport State
Transition Error and Unrecognized FIS Type.  Hardresetting the link should
usually recover from such conditions, but error handling on via has never
been reliable.  How easily can you trigger the problem?
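
For reference, the SError value can be decoded bit by bit. The sketch below is only an illustration: it assumes the bit positions of the SATA SError register as mirrored by the SERR_* constants in the kernel's include/linux/libata.h, and the names `SERR_BITS` and `decode_serror` are made up for this example.

```python
# Assumed bit layout: SATA SError register, as mirrored by the SERR_*
# constants in include/linux/libata.h. Names here are illustrative only.
SERR_BITS = {
    0: "recovered data integrity error",
    1: "recovered communications error",
    8: "transient data integrity error",
    9: "persistent communication or data integrity error",
    10: "protocol error",
    11: "internal error",
    16: "PHY ready change",
    17: "PHY internal error",
    18: "COMM wake",
    19: "10b-to-8b decode error",
    20: "disparity error",
    21: "CRC error",
    22: "handshake error",
    23: "link sequence error",
    24: "transport state transition error",
    25: "unrecognized FIS type",
    26: "device exchanged",
}


def decode_serror(value):
    """Return the names of all known bits set in an SError value, lowest bit first."""
    return [name for bit, name in sorted(SERR_BITS.items()) if value & (1 << bit)]


if __name__ == "__main__":
    # SErr value taken from the "exception Emask ..." lines in this thread.
    for name in decode_serror(0x3000400):
        print(name)
```

Under that layout, 0x3000400 has bits 10, 24 and 25 set.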

Thanks.

-- 
tejun


Re: excess timeouts accessing pci-sata card both ports

2008-01-24 Thread drtyc
Hi.

A more thorough test revealed that only the second port of each card is failing.

Accessing just sdb or sdd alone (cat /dev/sdX > /dev/null), one drive at a
time, produces the error message.

Switched the data cables on the first card: ata1 port for ata2 port. This
should rule out a defective power cable to sdb.
Also switched the data cables between the ata4 and ata5 ports. This should
rule out a defective power cable to sdd.
(ata3 and ata6 are the unconnected PATA ports on each card.)

The failures are again on port 2 and port 5, so the drives that originally
worked flawlessly are now the ones failing.


__
 From: [EMAIL PROTECTED]
 To: linux-ide@vger.kernel.org
 Date: 24.01.2008 09:27
 Subject: excess timeouts accessing pci-sata card both ports

Hello.

The scenario:
PCI bus with 2 cards (sata_via)
Each card has 2 SATA ports connected.
Each disk does 40MB/s in hdparm -t read test.

First card has the sda and sdb
Second card has the sdc and sdd
verified by watching the led blinking on the card

ok - accessing two ports at the same time, each port on a different card:
cat /dev/sda > /dev/null & cat /dev/sdc > /dev/null

problem - accessing two ports at the same time, both ports on the same card:
cat /dev/sda > /dev/null & cat /dev/sdb > /dev/null

side effect of the problem:
due to heavy traffic on one port, the reset of the other port fails with
a timeout and the port is disabled
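
The two-port load described above can also be reproduced from a small script that records how far each read got. This is a rough sketch, not a benchmark: the function names `drain` and `read_concurrently` are invented for this example, and reading /dev/sdX normally requires root.

```python
import sys
import threading


def drain(path, chunk_size=1 << 20):
    """Read path sequentially to the end and return the number of bytes read."""
    total = 0
    with open(path, "rb", buffering=0) as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:
                break
            total += len(chunk)
    return total


def read_concurrently(paths):
    """Drain every path in its own thread; map each path to a byte count or the OSError hit."""
    results = {}

    def worker(path):
        try:
            results[path] = drain(path)
        except OSError as exc:
            # An I/O error here is what a disabled port would surface as.
            results[path] = exc

    threads = [threading.Thread(target=worker, args=(p,)) for p in paths]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    # Hypothetical invocation: python3 read_both.py /dev/sda /dev/sdb
    for path, outcome in read_concurrently(sys.argv[1:]).items():
        print(path, outcome)
```

Passing two devices on the same card versus two devices on different cards gives the "problem" and "ok" cases respectively.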

logged error command at the initial problem (is it DMA read ext and DMA
write log?):
ata2.00: exception Emask 0x2 SAct 0x0 SErr 0x3000400 action 0x2 frozen
ata2.00: cmd 25/00:68:b8:83:13/00:01:00:00:00/e0 tag 0 cdb 0x0 data 184320 in
         res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x6 (timeout)
ata2: port is slow to respond, please be patient (Status 0xd0)

ata2.00: exception Emask 0x2 SAct 0x0 SErr 0x3000400 action 0x2 frozen
ata2.00: cmd c8/00:60:70:50:08/00:00:00:00:00/e0 tag 0 cdb 0x0 data 49152 in
         res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x6 (timeout)
ata2: port is slow to respond, please be patient (Status 0xd0)
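
On the question of which commands these are: the "cmd" field can be unpacked by hand. The sketch below assumes the field order libata uses when printing a failed command (command/feature:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device) and the 512-byte sectors reported in the dmesg excerpt. The opcode table lists only four DMA opcodes from the ATA command set, and `decode_cmd` is a name made up for this example.

```python
import re

# Opcode -> (name, uses 48-bit LBA). Only DMA read/write opcodes are listed.
ATA_COMMANDS = {
    0x25: ("READ DMA EXT", True),
    0x35: ("WRITE DMA EXT", True),
    0xC8: ("READ DMA", False),
    0xCA: ("WRITE DMA", False),
}


def decode_cmd(field, sector_size=512):
    """Decode the 12-byte 'cmd' field of a libata error report (assumed order, see above)."""
    b = [int(x, 16) for x in re.split(r"[/:]", field)]
    if len(b) != 12:
        raise ValueError("expected 12 hex bytes, got %d" % len(b))
    cmd, _feat, nsect, lbal, lbam, lbah, _hfeat, hnsect, hlbal, hlbam, hlbah, dev = b
    # Unknown opcodes are treated as 28-bit here, which is only a guess.
    name, lba48 = ATA_COMMANDS.get(cmd, ("unknown", False))
    if lba48:
        # 16-bit sector count; 0 means 65536 sectors.
        sectors = ((hnsect << 8) | nsect) or 65536
        lba = ((hlbah << 40) | (hlbam << 32) | (hlbal << 24)
               | (lbah << 16) | (lbam << 8) | lbal)
    else:
        # 8-bit sector count; 0 means 256 sectors. LBA bits 27:24 sit in the device byte.
        sectors = nsect or 256
        lba = ((dev & 0x0F) << 24) | (lbah << 16) | (lbam << 8) | lbal
    return {"command": name, "lba": lba, "sectors": sectors,
            "bytes": sectors * sector_size}


if __name__ == "__main__":
    for field in ("25/00:68:b8:83:13/00:01:00:00:00/e0",
                  "c8/00:60:70:50:08/00:00:00:00:00/e0"):
        print(field, decode_cmd(field))
```

With these assumptions the first command comes out as READ DMA EXT for 360 sectors (184320 bytes) and the second as READ DMA for 96 sectors (49152 bytes), which agrees with the "data ... in" sizes in the log. Both are reads.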

logged EH sequence failure:
ata2: port is slow to respond, please be patient (Status 0xd0)
ata2: device not ready (errno=-16), forcing hardreset
ata2: hard resetting port
ata2: port is slow to respond, please be patient (Status 0x80)
ata2: COMRESET failed (errno=-16)
ata2: hard resetting port
ata2: port is slow to respond, please be patient (Status 0x80)
ata2: COMRESET failed (errno=-16)
ata2: hard resetting port
ata2: port is slow to respond, please be patient (Status 0x80)
ata2: COMRESET failed (errno=-16)
ata2: hard resetting port
ata2: COMRESET failed (errno=-16)
ata2: reset failed, giving up
ata2.00: disabled
ata2: EH complete
sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
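
The Status values in the EH log can be read the same way. This sketch uses the classic ATA status register bit names (the 0x10, 0x04 and 0x02 bits carry legacy names that later standards redefine), and `decode_status` and `errno_name` are hypothetical helpers for this example.

```python
import errno

# ATA status register bits, highest first (legacy names for 0x10, 0x04, 0x02).
ATA_STATUS_BITS = [
    (0x80, "BSY"),   # device busy
    (0x40, "DRDY"),  # device ready
    (0x20, "DF"),    # device fault
    (0x10, "DSC"),   # device seek complete
    (0x08, "DRQ"),   # data request
    (0x04, "CORR"),  # corrected data
    (0x02, "IDX"),   # index
    (0x01, "ERR"),   # error
]


def decode_status(status):
    """Return the names of the bits set in an ATA status byte."""
    return [name for mask, name in ATA_STATUS_BITS if status & mask]


def errno_name(code):
    """Map a kernel-style negative errno (e.g. -16) to its symbolic name on this platform."""
    return errno.errorcode.get(abs(code), "unknown")


if __name__ == "__main__":
    for status in (0xD0, 0x80):
        print(hex(status), decode_status(status))
    print(-16, errno_name(-16))
```

Both 0xd0 and 0x80 have BSY set, so the port never left the busy state across the hard resets. On Linux, errno 16 is EBUSY.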

tried, did not help:
- multiplying the ATA_TMOUT values by 10 and later by 100 against vanilla
- increasing the retry counters

some lines from dmesg below
libata version 2.21 loaded.
sata_via :00:0d.0: version 2.3
ACPI: PCI Interrupt :00:0d.0[A] -> Link [LNKA] -> GSI 11 (level, low) -> IRQ 11
sata_via :00:0d.0: routed to hard irq line 11
scsi0 : sata_via
scsi1 : sata_via
scsi2 : sata_via
ata1: SATA max UDMA/133 cmd 0x0001b800 ctl 0x0001b80a bmdma 0x0001a400 irq 11
ata2: SATA max UDMA/133 cmd 0x0001b400 ctl 0x0001b40a bmdma 0x0001a408 irq 11
ata3: PATA max UDMA/133 cmd 0x0001b000 ctl 0x0001b00a bmdma 0x0001a410 irq 11
ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
ata1.00: ATA-7: Maxtor 6L250S0, BANC1G10, max UDMA/133
ata1.00: 490234752 sectors, multi 16: LBA48 NCQ (not used)
ata1.00: configured for UDMA/133
ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
ata2.00: ATA-7: Maxtor 6L250S0, BANC1G10, max UDMA/133
ata2.00: 490234752 sectors, multi 16: LBA48 NCQ (not used)
ata2.00: configured for UDMA/133
scsi 0:0:0:0: Direct-Access ATA  Maxtor 6L250S0   BANC PQ: 0 ANSI: 5
sd 0:0:0:0: [sda] 490234752 512-byte hardware sectors (251000 MB)
sd 0:0:0:0: [sda] Write Protect is off
sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
sd 0:0:0:0: [sda] 490234752 512-byte hardware sectors (251000 MB)
sd 0:0:0:0: [sda] Write Protect is off
sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
 sda: sda1 sda2 sda3
sd 0:0:0:0: [sda] Attached SCSI disk
sd 0:0:0:0: Attached scsi generic sg0 type 0
scsi 1:0:0:0: Direct-Access ATA  Maxtor 6L250S0   BANC PQ: 0 ANSI: 5
sd 1:0:0:0: [sdb] 490234752 512-byte hardware sectors (251000 MB)
sd 1:0:0:0: [sdb] Write Protect is off
sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
sd 1:0:0:0: [sdb] 490234752 512-byte hardware sectors (251000 MB)
sd 1:0:0:0: [sdb] Write Protect is off
sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
 sdb: sdb1 sdb2 sdb3

Re: excess timeouts accessing pci-sata card both ports

2008-01-24 Thread drtyc
Hmm.

Removed 3 drives, both data and power cables.
Left only port1 connected.
It's the drive and power cable that failed on port2 in the first test
and worked OK on port1 in the second test.
Both cards are plugged in and initialized.
Third test: port1 alone (sda) works OK.
Fourth test: port2 alone (sda) fails.

Any idea what's wrong? This does not seem to be a power-related problem.

vanilla 2.6.23.14
sata_via module... details in the inlined emails.

(4th test config)
ata1: SATA link down (SStatus 0 SControl 310)
ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
ata4: SATA link down (SStatus 0 SControl 310)
ata5: SATA link down (SStatus 0 SControl 310)
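
The SStatus and SControl values printed above are hexadecimal and pack three 4-bit fields each. The sketch below splits them, assuming the field layout from the Serial ATA specification (DET in bits 3:0, SPD in bits 7:4, IPM in bits 11:8). The description tables cover only the common SStatus values, and `split_fields` and `describe_sstatus` are names invented for this example.

```python
# SStatus field meanings (common values only; assumed from the SATA spec).
DET = {
    0: "no device detected, PHY communication not established",
    1: "device detected, PHY communication not established",
    3: "device detected, PHY communication established",
    4: "PHY offline",
}
SPD = {0: "no negotiated speed", 1: "Gen1 (1.5 Gbps)", 2: "Gen2 (3.0 Gbps)"}
IPM = {0: "no device or no communication", 1: "active", 2: "partial", 6: "slumber"}


def split_fields(value):
    """Split an SStatus or SControl value into its DET, SPD and IPM nibbles."""
    return {"det": value & 0xF, "spd": (value >> 4) & 0xF, "ipm": (value >> 8) & 0xF}


def describe_sstatus(value):
    """Return a human-readable description of an SStatus value."""
    f = split_fields(value)
    return "; ".join([
        DET.get(f["det"], "DET=%d" % f["det"]),
        SPD.get(f["spd"], "SPD=%d" % f["spd"]),
        IPM.get(f["ipm"], "IPM=%d" % f["ipm"]),
    ])


if __name__ == "__main__":
    print("SStatus 113:", describe_sstatus(0x113))
    print("SStatus 0:  ", describe_sstatus(0x0))
    # In SControl the same nibbles are requests rather than status:
    # 0x310 asks for no DET action, a Gen1 speed limit, and no partial/slumber.
    print("SControl 310:", split_fields(0x310))
```

Read this way, SStatus 113 on ata2 says the link itself is up at 1.5 Gbps and in the active power state.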

Faulty card? Both cards? Driver bug? Or a design bug/incompatibility on the
second port?