Daniel,

You may want to try an LSI 9211-8i HBA.

I have found that 6G HBAs work better in our environment.

Rocky 
-----Original Message-----
From: storage-discuss-boun...@opensolaris.org
[mailto:storage-discuss-boun...@opensolaris.org] On Behalf Of Daniel J.
Priem
Sent: Sunday, April 17, 2011 10:57 PM
To: storage-discuss@opensolaris.org
Subject: [storage-discuss] mpt driver issues

Hi,
I am running 10 u9 for testing purposes and regularly see

command timeout for Target

which results in the entire system hanging.
The only way to "solve" this is to reset the system.


I have read many posts about this problem.
My initial setup was:
- sc847e16 with single expander
- one lsi 1068
- 24 harddrives

problem still exists

then i replaced all 24 sata drives with enterprise SAS drives

problem still exists

then added an additional lsi 1068 so that the front and back have their
own controller

problem still exists

replaced the entire sc847e16 expander chassis with an expanderless one,
so now all 24 drives are connected to four 8-channel lsi 1068 controllers

problem still exists.

updated all firmware on controller

problem still exists

also added
set mpt:mpt_enable_msi = 0
set mptsas:mptsas_enable_msi=0

Any useful help is welcome.
I am thinking about switching back to 10 u8 for later production use.

best regards
daniel


I set up remote syslogging and see the output below.
Interestingly, all errors begin at the same time.

 20:36:26  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0 (mpt0):
 20:36:26  command timeout for Target 0
 20:36:27  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0 (mpt1):
 20:36:27  command timeout for Target 7
 20:36:27  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340a@3/pci1000,3140@0 (mpt3):
 20:36:27  command timeout for Target 6
 20:36:27  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340c@5/pci1000,3140@0 (mpt2):
 20:36:27  command timeout for Target 10
 20:37:28  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0 (mpt1):
 20:37:28  command timeout for Target 4
 20:37:28  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340a@3/pci1000,3140@0 (mpt3):
 20:37:28  command timeout for Target 3
 20:37:39  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0 (mpt0):
 20:37:39  command timeout for Target 1
 20:37:39  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0/sd@0,0 (sd51):
 20:37:39  for Command: write(10)               Error Level: Retryable
 20:37:39  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340c@5/pci1000,3140@0 (mpt2):
 20:37:39  command timeout for Target 10
 20:37:39  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340c@5/pci1000,3140@0/sd@a,0 (sd48):
 20:37:39  for Command: write(10)               Error Level: Retryable
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0 (mpt0):
 20:38:40  command timeout for Target 0
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0/sd@1,0 (sd5):
 20:38:40  for Command: write(10)               Error Level: Retryable
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0/sd@0,0 (sd51):
 20:38:40  transport failed: reason 'reset': retrying command
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0 (mpt1):
 20:38:40  command timeout for Target 7
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0/sd@4,0 (sd7):
 20:38:40  for Command: write(10)               Error Level: Retryable
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0/sd@7,0 (sd14):
 20:38:40  transport failed: reason 'reset': retrying command
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340a@3/pci1000,3140@0 (mpt3):
 20:38:40  command timeout for Target 13
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340a@3/pci1000,3140@0/sd@3,0 (sd43):
 20:38:40  for Command: write(10)               Error Level: Retryable
 20:38:41  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340c@5/pci1000,3140@0 (mpt2):
 20:38:41  command timeout for Target 9
 20:38:41  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340c@5/pci1000,3140@0/sd@a,0 (sd48):
 20:38:41  for Command: write(10)               Error Level: Retryable
 20:39:41  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0 (mpt1):
 20:39:41  command timeout for Target 5
 20:39:41  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0/sd@7,0 (sd14):
 20:39:41  for Command: write(10)               Error Level: Retryable
 20:39:42  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340a@3/pci1000,3140@0 (mpt3):
 20:39:42  command timeout for Target 3
 20:39:42  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340a@3/pci1000,3140@0/sd@3,0 (sd43):
 20:39:42  transport failed: reason 'reset': retrying command
 20:39:52  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0 (mpt0):
 20:39:52  command timeout for Target 1
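
To check the observation that the timeouts start at the same moment on every controller, the log can be grouped by mpt instance. This is only a minimal sketch: it assumes the three-line wrapped layout shown above (header line, device path ending in "(mptN):", then the message line), it ignores a clock wrap at midnight, and the function names are made up for illustration. The sample is the first four events from the log above.

```python
import re
from collections import defaultdict

# Device-path line, e.g. "/pci@0,0/.../pci1000,3140@0 (mpt0):" or "... (sd51):"
INSTANCE_RE = re.compile(r"\(([a-z]+)(\d+)\):\s*$")
# Message line, e.g. " 20:36:26  command timeout for Target 0"
TIMEOUT_RE = re.compile(r"^\s*(\d{2}:\d{2}:\d{2})\s+command timeout for Target (\d+)")


def timeouts_by_controller(lines):
    """Return {"mptN": [(time, target), ...]} in log order."""
    events = defaultdict(list)
    current = None  # mpt instance named on the most recent device-path line
    for line in lines:
        m = INSTANCE_RE.search(line)
        if m:
            # Only controller (mpt) lines are tracked; sd lines clear the state.
            current = m.group(1) + m.group(2) if m.group(1) == "mpt" else None
            continue
        m = TIMEOUT_RE.match(line)
        if m and current is not None:
            events[current].append((m.group(1), int(m.group(2))))
    return dict(events)


def first_timeout_spread(events):
    """Seconds between the earliest and the latest *first* timeout per controller."""
    def secs(hms):
        h, m, s = map(int, hms.split(":"))
        return h * 3600 + m * 60 + s
    firsts = [secs(entries[0][0]) for entries in events.values()]
    return max(firsts) - min(firsts)


SAMPLE = """\
 20:36:26  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0 (mpt0):
 20:36:26  command timeout for Target 0
 20:36:27  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0 (mpt1):
 20:36:27  command timeout for Target 7
 20:36:27  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340a@3/pci1000,3140@0 (mpt3):
 20:36:27  command timeout for Target 6
 20:36:27  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340c@5/pci1000,3140@0 (mpt2):
 20:36:27  command timeout for Target 10
"""

if __name__ == "__main__":
    events = timeouts_by_controller(SAMPLE.splitlines())
    for name in sorted(events):
        print(name, events[name])
    print("spread of first timeouts (s):", first_timeout_spread(events))
```

A small spread across all four mpt instances would suggest that the trigger is something the controllers share rather than a single drive or cable, but that is an interpretation, not something the log proves.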


In the messages file I have:
 20:36:26  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0 (mpt0):
 20:36:26  command timeout for Target 0
 20:36:27  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0 (mpt1):
 20:36:27  command timeout for Target 7
 20:36:27  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340a@3/pci1000,3140@0 (mpt3):
 20:36:27  command timeout for Target 6
 20:36:27  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340c@5/pci1000,3140@0 (mpt2):
 20:36:27  command timeout for Target 10
 20:37:28  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0 (mpt1):
 20:37:28  command timeout for Target 4
 20:37:28  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340a@3/pci1000,3140@0 (mpt3):
 20:37:28  command timeout for Target 3
 20:37:39  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0 (mpt0):
 20:37:39  command timeout for Target 1
 20:37:39  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0/sd@0,0 (sd51):
 20:37:39  for Command: write(10)               Error Level: Retryable
 20:37:39  scsi: [ID 107833 kern.notice] \011Requested Block: 728396797
Error Block: 728396797
 20:37:39  scsi: [ID 107833 kern.notice] \011Vendor: SEAGATE
Serial Number: 9WJ0K4NK
 20:37:39  scsi: [ID 107833 kern.notice] \011Sense Key: Unit Attention
 20:37:39  scsi: [ID 107833 kern.notice] \011ASC: 0x29 (scsi bus reset
occurred), ASCQ: 0x2, FRU: 0x2
 20:37:39  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340c@5/pci1000,3140@0 (mpt2):
 20:37:39  command timeout for Target 10
 20:37:39  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340c@5/pci1000,3140@0/sd@a,0 (sd48):
 20:37:39  for Command: write(10)               Error Level: Retryable
 20:37:39  scsi: [ID 107833 kern.notice] \011Requested Block: 791838419
Error Block: 791838419
 20:37:39  scsi: [ID 107833 kern.notice] \011Vendor: SEAGATE
Serial Number: 9WJ0JFJ6
 20:37:39  scsi: [ID 107833 kern.notice] \011Sense Key: Unit Attention
 20:37:39  scsi: [ID 107833 kern.notice] \011ASC: 0x29 (scsi bus reset
occurred), ASCQ: 0x2, FRU: 0x2
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0 (mpt0):
 20:38:40  command timeout for Target 0
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0/sd@1,0 (sd5):
 20:38:40  for Command: write(10)               Error Level: Retryable
 20:38:40  scsi: [ID 107833 kern.notice] \011Requested Block: 791838419
Error Block: 791838419
 20:38:40  scsi: [ID 107833 kern.notice] \011Vendor: SEAGATE
Serial Number: 9WJ0K45D
 20:38:40  scsi: [ID 107833 kern.notice] \011Sense Key: Unit Attention
 20:38:40  scsi: [ID 107833 kern.notice] \011ASC: 0x29 (scsi bus reset
occurred), ASCQ: 0x2, FRU: 0x2
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,3a40@1c/pci1000,3140@0/sd@0,0 (sd51):
 20:38:40  transport failed: reason 'reset': retrying command
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0 (mpt1):
 20:38:40  command timeout for Target 7
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0/sd@4,0 (sd7):
 20:38:40  for Command: write(10)               Error Level: Retryable
 20:38:40  scsi: [ID 107833 kern.notice] \011Requested Block: 317608183
Error Block: 317608183
 20:38:40  scsi: [ID 107833 kern.notice] \011Vendor: SEAGATE
Serial Number: 9WJ0K4KL
 20:38:40  scsi: [ID 107833 kern.notice] \011Sense Key: Unit Attention
 20:38:40  scsi: [ID 107833 kern.notice] \011ASC: 0x29 (scsi bus reset
occurred), ASCQ: 0x2, FRU: 0x2
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340e@7/pci1000,3140@0/sd@7,0 (sd14):
 20:38:40  transport failed: reason 'reset': retrying command
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340a@3/pci1000,3140@0 (mpt3):
 20:38:40  command timeout for Target 13
 20:38:40  scsi: [ID 107833 kern.warning] WARNING:
/pci@0,0/pci8086,340a@3/pci1000,3140@0/sd@3,0 (sd43):
 20:38:40  for Command: write(10)               Error Level: Retryable



The zpool layout looks like this:
# zpool status
  pool: sasinfra
 state: ONLINE
 scrub: none requested
config:

        NAME         STATE     READ WRITE CKSUM
        sasinfra     ONLINE       0     0     0
          mirror-0   ONLINE       0     0     0
            c3t4d0   ONLINE       0     0     0
            c0t5d0   ONLINE       0     0     0
          mirror-1   ONLINE       0     0     0
            c3t5d0   ONLINE       0     0     0
            c0t6d0   ONLINE       0     0     0
          mirror-2   ONLINE       0     0     0
            c3t6d0   ONLINE       0     0     0
            c0t7d0   ONLINE       0     0     0
          mirror-3   ONLINE       0     0     0
            c4t9d0   ONLINE       0     0     0
            c2t0d0   ONLINE       0     0     0
          mirror-4   ONLINE       0     0     0
            c4t10d0  ONLINE       0     0     0
            c2t1d0   ONLINE       0     0     0
        spares
          c4t13d0    AVAIL
          c4t14d0    AVAIL

errors: No known data errors

  pool: sasterra
 state: ONLINE
 scrub: none requested
config:

        NAME        STATE     READ WRITE CKSUM
        sasterra    ONLINE       0     0     0
          mirror-0  ONLINE       0     0     0
            c3t0d0  ONLINE       0     0     0
            c0t1d0  ONLINE       0     0     0
          mirror-1  ONLINE       0     0     0
            c3t1d0  ONLINE       0     0     0
            c0t2d0  ONLINE       0     0     0
          mirror-2  ONLINE       0     0     0
            c3t2d0  ONLINE       0     0     0
            c0t3d0  ONLINE       0     0     0
          mirror-3  ONLINE       0     0     0
            c3t3d0  ONLINE       0     0     0
            c0t4d0  ONLINE       0     0     0
        spares
          c4t13d0   AVAIL
          c4t14d0   AVAIL

errors: No known data errors

  pool: sysstg6
 state: ONLINE
 scrub: none requested
config:

        NAME           STATE     READ WRITE CKSUM
        sysstg6        ONLINE       0     0     0
          mirror-0     ONLINE       0     0     0
            c3t13d0s0  ONLINE       0     0     0
            c0t0d0s0   ONLINE       0     0     0

errors: No known data errors
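
In the output above, every mirror pairs a disk on one controller number with a disk on another (c3 with c0, or c4 with c2). A rough way to confirm this from saved `zpool status` text is sketched below. It assumes device names of the form cNtNdN with an optional slice suffix, as in the listing, and the function name is hypothetical. It says nothing about which cN corresponds to which mpt instance, since that mapping is not shown in this thread.

```python
import re

# Matches names such as c3t13d0s0 or c0t5d0 and captures the controller part.
DEVICE_RE = re.compile(r"^(c\d+)t\d+d\d+(?:s\d+)?$")


def mirror_controllers(status_text):
    """Map "pool/mirror-N" to the controller prefixes of its member disks."""
    result = {}
    pool = None
    mirror = None
    for line in status_text.splitlines():
        fields = line.split()
        if not fields:
            continue
        if fields[0] == "pool:":
            pool, mirror = fields[1], None
        elif fields[0].startswith("mirror-"):
            mirror = f"{pool}/{fields[0]}"
            result[mirror] = []
        elif fields[0] == "spares":
            mirror = None  # spare disks do not belong to any mirror
        else:
            m = DEVICE_RE.match(fields[0])
            if m and mirror is not None:
                result[mirror].append(m.group(1))
    return result


SAMPLE = """\
  pool: sysstg6
 state: ONLINE
 scrub: none requested
config:

        NAME           STATE     READ WRITE CKSUM
        sysstg6        ONLINE       0     0     0
          mirror-0     ONLINE       0     0     0
            c3t13d0s0  ONLINE       0     0     0
            c0t0d0s0   ONLINE       0     0     0

errors: No known data errors
"""

if __name__ == "__main__":
    for name, ctrls in mirror_controllers(SAMPLE).items():
        status = "split across controllers" if len(set(ctrls)) > 1 else "single controller"
        print(name, ctrls, status)
```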


_______________________________________________
storage-discuss mailing list
storage-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/storage-discuss
