Hello,
I'm having an issue with a PERC 6/i card and I was hoping to get some
guidance from the gurus on this list.
We're having an issue with the controller 'puncturing bad blocks', but
it's 'remembering' the sectors after swapping out with a hotspare.
Example:
*Nov 1st -> hotswapped S3*
a0 PERC 6/i Integrated bios:2.04.00 fw:1.22.02-0612 encl:1
ldrv:2 rbld:30% mem:256MiB batt:good/4054mV/26C
a0d0 136GiB RAID 1 1x2 optimal
row 0: a0e32s0 a0e32s1
a0d1 2TiB RAID 5 1x4 optimal
row 0: a0e32s2 a0e32s3 a0e32s4 a0e32s5
a0e32s0 SEAGATE ST3146356SS rev:HS0F s/n:3QN23Y2J
136GiB a0d0 online
a0e32s1 SEAGATE ST3146356SS rev:HS0F s/n:3QN260CF
136GiB a0d0 online
a0e32s2 SEAGATE ST31000640SS rev:MS0A s/n:9QJ5NX2F
931GiB a0d1 online
a0e32s3 SEAGATE ST31000640SS rev:MS0A s/n:9QJ5NDVC
931GiB a0d1 online errs: media:76 other:2
a0e32s4 SEAGATE ST31000640SS rev:MS0A s/n:9QJ6BWRR
931GiB a0d1 online
a0e32s5 SEAGATE ST31000640SS rev:MS0A s/n:9QJ5NX2V
931GiB a0d1 online
*Nov 3rd -> hotswapped s4*
a0 PERC 6/i Integrated bios:2.04.00 fw:1.22.02-0612 encl:1
ldrv:2 rbld:30% mem:256MiB batt:good/4044mV/26C
a0d0 136GiB RAID 1 1x2 optimal
row 0: a0e32s0 a0e32s1
a0d1 2TiB RAID 5 1x4 optimal
row 0: a0e32s2 a0e32s3 a0e32s4 a0e32s5
a0e32s0 SEAGATE ST3146356SS rev:HS0F s/n:3QN23Y2J
136GiB a0d0 online
a0e32s1 SEAGATE ST3146356SS rev:HS0F s/n:3QN260CF
136GiB a0d0 online
a0e32s2 SEAGATE ST31000640SS rev:MS0A s/n:9QJ5NX2F
931GiB a0d1 online
a0e32s3 SEAGATE ST31000640SS rev:0004 s/n:9QJ1WP3P
931GiB a0d1 online errs: media:0 other:1
a0e32s4 SEAGATE ST31000640SS rev:MS0A s/n:9QJ6BWRR
931GiB a0d1 online errs: media:76 other:0
*Nov 4th -> hotswapped s3*
a0 PERC 6/i Integrated bios:2.04.00 fw:1.22.02-0612 encl:1
ldrv:2 rbld:30% mem:256MiB batt:good/4038mV/26C
a0d0 136GiB RAID 1 1x2 optimal
row 0: a0e32s0 a0e32s1
a0d1 2TiB RAID 5 1x4 optimal
row 0: a0e32s2 a0e32s3 a0e32s4 a0e32s5
a0e32s0 SEAGATE ST3146356SS rev:HS0F s/n:3QN23Y2J
136GiB a0d0 online
a0e32s1 SEAGATE ST3146356SS rev:HS0F s/n:3QN260CF
136GiB a0d0 online
a0e32s2 SEAGATE ST31000640SS rev:MS0A s/n:9QJ5NX2F
931GiB a0d1 online
a0e32s3 SEAGATE ST31000640SS rev:0004 s/n:9QJ1WP3P
931GiB a0d1 online errs: media:76 other:1 predictive-failure
a0e32s4 SEAGATE ST31000640SS rev:MS0A s/n:9QJ63CGY
931GiB a0d1 online
a0e32s5 SEAGATE ST31000640SS rev:MS0A s/n:9QJ5NX2V
931GiB a0d1 online
*Today:
*a0 PERC 6/i Integrated bios:2.04.00 fw:1.22.02-0612 encl:1
ldrv:2 rbld:30% mem:256MiB batt:good/4019mV/26C
a0d0 136GiB RAID 1 1x2 optimal
row 0: a0e32s0 a0e32s1
a0d1 2TiB RAID 5 1x4 optimal
row 0: a0e32s2 a0e32s3 a0e32s4 a0e32s5
a0e32s0 SEAGATE ST3146356SS rev:HS0F s/n:3QN23Y2J
136GiB a0d0 online
a0e32s1 SEAGATE ST3146356SS rev:HS0F s/n:3QN260CF
136GiB a0d0 online
a0e32s2 SEAGATE ST31000640SS rev:MS0A s/n:9QJ5NX2F
931GiB a0d1 online
a0e32s3 SEAGATE ST31000640SS rev:MS04 s/n:9QJ636RE
931GiB a0d1 online errs: media:76 other:0
a0e32s4 SEAGATE ST31000640SS rev:MS0A s/n:9QJ6C4XE
931GiB a0d1 online
a0e32s5 SEAGATE ST31000640SS rev:MS0A s/n:9QJ5NX2V
931GiB a0d1 online
As you can see, the '76' errors seem to be moving from pd to pd, but
even though they are different drives. This is leading me to believe
that some how the controller is taking these sectors offline because
it's remembering them somehow. Has anyone seen this before, or perhaps
have any suggestions on how I should proceed?
Wall of text logs below. Thanks again for any help!!
11/07/10 19:16:55: EVT#00372-11/07/10 19:16:55: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f e9 00 00 00 80 00,
Sense: 3/11/00
11/07/10 19:16:55: Raw Sense for PD 3: f0 00 03 74 2f e9 37 0a 00 00 00
00 11 00 81 80 00 96
11/07/10 19:16:55: DEV_REC:Medium Error DevId[3] Tgt 3 RDM=a05caa00
retires=0
11/07/10 19:16:55: MedErr is for: cmdId=422, ld=1, src=4, cmd=1,
lba=e85fd2, cnt=80, rmwOp=0
11/07/10 19:16:55: -> recoveryChild: ld=1 orgLi=0 recPhysArm=2
badPhysArm=ff doneFun=a0c02268 sRef=0 eRef=7f recFlags=0
11/07/10 19:16:55: -> RecParent: cmdId=422, src=4, cmd=1, lba=e85fd2,
cnt=80, rmwOp=0, refs=0/7f
11/07/10 19:16:55: ErrLBAOffset (37) LBA(742fe900) BadLba=742fe937
11/07/10 19:16:55: EVT#00373-11/07/10 19:16:55: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fe937
11/07/10 19:16:55: BBMProcessReadError: RECOVERY, pd=03,
pdErrLba=742fe937 - puncture source/target drives
11/07/10 19:16:55: BBMMarkBadBlock: pd=03, pdLBA=742fe937
11/07/10 19:16:55: BBMMarkBadBlock: pd=04, pdLBA=742fe937
T51: EVT#00046-T51: 91=Inserted: PD 03(e0x20/s3)
T51: EVT#00047-T51: 247=Inserted: PD 03(e0x20/s3) Info: enclPd=20,
scsiType=0, portMap=03, sasAddr=5000c50010157171,0000000000000000
T56: EVT#00055-T56: 114=State change on PD 03(e0x20/s3) from
UNCONFIGURED_GOOD(0) to ONLINE(18)
T49: EVT#00087-T49: 91=Inserted: PD 03(e0x20/s3)
T49: EVT#00088-T49: 247=Inserted: PD 03(e0x20/s3) Info: enclPd=20,
scsiType=0, portMap=03, sasAddr=5000c50010157171,0000000000000000
11/07/10 19:16:55: EVT#00372-11/07/10 19:16:55: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f e9 00 00 00 80 00,
Sense: 3/11/00
11/07/10 19:16:55: EVT#00373-11/07/10 19:16:55: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fe937
11/07/10 19:16:58: EVT#00374-11/07/10 19:16:58: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f e9 38 00 00 48 00,
Sense: 3/11/00
11/07/10 19:16:58: EVT#00375-11/07/10 19:16:58: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fe938
11/07/10 19:17:01: EVT#00376-11/07/10 19:17:01: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f ed 00 00 00 80 00,
Sense: 3/11/00
11/07/10 19:17:01: EVT#00377-11/07/10 19:17:01: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fed49
11/07/10 19:17:04: EVT#00378-11/07/10 19:17:04: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f f1 00 00 00 80 00,
Sense: 3/11/00
11/07/10 19:17:04: EVT#00379-11/07/10 19:17:04: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742ff15b
11/07/10 19:17:06: EVT#00380-11/07/10 19:17:06: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f e9 39 00 00 47 00,
Sense: 3/11/00
11/07/10 19:17:06: EVT#00381-11/07/10 19:17:06: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fe93a
11/07/10 19:17:08: EVT#00382-11/07/10 19:17:08: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f ed 4a 00 00 36 00,
Sense: 3/11/00
11/07/10 19:17:09: EVT#00383-11/07/10 19:17:09: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fed4a
11/07/10 19:17:11: EVT#00384-11/07/10 19:17:11: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f e9 3b 00 00 45 00,
Sense: 3/11/00
11/07/10 19:17:11: EVT#00385-11/07/10 19:17:11: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fe93b
11/07/10 19:17:14: EVT#00386-11/07/10 19:17:14: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f ed 4b 00 00 35 00,
Sense: 3/11/00
11/07/10 19:17:14: EVT#00387-11/07/10 19:17:14: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fed4d
11/07/10 19:17:17: EVT#00388-11/07/10 19:17:17: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f f1 5c 00 00 24 00,
Sense: 3/11/00
11/07/10 19:17:17: EVT#00389-11/07/10 19:17:17: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742ff15c
11/07/10 19:17:20: EVT#00390-11/07/10 19:17:20: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f e9 3c 00 00 44 00,
Sense: 3/11/00
11/07/10 19:17:20: EVT#00391-11/07/10 19:17:20: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fe93c
11/07/10 19:17:23: EVT#00392-11/07/10 19:17:23: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f ed 4e 00 00 32 00,
Sense: 3/11/00
11/07/10 19:17:23: EVT#00393-11/07/10 19:17:23: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fed50
11/07/10 19:17:26: EVT#00394-11/07/10 19:17:26: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f f1 5d 00 00 23 00,
Sense: 3/11/00
11/07/10 19:17:26: EVT#00395-11/07/10 19:17:26: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742ff15e
11/07/10 19:17:29: EVT#00396-11/07/10 19:17:29: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f f9 00 00 00 80 00,
Sense: 3/11/00
11/07/10 19:17:29: EVT#00397-11/07/10 19:17:29: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742ff97d
11/07/10 19:17:32: EVT#00398-11/07/10 19:17:32: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f e9 3d 00 00 43 00,
Sense: 3/11/00
11/07/10 19:17:32: EVT#00399-11/07/10 19:17:32: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fe93d
11/07/10 19:17:34: EVT#00400-11/07/10 19:17:34: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f ed 51 00 00 2f 00,
Sense: 3/11/00
11/07/10 19:17:34: EVT#00401-11/07/10 19:17:34: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fed51
11/07/10 19:17:37: EVT#00402-11/07/10 19:17:37: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f f1 5f 00 00 21 00,
Sense: 3/11/00
11/07/10 19:17:37: EVT#00403-11/07/10 19:17:37: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742ff15f
11/07/10 19:17:40: EVT#00404-11/07/10 19:17:40: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f f9 80 00 00 80 00,
Sense: 3/11/00
11/07/10 19:17:40: EVT#00405-11/07/10 19:17:40: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742ff981
11/07/10 19:17:43: EVT#00406-11/07/10 19:17:43: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f e9 3e 00 00 42 00,
Sense: 3/11/00
11/07/10 19:17:43: EVT#00407-11/07/10 19:17:43: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fe93e
11/07/10 19:17:45: EVT#00408-11/07/10 19:17:45: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f f9 7e 00 00 02 00,
Sense: 3/11/00
11/07/10 19:17:46: EVT#00409-11/07/10 19:17:46: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742ff97e
11/07/10 19:17:48: EVT#00410-11/07/10 19:17:48: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f ed 52 00 00 2e 00,
Sense: 3/11/00
11/07/10 19:17:48: EVT#00411-11/07/10 19:17:48: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fed55
11/07/10 19:17:51: EVT#00412-11/07/10 19:17:51: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f f1 60 00 00 20 00,
Sense: 3/11/00
11/07/10 19:17:51: EVT#00413-11/07/10 19:17:51: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742ff160
11/07/10 19:17:54: EVT#00415-11/07/10 19:17:54: 113=Unexpected sense: PD
03(e0x20/s3) Path 5000c50010157171, CDB: 28 00 74 2f e9 3f 00 00 41 00,
Sense: 3/11/00
11/07/10 19:17:54: EVT#00416-11/07/10 19:17:54: 111=Unrecoverable medium
error during recovery on PD 03(e0x20/s3) at 742fe93f
11/07/10 19:17:55: EVT#00419-11/07/10 19:17:55: 97=Puncturing bad block
on PD 03(e0x20/s3) at 742ff981
11/07/10 19:17:55: EVT#00424-11/07/10 19:17:55: 97=Puncturing bad block
on PD 03(e0x20/s3) at 742ff97d
11/07/10 19:17:55: EVT#00426-11/07/10 19:17:55: 97=Puncturing bad block
on PD 03(e0x20/s3) at 742ff97e
11/07/10 19:17:55: EVT#00428-11/07/10 19:17:55: 97=Puncturing bad block
on PD 03(e0x20/s3) at 742fed55
11/07/10 19:17:55: EVT#00429-11/07/10 19:17:55: 97=Puncturing bad block
on PD 03(e0x20/s3) at 742fed51
11/07/10 19:17:56: EVT#00430-11/07/10 19:17:56: 97=Puncturing bad block
on PD 03(e0x20/s3) at 742fed50
11/07/10 19:17:56: EVT#00431-11/07/10 19:17:56: 97=Puncturing bad block
on PD 03(e0x20/s3) at 742fed4d
11/07/10 19:17:56: EVT#00432-11/07/10 19:17:56: 97=Puncturing bad block
on PD 03(e0x20/s3) at 742fed4a
11/07/10 19:17:56: EVT#00433-11/07/10 19:17:56: 97=Puncturing bad block
on PD 03(e0x20/s3) at 742fed49
_______________________________________________
Linux-PowerEdge mailing list
[email protected]
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq