Hey All,
We recently got a new DDS4 backup drive, as I had thought our old one had
died.
Anyway, when I got the tape drive, and put it in the external housing, it
was suffering the same symptoms as the previous one.
After much testing, I had discovered the problem was not the drive, but the
power supply.

Anyway, I have since got a new power supply and it seems to work OK, but
every few days the backup dies with /dev/nst0: file input/output error.
The kern.log is at the bottom of this email with the card dump. to online
the device, I have to cycle the power to the tape drive, and remove the
device from the scsi bus (echo "scsi remove-single-device 1 0 0 0 0"
>/proc/scsi/scsi) then add it again (which I use rescan-scsi-bus.sh) , and
it comes good again for a while.

The scsi card is a Adaptec 39320, dual host controller, of which I have 3
disks on host 0, and the tape on host 1.
There are no problems with the disks.

Anyone have any Idea's on where I could look, could the power supply have
damaged the tape drive, could it be the scsi card, perhaps this power
supply is damaged?

Thanks,

Scott


kern.log output:
Mar 17 00:40:15 lotus-server kernel: scsi1:0:0:0: Attempting to abort cmd
e56b0680: 0xa 0x1 0x0 0x0 0x40 0x0
Mar 17 00:40:15 lotus-server kernel: scsi1: At time of recovery, card was
not paused
Mar 17 00:40:15 lotus-server kernel: >>>>>>>>>>>>>>>>>> Dump Card State
Begins <<<<<<<<<<<<<<<<<
Mar 17 00:40:15 lotus-server kernel: scsi1: Dumping Card State at program
address 0x207 Mode 0x33
Mar 17 00:40:15 lotus-server kernel: Card was paused
Mar 17 00:40:15 lotus-server kernel: HS_MAILBOX[0x0]
INTCTL[0x80]:(SWTMINTMASK) SEQINTSTAT[0x0]
Mar 17 00:40:15 lotus-server kernel: SAVED_MODE[0x11]
DFFSTAT[0x11]:(CURRFIFO_1|FIFO0FREE)
Mar 17 00:40:15 lotus-server kernel: SCSISIGI[0x4]:(P_DATAOUT|BSYI)
SCSIPHASE[0x0] SCSIBUS[0x47]
Mar 17 00:40:15 lotus-server kernel: LASTPHASE[0x0]:(P_DATAOUT)
SCSISEQ0[0x0] SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI)
Mar 17 00:40:15 lotus-server kernel: SEQCTL0[0x10]:(FASTMODE)
SEQINTCTL[0x0] SEQ_FLAGS[0x20]:(DPHASE)
Mar 17 00:40:15 lotus-server kernel: SEQ_FLAGS2[0x0] SSTAT0[0x0]
SSTAT1[0x0] SSTAT2[0x0]
Mar 17 00:40:15 lotus-server kernel: SSTAT3[0x0]
PERRDIAG[0xc0]:(HIPERR|HIZERO)
SIMODE1[0xac]:(ENSCSIPERR|ENBUSFREE|ENSCSIRST|ENSELTIMO)
Mar 17 00:40:15 lotus-server kernel: LQISTAT0[0x0] LQISTAT1[0x0]
LQISTAT2[0x0] LQOSTAT0[0x0]
Mar 17 00:40:15 lotus-server kernel: LQOSTAT1[0x0] LQOSTAT2[0x80]
Mar 17 00:40:15 lotus-server kernel:
Mar 17 00:40:15 lotus-server kernel: SCB Count = 4 CMDS_PENDING = 1 LASTSCB
0x3 CURRSCB 0x3 NEXTSCB 0x0
Mar 17 00:40:15 lotus-server kernel: qinstart = 50838 qinfifonext = 50838
Mar 17 00:40:15 lotus-server kernel: QINFIFO:
Mar 17 00:40:15 lotus-server kernel: WAITING_TID_QUEUES:
Mar 17 00:40:15 lotus-server kernel: Pending list:
Mar 17 00:40:15 lotus-server kernel:   3 FIFO_USE[0x0]
SCB_CONTROL[0x40]:(DISCENB) SCB_SCSIID[0x7]
Mar 17 00:40:15 lotus-server kernel: Total 1
Mar 17 00:40:15 lotus-server kernel: Kernel Free SCB list: 2 1 0
Mar 17 00:40:15 lotus-server kernel: Sequencer Complete DMA-inprog list:
Mar 17 00:40:15 lotus-server kernel: Sequencer Complete list:
Mar 17 00:40:15 lotus-server kernel: Sequencer DMA-Up and Complete list:
Mar 17 00:40:15 lotus-server kernel:
Mar 17 00:40:15 lotus-server kernel: scsi1: FIFO0 Free, LONGJMP == 0x80ff,
SCB 0x0
Mar 17 00:40:15 lotus-server kernel:
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
Mar 17 00:40:15 lotus-server kernel: SEQINTSRC[0x0] DFCNTRL[0x0]
DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
Mar 17 00:40:15 lotus-server kernel: SG_CACHE_SHADOW[0x2]:(LAST_SEG)
SG_STATE[0x0] DFFSXFRCTL[0x0]
Mar 17 00:40:15 lotus-server kernel: SOFFCNT[0x0]
MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
Mar 17 00:40:15 lotus-server kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x0]
Mar 17 00:40:15 lotus-server kernel: scsi1: FIFO1 Active, LONGJMP == 0x1ec,
SCB 0x3
Mar 17 00:40:15 lotus-server kernel:
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
Mar 17 00:40:15 lotus-server kernel: SEQINTSRC[0x0]
DFCNTRL[0x2c]:(DIRECTION|HDMAEN|SCSIEN)
Mar 17 00:40:15 lotus-server kernel: DFSTATUS[0x0] SG_CACHE_SHADOW[0x28]
SG_STATE[0x3]:(SEGS_AVAIL|LOADING_NEEDED)
Mar 17 00:40:15 lotus-server kernel: DFFSXFRCTL[0x0] SOFFCNT[0x0]
MDFFSTAT[0xc]:(DLZERO|SHVALID)
Mar 17 00:40:15 lotus-server kernel: SHADDR = 0x03e00426c, SHCNT = 0xd94
HADDR = 0x03e004600, HCNT = 0xa00
Mar 17 00:40:15 lotus-server kernel: CCSGCTL[0x10]:(SG_CACHE_AVAIL)
Mar 17 00:40:15 lotus-server kernel: LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
Mar 17 00:40:15 lotus-server kernel: scsi1: LQISTATE = 0x0, LQOSTATE = 0x0,
OPTIONMODE = 0x42
Mar 17 00:40:15 lotus-server kernel: scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT =
0x0
Mar 17 00:40:15 lotus-server kernel: SIMODE0[0xc]:(ENOVERRUN|ENIOERR)
Mar 17 00:40:15 lotus-server kernel: CCSCBCTL[0x4]:(CCSCBDIR)
Mar 17 00:40:15 lotus-server kernel: scsi1: REG0 == 0x3, SINDEX = 0x133,
DINDEX = 0x102
Mar 17 00:40:15 lotus-server kernel: scsi1: SCBPTR == 0x3, SCB_NEXT ==
0xff00, SCB_NEXT2 == 0xff56
Mar 17 00:40:15 lotus-server kernel: CDB a 1 0 0 38 8c
Mar 17 00:40:15 lotus-server kernel: STACK: 0x0 0x0 0x0 0x0 0x0 0x0 0x17
0x207
Mar 17 00:40:15 lotus-server kernel: <<<<<<<<<<<<<<<<< Dump Card State Ends
>>>>>>>>>>>>>>>>>>
Mar 17 00:40:15 lotus-server kernel: DevQ(0:0:0): 0 waiting
Mar 17 00:40:15 lotus-server kernel: scsi1:0:0:0: Device is active,
asserting ATN
Mar 17 00:40:15 lotus-server kernel: Recovery code sleeping
Mar 17 00:40:20 lotus-server kernel: Recovery code awake
Mar 17 00:40:20 lotus-server kernel: Timer Expired
Mar 17 00:40:20 lotus-server kernel: Recovery code sleeping
Mar 17 00:40:25 lotus-server kernel: Recovery code awake
Mar 17 00:40:25 lotus-server kernel: Timer Expired
Mar 17 00:40:25 lotus-server kernel: scsi1: Device reset returning 0x2003
Mar 17 00:40:25 lotus-server kernel: Recovery SCB completes
Mar 17 00:40:25 lotus-server kernel: Recovery SCB completes
Mar 17 00:40:35 lotus-server kernel: scsi: Device offlined - not ready
after error recovery: host 1 channel 0 id 0 lun 0
Mar 17 00:40:35 lotus-server kernel: st0: Error 10000 (sugg. bt 0x0, driver
bt 0x0, host bt 0x1).

-- 
SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/
Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html

Reply via email to