Hey All, We recently got a new DDS4 backup drive, as I had thought our old one had died. Anyway, when I got the tape drive, and put it in the external housing, it was suffering the same symptoms as the previous one. After much testing, I had discovered the problem was not the drive, but the power supply. Anyway, I have since got a new power supply and it seems to work OK, but every few days the backup dies with /dev/nst0: file input/output error. The kern.log is at the bottom of this email with the card dump. to online the device, I have to cycle the power to the tape drive, and remove the device from the scsi bus (echo "scsi remove-single-device 1 0 0 0 0" >/proc/scsi/scsi) then add it again (which I use rescan-scsi-bus.sh) , and it comes good again for a while. The scsi card is a Adaptec 39320, dual host controller, of which I have 3 disks on host 0, and the tape on host 1. There are no problems with the disks. Anyone have any Idea's on where I could look, could the power supply have damaged the tape drive, could it be the scsi card, perhaps this power supply is damaged? Thanks, Scott kern.log output: Mar 17 00:40:15 lotus-server kernel: scsi1:0:0:0: Attempting to abort cmd e56b0680: 0xa 0x1 0x0 0x0 0x40 0x0 Mar 17 00:40:15 lotus-server kernel: scsi1: At time of recovery, card was not paused Mar 17 00:40:15 lotus-server kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< Mar 17 00:40:15 lotus-server kernel: scsi1: Dumping Card State at program address 0x207 Mode 0x33 Mar 17 00:40:15 lotus-server kernel: Card was paused Mar 17 00:40:15 lotus-server kernel: HS_MAILBOX[0x0] INTCTL[0x80]:(SWTMINTMASK) SEQINTSTAT[0x0] Mar 17 00:40:15 lotus-server kernel: SAVED_MODE[0x11] DFFSTAT[0x11]:(CURRFIFO_1|FIFO0FREE) Mar 17 00:40:15 lotus-server kernel: SCSISIGI[0x4]:(P_DATAOUT|BSYI) SCSIPHASE[0x0] SCSIBUS[0x47] Mar 17 00:40:15 lotus-server kernel: LASTPHASE[0x0]:(P_DATAOUT) SCSISEQ0[0x0] SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI) Mar 17 00:40:15 lotus-server kernel: SEQCTL0[0x10]:(FASTMODE) SEQINTCTL[0x0] SEQ_FLAGS[0x20]:(DPHASE) Mar 17 00:40:15 lotus-server kernel: SEQ_FLAGS2[0x0] SSTAT0[0x0] SSTAT1[0x0] SSTAT2[0x0] Mar 17 00:40:15 lotus-server kernel: SSTAT3[0x0] PERRDIAG[0xc0]:(HIPERR|HIZERO) SIMODE1[0xac]:(ENSCSIPERR|ENBUSFREE|ENSCSIRST|ENSELTIMO) Mar 17 00:40:15 lotus-server kernel: LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0] Mar 17 00:40:15 lotus-server kernel: LQOSTAT1[0x0] LQOSTAT2[0x80] Mar 17 00:40:15 lotus-server kernel: Mar 17 00:40:15 lotus-server kernel: SCB Count = 4 CMDS_PENDING = 1 LASTSCB 0x3 CURRSCB 0x3 NEXTSCB 0x0 Mar 17 00:40:15 lotus-server kernel: qinstart = 50838 qinfifonext = 50838 Mar 17 00:40:15 lotus-server kernel: QINFIFO: Mar 17 00:40:15 lotus-server kernel: WAITING_TID_QUEUES: Mar 17 00:40:15 lotus-server kernel: Pending list: Mar 17 00:40:15 lotus-server kernel: 3 FIFO_USE[0x0] SCB_CONTROL[0x40]:(DISCENB) SCB_SCSIID[0x7] Mar 17 00:40:15 lotus-server kernel: Total 1 Mar 17 00:40:15 lotus-server kernel: Kernel Free SCB list: 2 1 0 Mar 17 00:40:15 lotus-server kernel: Sequencer Complete DMA-inprog list: Mar 17 00:40:15 lotus-server kernel: Sequencer Complete list: Mar 17 00:40:15 lotus-server kernel: Sequencer DMA-Up and Complete list: Mar 17 00:40:15 lotus-server kernel: Mar 17 00:40:15 lotus-server kernel: scsi1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0 Mar 17 00:40:15 lotus-server kernel: SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS) Mar 17 00:40:15 lotus-server kernel: SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL) Mar 17 00:40:15 lotus-server kernel: SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0] Mar 17 00:40:15 lotus-server kernel: SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0 Mar 17 00:40:15 lotus-server kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x0] Mar 17 00:40:15 lotus-server kernel: scsi1: FIFO1 Active, LONGJMP == 0x1ec, SCB 0x3 Mar 17 00:40:15 lotus-server kernel: SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS) Mar 17 00:40:15 lotus-server kernel: SEQINTSRC[0x0] DFCNTRL[0x2c]:(DIRECTION|HDMAEN|SCSIEN) Mar 17 00:40:15 lotus-server kernel: DFSTATUS[0x0] SG_CACHE_SHADOW[0x28] SG_STATE[0x3]:(SEGS_AVAIL|LOADING_NEEDED) Mar 17 00:40:15 lotus-server kernel: DFFSXFRCTL[0x0] SOFFCNT[0x0] MDFFSTAT[0xc]:(DLZERO|SHVALID) Mar 17 00:40:15 lotus-server kernel: SHADDR = 0x03e00426c, SHCNT = 0xd94 HADDR = 0x03e004600, HCNT = 0xa00 Mar 17 00:40:15 lotus-server kernel: CCSGCTL[0x10]:(SG_CACHE_AVAIL) Mar 17 00:40:15 lotus-server kernel: LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 Mar 17 00:40:15 lotus-server kernel: scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42 Mar 17 00:40:15 lotus-server kernel: scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0 Mar 17 00:40:15 lotus-server kernel: SIMODE0[0xc]:(ENOVERRUN|ENIOERR) Mar 17 00:40:15 lotus-server kernel: CCSCBCTL[0x4]:(CCSCBDIR) Mar 17 00:40:15 lotus-server kernel: scsi1: REG0 == 0x3, SINDEX = 0x133, DINDEX = 0x102 Mar 17 00:40:15 lotus-server kernel: scsi1: SCBPTR == 0x3, SCB_NEXT == 0xff00, SCB_NEXT2 == 0xff56 Mar 17 00:40:15 lotus-server kernel: CDB a 1 0 0 38 8c Mar 17 00:40:15 lotus-server kernel: STACK: 0x0 0x0 0x0 0x0 0x0 0x0 0x17 0x207 Mar 17 00:40:15 lotus-server kernel: <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> Mar 17 00:40:15 lotus-server kernel: DevQ(0:0:0): 0 waiting Mar 17 00:40:15 lotus-server kernel: scsi1:0:0:0: Device is active, asserting ATN Mar 17 00:40:15 lotus-server kernel: Recovery code sleeping Mar 17 00:40:20 lotus-server kernel: Recovery code awake Mar 17 00:40:20 lotus-server kernel: Timer Expired Mar 17 00:40:20 lotus-server kernel: Recovery code sleeping Mar 17 00:40:25 lotus-server kernel: Recovery code awake Mar 17 00:40:25 lotus-server kernel: Timer Expired Mar 17 00:40:25 lotus-server kernel: scsi1: Device reset returning 0x2003 Mar 17 00:40:25 lotus-server kernel: Recovery SCB completes Mar 17 00:40:25 lotus-server kernel: Recovery SCB completes Mar 17 00:40:35 lotus-server kernel: scsi: Device offlined - not ready after error recovery: host 1 channel 0 id 0 lun 0 Mar 17 00:40:35 lotus-server kernel: st0: Error 10000 (sugg. bt 0x0, driver bt 0x0, host bt 0x1). -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
