Hello,

for some time now, eventually since switching to 1.38.5  (18  January
2006),   I'm   experiencing   strange  SCSI  problems.  These  happen
identically on 4 different  tape  drives  connected  to  3  different
hosts.  The 3 machines are running Fedora Core 2 and Fedora Core 4. A
test setup on a FC4 standalone machine with different  type  of  SCSI
controller  and a new tape drive showed the same symptoms, too. So it
seems the only common factor is the new version of bacula.

Using other tools (btape, tar) to write to the tape never  shows  any
such problems.

What I see in bacula is this:

13-Mar 12:24 castor-sd: User specified spool size reached.
13-Mar 12:24 castor-sd: Writing spooled data to Volume. Despooling 
5,000,028,410 bytes ...
13-Mar 12:24 castor-sd: Castor-Home.2006-03-13_10.51.13 Error: block.c:538 
Write error at 9:3891 on device "SLR100-2" (/dev/nst1). ERR=Input/output error.
13-Mar 12:24 castor-sd: Castor-Home.2006-03-13_10.51.13 Error: Error writing 
final EOF to tape. This Volume may not be readable.
dev.c:1536 ioctl MTWEOF error on "SLR100-2" (/dev/nst1). ERR=Input/output error.
13-Mar 12:24 castor-sd: End of medium on Volume "K-F-2" Bytes=9,092,562,656 
Blocks=140,951 at 13-Mar-2006 12:24.


The system log shows:

Mar 13 12:24:45 castor kernel: mptbase: ioc0: IOCStatus(0x0047): SCSI Protocol 
Error
Mar 13 12:24:45 castor kernel: st1: Error b0000 (sugg. bt 0x0, driver bt 0x0, 
host bt 0xb).
Mar 13 12:24:45 castor kernel: mptbase: ioc0: IOCStatus(0x0043): SCSI Device 
Not There
Mar 13 12:24:45 castor kernel: st1: Error 10000 (sugg. bt 0x0, driver bt 0x0, 
host bt 0x1).
Mar 13 12:24:45 castor kernel: mptbase: ioc0: IOCStatus(0x0043): SCSI Device 
Not There
Mar 13 12:24:45 castor kernel: st1: Error 10000 (sugg. bt 0x0, driver bt 0x0, 
host bt 0x1).
Mar 13 12:26:58 castor kernel: st1: Error with sense data: Current st1: sense 
key Unit Attention
Mar 13 12:26:58 castor kernel: Additional sense: Power on, reset, or bus device 
reset occurred


To me this looks like a SCSI driver problem in the Linux kernel,  but
all  circumstances  point  at  bacula, or at least at the bacula tape
access pattern.


Has anybody else ever seen something like this?  Any  ideas  what  to
look for?

Best regards,

Wolfgang Denk

-- 
Software Engineering:  Embedded and Realtime Systems,  Embedded Linux
Phone: (+49)-8142-66989-10 Fax: (+49)-8142-66989-80 Email: [EMAIL PROTECTED]
What is research but a blind date with knowledge?      -- Will Harvey


-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid0944&bid$1720&dat1642
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to