On Thursday 01 March 2007 03:58, Michael Brennen wrote:
> On Mon, 26 Feb 2007, Kern Sibbald wrote:
> > On Sunday 25 February 2007 09:42, Michael Brennen wrote:
> >> Hello again,
> >>
> >> The long weekend backups started running Friday night.  This is a 4
> >> drive 60 slot autochanger.  The bacula version is "archive-dir
> >> Version: 2.1.4 (21 February 2007)" to get the encryption patch.
> >>
> >> Consistent with what I posted a couple of days ago the tape drives
> >> are still a bit confused, and I've had to manually unmount and mount
> >> tapes to accomodate the needs of a few jobs.
> >>
> >> I now have a job that is waiting for a particular mount on drive 0,
> >> but thd drive is unmountable.  I unmounted a volume FNI0005 in drive
> >> 0, used by a different job, and unmounted volume FNI0006 in drive 3
> >> in order to mount FNI0006 on drive 0, which is what the waiting job
> >> wanted.  That process left drive 0 in the following state, taken
> >> with debug 200:
> >>
> >> ===============================================
> >> Device "Drive-0" (/dev/nst0) is not open.
> >>      Device is BLOCKED. User unmounted during wait for media/mount.
> >>      Drive 0 status unknown.
> >> Configured device capabilities:
> >> EOF BSR BSF FSR FSF EOM REM !RACCESS AUTOMOUNT !LABEL !ANONVOLS
> >> ALWAYSOPEN Device state:
> >> !OPENED TAPE !LABEL !MALLOC !APPEND !READ !EOT !WEOT !EOF !NEXTVOL
> >> !SHORT !MOUNTED num_writers=1 block=5
> >> ===============================================
> >>
> >> If I read this correctly, the lack of a bang in front of TAPE
> >> indicates that bacula thinks there is a tape in the drive.  There is
> >> not; a raw mtx status query shows drive 0 empty.  Trying to unmount
> >> FNI0005 left the drive in this state, and it is now impossible to
> >> mount another drive, as a mount command results in trying to unmount
> >> a non-existent tape into an already full slot 1, which of course
> >> fails.
> >>
> >> When I try to mount FNI0006 on drive 0, this is the result:
> >>
> >> ===============================================
> >> forked processes:
> >>
> >> /bin/sh /path/to/bacula/mtx-changer /dev/sg3 unload 1 /dev/nst0 0
> >>    \_ mt -f /dev/nst0 offline
> >>    ... eventual timeout...
> >> 3995 Bad autochanger "unload slot 1, drive 0": ERR=Child exited with
> >> code 1
> >> Results=/dev/nst0: No medium found
> >> Storage Element 1 is Already Full
> >>
> >> 3901 open device failed: ERR=dev.c:424 Unable to open device
> >> "Drive-0" (/dev/nst0): ERR=No medium found
> >> ===============================================
> >>
> >> Every time I try to mount a tape to the drive it tries to unload a
> >> non existent tape and will go no further.  I've tried releasing the
> >> drive, unmounting it, etc. and nothing has succeeded so far.  I've
> >> had no results googling for a resolution either; this drive status
> >> does not come up much in searches.
> >>
> >> The autochanger and drive 0 config are below; the other drives are
> >> identical with appropriate name changes.  This configuration has
> >> been working for weeks, and it just began showing problems.  The
> >> pools just now began recycling volumes, and it seems to be the low
> >> numbered recycled tapes that are problematic.  I do not know if that
> >> has anything to do with this or not.
> >>
> >> Autochanger {
> >>    Name = Q47
> >>    Device = Drive-0, Drive-1, Drive-2, Drive-3
> >>    Changer Command = "/path/to/bacula/mtx-changer %c %o %S %a %d"
> >>    Changer Device = "/dev/sg3"
> >> }
> >>
> >> Device {
> >>    Name = Drive-0
> >>    Drive Index = 0
> >>    Media Type = DLT-7000
> >>    Archive Device = /dev/nst0
> >>    AutomaticMount = yes;
> >>    AlwaysOpen = yes;
> >>    RemovableMedia = yes;
> >>    RandomAccess = no;
> >>    AutoChanger = yes
> >>    LabelMedia = no
> >>    # Enable the Alert command only if you have the mtx package loaded
> >>    Alert Command = "sh -c 'tapeinfo -f %c |grep TapeAlert|cat'"
> >>    SpoolDirectory = /some/dir
> >> }
> >>
> >> If anyone has any insight on how to clear this up, or how it got in
> >> this state, I would appreciate it.  For now I will muddle through
> >> the long backups and manually rerun the failed/blocked jobs after I
> >> can restart things to get it to an unblocked state.  Thanks...
> >
> > The first thing to do is to get back on a stable Bacula.  I have not
> > tested the trunk with a tape drive for well over a month, and yet there
> > have been some *major* modifications to the device handling.
> >
> > If you don't know how to get onto Branch-2.0, then I recommend going back
> > to the released 2.0.2.  (Sorry I don't have the proper command just in
> > front of me). Hopefully that will clear up all your problems.  If you
> > need the fix for the encryption problem, download the updated restore.c
> > file from the bug. By the end of next week, I hope to have version 2.0.3
> > ready for release ...
>
> I went back to 2.0.2 source with the filed restore.c, and all is
> working well now.  I did not know the difference between the branch
> and trunk; this is my first exposure to an svn tree.  That's what I
> get for working with things I don't fully understand. :)

OK.  What you did is the best what to have proceeded in the situation you were 
in.   Now I need some testing of 2.0.3.  However, last night, I ran into a 
problem with two pools that seems to be exactly what you were describing, so 
before doing any beta testing (i.e. making it available to you or anyone 
else), I need to investigate that problem.  As far as I can tell, it is a 
problem that has been around for some time (at least from 1.39.x), but only 
triggers under some circumstances.  This regression test has been run many 
times.  We'll see if I can reproduce it today ...

I have house guests all this week, so programming is going slowly (mostly 
early morning and late night).  I'll send another email when I am sure there 
are no problems.

>
>     -- Michael
>
> -------------------------------------------------------------------------
> Take Surveys. Earn Cash. Influence the Future of IT
> Join SourceForge.net's Techsay panel and you'll get the chance to share
> your opinions on IT & business topics through brief surveys-and earn cash
> http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
> _______________________________________________
> Bacula-users mailing list
> Bacula-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/bacula-users

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to