I'm getting a little frustrated trying to bring my new backup server online.  
I'm running Debian Stretch on a Dell R730, hooked to a Dell TL2000 library over 
SAS.  The library has a single LTO6 drive.  Bacula 7.4.4 is installed and, as 
far as I can tell, is correctly set up and configured.  It passes all the btape 
diagnostics with no trouble.  However, nothing I try from inside Bacula 
seems able to access the tape drive.  It *is* successful in manipulating 
the robotics, though:

 *mount storage=Library2 slot=1 drive=0
 3304 Issuing autochanger "load slot 1, drive 0" command for vol .
 3305 Autochanger "load slot 1, drive 0", status is OK for vol .
 3001 Mounted Volume: 000015L6
 3001 Device ""Drive-1" (/dev/nst0)" is already mounted with Volume "000015L6"

This is my second bacula installation.  My previous backup server (currently 
running on Debian Jessie) has been successfully backing up half a dozen clients 
for ten years on a different TL2000 using LTO3 tapes.

It started with an inability to label the tapes.  Then I noticed that btape 
allows you to do a label, so I went through and labeled each cartridge by hand 
in btape, recognizing that I'd have to add them to the catalog by hand, hoping 
that the problem was just with the label command.  OK, wishful thinking...

All my volumes are now labeled and in the catalog:

 *list volumes
 Using Catalog "MyCatalog"
 Pool: Default
 +---------+------------+-----------+---------+----------+----------+--------------+---------+------+-----------+-----------+-------------+-----------+
 | MediaId | VolumeName | VolStatus | Enabled | VolBytes | VolFiles | VolRetention | Recycle | Slot | InChanger | MediaType | LastWritten | ExpiresIn |
 +---------+------------+-----------+---------+----------+----------+--------------+---------+------+-----------+-----------+-------------+-----------+
 |       1 | CLNU00L1   | Cleaning  |       1 |        1 |        0 |   31,536,000 |       1 |    4 |         1 |           | NULL        | NULL      |
 |       2 | CLN005L3   | Cleaning  |       1 |        1 |        0 |   31,536,000 |       1 |    8 |         1 |           | NULL        | NULL      |
 |       3 | 000015L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    1 |         1 | LTO6      | NULL        | NULL      |
 |       5 | 000021L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    3 |         1 | LTO6      | NULL        | NULL      |
 |       6 | 000018L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    2 |         1 | LTO6      | NULL        | NULL      |
 |       7 | 000014L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    5 |         1 | LTO6      | NULL        | NULL      |
 |       8 | 000017L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    6 |         1 | LTO6      | NULL        | NULL      |
 |       9 | 000020L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    7 |         1 | LTO6      | NULL        | NULL      |
 |      10 | 000013L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    9 |         1 | LTO6      | NULL        | NULL      |
 |      11 | 000016L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   10 |         1 | LTO6      | NULL        | NULL      |
 |      12 | 000019L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   11 |         1 | LTO6      | NULL        | NULL      |
 |      13 | 000012L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   12 |         1 | LTO6      | NULL        | NULL      |
 |      14 | 000009L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   13 |         1 | LTO6      | NULL        | NULL      |
 |      15 | 000006L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   14 |         1 | LTO6      | NULL        | NULL      |
 |      16 | 000003L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   15 |         1 | LTO6      | NULL        | NULL      |
 |      17 | 000011L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   16 |         1 | LTO6      | NULL        | NULL      |
 |      18 | 000008L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   17 |         1 | LTO6      | NULL        | NULL      |
 |      19 | 000005L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   18 |         1 | LTO6      | NULL        | NULL      |
 |      20 | 000002L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   19 |         1 | LTO6      | NULL        | NULL      |
 |      23 | 000004L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   22 |         1 | LTO6      | NULL        | NULL      |
 |      24 | 000001L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   23 |         1 | LTO6      | NULL        | NULL      |
 |      25 | 000010L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   20 |         1 | LTO6      | NULL        | NULL      |
 |      26 | 000007L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   21 |         1 | LTO6      | NULL        | NULL      |
 +---------+------------+-----------+---------+----------+----------+--------------+---------+------+-----------+-----------+-------------+-----------+
 Pool: File
 No results to list.
 Pool: Scratch
 No results to list.

Still, when I try to do a backup, I get *immediate* errors.  There is no pause 
for the system to go to the hardware, which is a hint that it's probably a 
configuration error.

 *run
 A job name must be specified.
 The defined Job resources are:
     1: Odin
     2: BackupCatalog
     3: RestoreFiles
 Select Job resource (1-3): 1
 Run Backup job
 JobName:  Odin
 Level:    Incremental
 Client:   odin-fd
 FileSet:  Full Set Odin
 Pool:     Default (From Job resource)
 Storage:  Library2 (From Pool resource)
 When:     2017-12-18 10:23:58
 Priority: 10
 OK to run? (yes/mod/no): yes
 Job queued. JobId=46
 You have messages.
 *messages
 18-Dec 10:24 odin-dir JobId 46: No prior Full backup Job record found.
 18-Dec 10:24 odin-dir JobId 46: No prior or suitable Full backup found in 
catalog. Doing FULL backup.
 18-Dec 10:24 odin-dir JobId 46: Start Backup JobId 46, 
Job=Odin.2017-12-18_10.24.00_08
 18-Dec 10:24 odin-sd JobId 46: Fatal error: Device reservation failed for 
JobId=46: 1000 OK VolName=000015L6 VolJobs=0 VolFiles=0 VolBlocks=0 VolBytes=0 
VolABytes=0 VolHoleBytes=0 VolHoles=0 VolMounts=0  VolErrors=0 VolWrites=0 
MaxVolBytes=0 VolCapacityBytes=0 VolStatus=Append Slot=1 MaxVolJobs=0 
MaxVolFiles=0 InChanger=1 VolReadTime=0 VolWriteTime=0 EndFile=0 EndBlock=0 
VolType=0 LabelType=0 MediaId=3 ScratchPoolId=0

 18-Dec 10:24 odin-dir JobId 46: Fatal error: 
     Storage daemon didn't accept Device "AutochangerOdin" because:
     3924 Device "AutochangerOdin" not in SD Device resources or no matching 
Media Type.
 18-Dec 10:24 odin-dir JobId 46: Error: Bacula odin-dir 7.4.4 (202Sep16):
  Build OS:               x86_64-pc-linux-gnu debian 9.0
  JobId:                  46
  Job:                    Odin.2017-12-18_10.24.00_08
  Backup Level:           Full (upgraded from Incremental)
  Client:                 "odin-fd" 
  FileSet:                "Full Set Odin" 2017-08-09 23:05:00
  Pool:                   "Default" (From Job resource)
  Catalog:                "MyCatalog" (From Client resource)
  Storage:                "Library2" (From Pool resource)
  Scheduled time:         18-Dec-2017 10:23:58
  Start time:             18-Dec-2017 10:24:02
  End time:               18-Dec-2017 10:24:02
  Elapsed time:           0 secs
  Priority:               10
  FD Files Written:       0
  SD Files Written:       0
  FD Bytes Written:       0 (0 B)
  SD Bytes Written:       0 (0 B)
  Rate:                   0.0 KB/s
  Software Compression:   None
  Snapshot/VSS:           no
  Encryption:             no
  Accurate:               no
  Volume name(s):         
  Volume Session Id:      2
  Volume Session Time:    1513619722
  Last Volume Bytes:      0 (0 B)
  Non-fatal FD errors:    1
  SD Errors:              0
  FD termination status:  
  SD termination status:  
  Termination:            *** Backup Error ***

What the error message seems to be telling me is that there is no device 
"AutochangerOdin" defined in the storage daemon.  The following is the stanza 
for the autochanger in bacula-dir.conf:

 #From bacula-dir.conf
 # Definition of tape library device
 Storage {
   Name = Library2
 # Do not use "localhost" here    
   Address = odin               # N.B. Use a fully qualified name here
   SDPort = 9103
   Password = "mypassword"
   Device = AutochangerOdin
   Media Type = LTO6
   Autochanger = yes                   # enable for autochanger device
 }


Here is the section in bacula-sd.conf:

 #From bacula-sd.conf
 #
 # An autochanger device with 
 #  ONE drive!
 Autochanger {
   Name = AutochangerOdin
   Device = Drive-1
   Changer Command = "/etc/bacula/scripts/mtx-changer %c %o %S %a %d"
   Changer Device = /dev/autochanger1
 #  Changer Device = /dev/tape/by-id/scsi-1IBM_3573-TL_00X2U78BZ022_LL0
 }

 Device {
   Name = Drive-1                      #
   Description = "LT06 inside Dell TL2000 Library"
   Drive Index = 0
   Media Type = LT06
   Archive Device = /dev/nst0
 #  Archive Device = /dev/tape/by-id/scsi-35000e11164c42001-nst
 #  Archive Device = /dev/tape/by-path/pci-0000:05:00.0-sas-phy2-lun-0-nst 
   AutomaticMount = yes;               # when device opened, read it
   AlwaysOpen = yes;
   RemovableMedia = yes;
   RandomAccess = no;
   AutoChanger = yes
   SpoolDirectory = "/var/spool/bacula"
   MaximumSpoolSize = 485G
   Maximum Network Buffer Size = 65536
   Offline On Unmount = no
 #  #
 #  # Enable the Alert command only if you have the mtx package loaded
 #  # Note, apparently on some systems, tapeinfo resets the SCSI controller
 #  #  thus if you turn this on, make sure it does not reset your SCSI
 #  #  controller.  I have never had any problems, and smartctl does
 #  #  not seem to cause such problems.
 #  #
 #Alert Command = "sh -c 'tapeinfo -f %c |grep TapeAlert|cat'"
 #  If you have smartctl, enable this, it has more info than tapeinfo
   Alert Command = "sh -c 'smartctl -H -l error %c'"
 }

On my previous run at this (last fall) I got the advice to use by-id or by-path 
devices, but that is irrelevant, because those are just symlinks to the real 
devices, so I went back to the real devices.  It might make a difference if I 
were rebooting the system regularly and if I didn't have udev rules in place to 
keep the devices constant, but it's not a factor here.
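
To make the symlink point concrete, here is a minimal Python sketch (not part 
of my Bacula setup; the function names are made up for illustration) that 
follows symlinks and checks whether two paths end at the same device node.  It 
assumes it is run on the machine where the by-id link actually exists.

```python
import os


def resolve_device(path):
    """Return the path with every symlink followed and '.'/'..' collapsed."""
    return os.path.realpath(path)


def same_device(path_a, path_b):
    """True if both paths resolve to the same final path."""
    return resolve_device(path_a) == resolve_device(path_b)


if __name__ == "__main__":
    # Paths taken from the Device stanza above; the by-id link only
    # resolves to something meaningful on the host that owns the drive.
    by_id = "/dev/tape/by-id/scsi-35000e11164c42001-nst"
    print(resolve_device(by_id))
    print(same_device(by_id, "/dev/nst0"))
```

If the second line printed is True, both names refer to the same drive and 
the choice between them should not matter to the storage daemon.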

What I'm hoping for is that there *is* a misconfiguration that I'm just not 
seeing.  When I look at it, it seems fine to me, but I'm hoping someone else 
can see what's not obvious to me.  Why would the director think the device 
"AutochangerOdin" does not exist?  Or is that a red herring?  Is it a problem 
with the definition of the Media Type?  The catalog shows a MediaType of LTO6, 
yet the error message from the failed backup lists a lot of parameters and no 
MediaType.  Could this be related?  How would I fix that?
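
The 3924 message names two possible causes: the device name is missing from 
the SD resources, or the Media Type does not match.  Both can be checked 
mechanically rather than by eye.  Below is a rough Python sketch of such a 
check, a simplification and not anything Bacula ships.  The helper names 
`directives` and `check` are made up, the two strings are excerpts retyped 
from the stanzas above, and I am assuming that Bacula ignores case and spaces 
in directive names but compares the values exactly.

```python
# Excerpts retyped from the two stanzas quoted above; only the lines that
# matter for the name and media-type comparison are kept.
DIR_STORAGE = """
Storage {
  Name = Library2
  Device = AutochangerOdin
  Media Type = LTO6
  Autochanger = yes
}
"""

SD_RESOURCES = """
Autochanger {
  Name = AutochangerOdin
  Device = Drive-1
}
Device {
  Name = Drive-1
  Media Type = LT06
  Archive Device = /dev/nst0
  AutoChanger = yes
}
"""


def directives(text, key):
    """Return every value assigned to `key` in a config excerpt.

    Directive names are compared without regard to case or spaces.
    Assumption: no '#' appears inside a quoted value.
    """
    wanted = key.replace(" ", "").lower()
    values = []
    for line in text.splitlines():
        line = line.split("#", 1)[0]  # drop comments
        if "=" not in line:
            continue
        name, value = line.split("=", 1)
        if name.replace(" ", "").lower() == wanted:
            values.append(value.strip().rstrip(";").strip().strip('"'))
    return values


def check(dir_text, sd_text):
    """List the Director values that have no exact match on the SD side."""
    problems = []
    sd_names = directives(sd_text, "Name")
    sd_media = directives(sd_text, "Media Type")
    for device in directives(dir_text, "Device"):
        if device not in sd_names:
            problems.append(f"Device {device!r} not among SD names {sd_names!r}")
    for media in directives(dir_text, "Media Type"):
        if media not in sd_media:
            problems.append(
                f"Media Type {media!r} not among SD media types {sd_media!r}"
            )
    return problems


if __name__ == "__main__":
    for problem in check(DIR_STORAGE, SD_RESOURCES):
        print(problem)
```

The comparison is character for character, so strings that look identical on 
screen but differ in one character (a letter O versus a digit 0, for 
instance) are reported as a mismatch.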

Thanks for any advice.

Steve Garcia
Ignorance killed the cat, curiosity was framed.

_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users