I'm getting a little frustrated trying to bring my new backup server online. I'm running Debian Stretch on a Dell R730, hooked to a Dell TL2000 library over SAS. The library has a single LTO6 drive. Bacula 7.4.4 is installed and, as far as I can tell, correctly set up and configured. It passes all the btape diagnostics with no trouble. However, nothing I try from inside bacula seems able to access the tape drive itself. It *is* successful in manipulating the robotics, though:

*mount storage=Library2 slot=1 drive=0
3304 Issuing autochanger "load slot 1, drive 0" command for vol .
3305 Autochanger "load slot 1, drive 0", status is OK for vol .
3001 Mounted Volume: 000015L6
3001 Device ""Drive-1" (/dev/nst0)" is already mounted with Volume "000015L6"

This is my second bacula installation. My previous backup server (currently running on Debian Jessie) has been successfully backing up half a dozen clients for ten years on a different TL2000 using LTO3 tapes.

The trouble started with an inability to label the tapes. Then I noticed that btape allows you to do a label, so I went through and labeled each cartridge by hand in btape, recognizing that I'd have to add them to the catalog by hand, hoping that the problem was just with the label command. OK, wishful thinking...

All my volumes are now labeled and in the catalog:

*list volumes
Using Catalog "MyCatalog"
Pool: Default
+---------+------------+-----------+---------+----------+----------+--------------+---------+------+-----------+-----------+-------------+-----------+
| MediaId | VolumeName | VolStatus | Enabled | VolBytes | VolFiles | VolRetention | Recycle | Slot | InChanger | MediaType | LastWritten | ExpiresIn |
+---------+------------+-----------+---------+----------+----------+--------------+---------+------+-----------+-----------+-------------+-----------+
|       1 | CLNU00L1   | Cleaning  |       1 |        1 |        0 |   31,536,000 |       1 |    4 |         1 |           | NULL        | NULL      |
|       2 | CLN005L3   | Cleaning  |       1 |        1 |        0 |   31,536,000 |       1 |    8 |         1 |           | NULL        | NULL      |
|       3 | 000015L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    1 |         1 | LTO6      | NULL        | NULL      |
|       5 | 000021L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    3 |         1 | LTO6      | NULL        | NULL      |
|       6 | 000018L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    2 |         1 | LTO6      | NULL        | NULL      |
|       7 | 000014L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    5 |         1 | LTO6      | NULL        | NULL      |
|       8 | 000017L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    6 |         1 | LTO6      | NULL        | NULL      |
|       9 | 000020L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    7 |         1 | LTO6      | NULL        | NULL      |
|      10 | 000013L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |    9 |         1 | LTO6      | NULL        | NULL      |
|      11 | 000016L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   10 |         1 | LTO6      | NULL        | NULL      |
|      12 | 000019L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   11 |         1 | LTO6      | NULL        | NULL      |
|      13 | 000012L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   12 |         1 | LTO6      | NULL        | NULL      |
|      14 | 000009L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   13 |         1 | LTO6      | NULL        | NULL      |
|      15 | 000006L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   14 |         1 | LTO6      | NULL        | NULL      |
|      16 | 000003L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   15 |         1 | LTO6      | NULL        | NULL      |
|      17 | 000011L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   16 |         1 | LTO6      | NULL        | NULL      |
|      18 | 000008L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   17 |         1 | LTO6      | NULL        | NULL      |
|      19 | 000005L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   18 |         1 | LTO6      | NULL        | NULL      |
|      20 | 000002L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   19 |         1 | LTO6      | NULL        | NULL      |
|      23 | 000004L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   22 |         1 | LTO6      | NULL        | NULL      |
|      24 | 000001L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   23 |         1 | LTO6      | NULL        | NULL      |
|      25 | 000010L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   20 |         1 | LTO6      | NULL        | NULL      |
|      26 | 000007L6   | Append    |       1 |        0 |        0 |   31,536,000 |       1 |   21 |         1 | LTO6      | NULL        | NULL      |
+---------+------------+-----------+---------+----------+----------+--------------+---------+------+-----------+-----------+-------------+-----------+
Pool: File
No results to list.
Pool: Scratch
No results to list.

Still, when I try to do a backup, I get *immediate* errors. There is no pause for the system to go to the hardware, which is a hint that it's probably a configuration error.

*run
A job name must be specified.
The defined Job resources are:
     1: Odin
     2: BackupCatalog
     3: RestoreFiles
Select Job resource (1-3): 1
Run Backup job
JobName:  Odin
Level:    Incremental
Client:   odin-fd
FileSet:  Full Set Odin
Pool:     Default (From Job resource)
Storage:  Library2 (From Pool resource)
When:     2017-12-18 10:23:58
Priority: 10
OK to run? (yes/mod/no): yes
Job queued. JobId=46
You have messages.
*messages
18-Dec 10:24 odin-dir JobId 46: No prior Full backup Job record found.
18-Dec 10:24 odin-dir JobId 46: No prior or suitable Full backup found in catalog. Doing FULL backup.
18-Dec 10:24 odin-dir JobId 46: Start Backup JobId 46, Job=Odin.2017-12-18_10.24.00_08
18-Dec 10:24 odin-sd JobId 46: Fatal error: Device reservation failed for JobId=46: 1000 OK VolName=000015L6
    VolJobs=0 VolFiles=0 VolBlocks=0 VolBytes=0 VolABytes=0 VolHoleBytes=0 VolHoles=0 VolMounts=0
    VolErrors=0 VolWrites=0 MaxVolBytes=0 VolCapacityBytes=0 VolStatus=Append Slot=1 MaxVolJobs=0
    MaxVolFiles=0 InChanger=1 VolReadTime=0 VolWriteTime=0 EndFile=0 EndBlock=0 VolType=0 LabelType=0
    MediaId=3 ScratchPoolId=0
18-Dec 10:24 odin-dir JobId 46: Fatal error:
     Storage daemon didn't accept Device "AutochangerOdin" because:
     3924 Device "AutochangerOdin" not in SD Device resources or no matching Media Type.
18-Dec 10:24 odin-dir JobId 46: Error: Bacula odin-dir 7.4.4 (202Sep16):
  Build OS:               x86_64-pc-linux-gnu debian 9.0
  JobId:                  46
  Job:                    Odin.2017-12-18_10.24.00_08
  Backup Level:           Full (upgraded from Incremental)
  Client:                 "odin-fd"
  FileSet:                "Full Set Odin" 2017-08-09 23:05:00
  Pool:                   "Default" (From Job resource)
  Catalog:                "MyCatalog" (From Client resource)
  Storage:                "Library2" (From Pool resource)
  Scheduled time:         18-Dec-2017 10:23:58
  Start time:             18-Dec-2017 10:24:02
  End time:               18-Dec-2017 10:24:02
  Elapsed time:           0 secs
  Priority:               10
  FD Files Written:       0
  SD Files Written:       0
  FD Bytes Written:       0 (0 B)
  SD Bytes Written:       0 (0 B)
  Rate:                   0.0 KB/s
  Software Compression:   None
  Snapshot/VSS:           no
  Encryption:             no
  Accurate:               no
  Volume name(s):
  Volume Session Id:      2
  Volume Session Time:    1513619722
  Last Volume Bytes:      0 (0 B)
  Non-fatal FD errors:    1
  SD Errors:              0
  FD termination status:
  SD termination status:
  Termination:            *** Backup Error ***

What the error message seems to be telling me is that there is no device "AutochangerOdin" defined in the storage daemon. The following is the stanza for the autochanger in bacula-dir.conf:

#From bacula-dir.conf
# Definition of tape library device
Storage {
  Name = Library2
# Do not use "localhost" here
  Address = odin                # N.B. Use a fully qualified name here
  SDPort = 9103
  Password = "mypassword"
  Device = AutochangerOdin
  Media Type = LTO6
  Autochanger = yes             # enable for autochanger device
}

Here is the section in bacula-sd.conf:

#From bacula-sd.conf
#
# An autochanger device with
# ONE drive!
Autochanger {
  Name = AutochangerOdin
  Device = Drive-1
  Changer Command = "/etc/bacula/scripts/mtx-changer %c %o %S %a %d"
  Changer Device = /dev/autochanger1
# Changer Device = /dev/tape/by-id/scsi-1IBM_3573-TL_00X2U78BZ022_LL0
}

Device {
  Name = Drive-1                      #
  Description = "LT06 inside Dell TL2000 Library"
  Drive Index = 0
  Media Type = LT06
  Archive Device = /dev/nst0
# Archive Device = /dev/tape/by-id/scsi-35000e11164c42001-nst
# Archive Device = /dev/tape/by-path/pci-0000:05:00.0-sas-phy2-lun-0-nst
  AutomaticMount = yes;               # when device opened, read it
  AlwaysOpen = yes;
  RemovableMedia = yes;
  RandomAccess = no;
  AutoChanger = yes
  SpoolDirectory = "/var/spool/bacula"
  MaximumSpoolSize = 485G
  Maximum Network Buffer Size = 65536
  Offline On Unmount = no
  #
  #
  # # Enable the Alert command only if you have the mtx package loaded
  # # Note, apparently on some systems, tapeinfo resets the SCSI controller
  # #  thus if you turn this on, make sure it does not reset your SCSI
  # #  controller.  I have never had any problems, and smartctl does
  # #  not seem to cause such problems.
  # #
  #Alert Command = "sh -c 'tapeinfo -f %c |grep TapeAlert|cat'"
  # If you have smartctl, enable this, it has more info than tapeinfo
  Alert Command = "sh -c 'smartctl -H -l error %c'"
}

On my previous run at this (last fall), I got the advice to use by-id or by-path devices, but that is irrelevant, because those are just symlinks to the real devices, so I went back to the real devices. It might make a difference if I were rebooting the system regularly and didn't have udev rules in place to keep the devices constant, but it's not a factor here.

What I'm hoping for is that there *is* a misconfiguration that I'm just not seeing. When I look at it, it seems fine to me, but I'm hoping someone else can see what's not obvious to me. Why would the director think the device "AutochangerOdin" does not exist? Or is that a red herring? Is it a problem with the definition of the Media Type?
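
One way to rule the Media Type question in or out would be to compare the two strings mechanically rather than by eye. Below is a minimal sketch in Python (not part of Bacula). The helper name media_types and the trimmed config excerpts are only for illustration, and the parsing is deliberately naive: it assumes one directive per line and no "#" inside quoted values.

```python
# Excerpts trimmed from the bacula-dir.conf and bacula-sd.conf stanzas quoted
# in this message; only the directives relevant to the comparison are kept.
DIR_STORAGE = """
Storage {
  Name = Library2
  Device = AutochangerOdin
  Media Type = LTO6
  Autochanger = yes
}
"""

SD_DEVICE = """
Device {
  Name = Drive-1
  Drive Index = 0
  Media Type = LT06
  Archive Device = /dev/nst0
}
"""


def media_types(conf_text):
    """Return every uncommented Media Type value found in conf_text.

    Hypothetical helper, not a Bacula API. Directive names are matched
    ignoring case and spaces, as Bacula does for directive names.
    """
    values = []
    for line in conf_text.splitlines():
        # Drop trailing comments and an optional terminating semicolon.
        line = line.split("#", 1)[0].strip().rstrip(";")
        key, sep, value = line.partition("=")
        if sep and key.replace(" ", "").lower() == "mediatype":
            values.append(value.strip().strip('"'))
    return values


# Compare every Director value against every SD value as exact strings,
# and show the code points so look-alike characters stand out.
for d in media_types(DIR_STORAGE):
    for s in media_types(SD_DEVICE):
        status = "match" if d == s else "MISMATCH"
        print(f"Director {d!r} vs SD {s!r}: {status}")
        print("  Director code points:", [hex(ord(c)) for c in d])
        print("  SD code points:      ", [hex(ord(c)) for c in s])
```

If the two strings differ by even one character, that alone could account for the "no matching Media Type" half of error 3924, independent of whether the device name is found.
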
The catalog shows a MediaType of LTO6, but the error message from the failed backup shows a lot of parameters, but no MediaType. Could this be related? How would I fix that?

Thanks for any advice.

Steve Garcia

Ignorance killed the cat, curiosity was framed.

_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users