Hello all, even with bacula version 5.2.12 we're still seeing the errors below where bacula sometimes fails to properly recycle a disk based volume.
There are lots of error messages like these: ###################################################################### 10-Jan 21:16 serverl186-dir JobId 51100: Start Backup JobId 51100, Job=serverw2107.2013-01-10_16.39.52_51 10-Jan 21:16 serverl186-dir JobId 51100: Purging oldest volume "incremental-0509" 10-Jan 21:16 serverl186-dir JobId 51100: 0 File on Volume "incremental-0509" purged from catalog. 10-Jan 21:16 serverl186-dir JobId 51100: Purging oldest volume "incremental-0509" 10-Jan 21:16 serverl186-dir JobId 51100: 0 File on Volume "incremental-0509" purged from catalog. 10-Jan 21:16 serverl186-dir JobId 51100: Purging oldest volume "incremental-0509" .... <SNIP SNIP> .... 10-Jan 22:19 serverw2107-fd JobId 51100: DIR and FD clocks differ by -30 seconds, FD automatically compensating. 10-Jan 22:19 serverl186-dir JobId 51100: Purging oldest volume "incremental-0509" 10-Jan 22:19 serverl186-dir JobId 51100: 0 File on Volume "incremental-0509" purged from catalog. 10-Jan 22:19 serverl186-dir JobId 51100: Purging oldest volume "incremental-0509" 10-Jan 22:19 serverl186-dir JobId 51100: 0 File on Volume "incremental-0509" purged from catalog. 10-Jan 22:19 serverl186-sd JobId 51100: Job serverw2107.2013-01-10_16.39.52_51 is waiting. Cannot find any appendable volumes. Please use the "label" command to create a new Volume for: Storage: "FileStorage_incremental" (/mnt/msa/online_backup) Pool: Online_incremental Media type: File 10-Jan 22:24 serverl186-sd JobId 51100: Volume "incremental-0508" previously written, moving to end of data. 10-Jan 22:24 serverl186-sd JobId 51100: Ready to append to end of Volume "incremental-0508" size=1202 10-Jan 22:24 serverw2107-fd JobId 51100: Generate VSS snapshots. Driver="VSS Vista", Drive(s)="C" 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "Task Scheduler Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "VSS Metadata Store Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "Performance Counters Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "System Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "ASR Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "Shadow Copy Optimization Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "COM+ REGDB Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "Registry Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "IIS Config Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "IIS Metabase Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "BITS Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:24 serverw2107-fd JobId 51100: VSS Writer (BackupComplete): "WMI Writer", State: 0x1 (VSS_WS_STABLE) 10-Jan 22:25 serverl186-sd JobId 51100: Elapsed time=00:00:24, Transfer rate=21 Bytes/second 10-Jan 22:25 serverl186-dir JobId 51100: Fatal error: Catalog error creating JobMedia record. sql_create.c:155 Update Media record UPDATE Media SET EndFile=50, EndBlock=1638829081 WHERE MediaId=58 failed: ERR= 10-Jan 22:25 serverl186-sd JobId 51100: Fatal error: Error creating JobMedia record: 1992 Create JobMedia error 10-Jan 22:25 serverl186-sd JobId 51100: Fatal error: acquire.c:516 Could not create JobMedia record for Volume="incremental-0508" Job=serverw2107.2013-01-10_16.39.52_51 10-Jan 22:25 serverl186-dir JobId 51100: Error: Unable to get Media record for Volume incremental-0058: ERR=sql_get.c:1094 Media record for Volume "incremental-0058" not found. ###################################################################### At this time in mysql, I can still see a long running query initiated by the volume recycle process: ###################################################################### # mystat Id User Host db Command Time State Info Progress 16715 bacula localhost bacula Query 18534 updating DELETE FROM File WHERE JobId IN (48428,48430,48432,48433,48434,48566,48568,48572,48574,48576,48578,4 0.000 16774 root localhost NULL Query 0 NULL show processlist 0.000 ###################################################################### As you can see above, the query to delete the files for the job records found on the volume has been running for over 18,000 seconds. We don't generally seem to have a problem with long recycle times, but every once in a while bacula hangs in the circumstances described above. The error seems to occur roughly one hour after the recycling starts, so could this be some sort of maximum wait interval expiring or something? Any ideas how to fix this situation would be greatly appreciated. We're running on Centos 6 with bacula community 5.2.12 compiled from source. We've also been seeing this behaviour with version 5.2.6. All the best, Uwe -- NIONEX --- Ein Unternehmen der Bertelsmann SE & Co. KGaA ------------------------------------------------------------------------------ Master HTML5, CSS3, ASP.NET, MVC, AJAX, Knockout.js, Web API and much more. Get web development skills now with LearnDevNow - 350+ hours of step-by-step video tutorials by Microsoft MVPs and experts. SALE $99.99 this month only -- learn more at: http://p.sf.net/sfu/learnmore_122812 _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users