On 9/2/21 9:50 AM, Dan-Gabriel CALUGARU wrote:
Hello everybody,
I would like to ask for your help to continue the backup of space of
around 300 TB.
I'am using Bacula 9.6.7 version.
I was able to divide this work into several jobs of about 15-20 TB
(one week for each job) to be able to resume more easily if there was
a problem.
After several such jobs successfully completed (I have already backed
up nearly 250 TB), the machine hosting the bacula server crashed while
my last backup job (jobID = 25) was running.
Could you advise me what is the best way to continue in such a case ?
If something happens to the network communications or if the client
crashes, then a job may be marked Incomplete, rather than Failed. In
that case, the job can be restarted because the Bacula server knows that
the files it has received so far are correct and that it can restart
with the file that was being received when the problem occurred. The
server still has the cached/spooled data. For your job 25, that is not
the case. Instead, the Bacula server machine itself crashed, so it
cannot determine where to restart and did not retain any cached/spooled
data.
As additional information, I would note that this job appears with
Failed status and that it had written (before the crash) on 2 volumes
(which are LTO-7 tape cartridges with a capacity of approximately 6TB):
- about 2TB on the 1st volume "volume41" (which became Full), knowing
that the previous job (well finished) had already written the first 4TB
- about 1TB on the 2nd volume "volume 42" (which was empty before the
job, ans allways in Append status)
I have tried so far:
1) purge files jobid=25
but this command seems to have nothing done because jobID=25 was still
present in the catalog (the outputs of the commands list jobid=25 and
list joblog jobid=25 have not changed after this command)
then
2) delete jobid=25
who deleted this job from the catalog because I got this message :
/JobId = 25 and associated records deleted from the catalog./
and the outputs of the commands list jobid=25 and list joblog jobid=25
have changed ("No results to list")
On the other hand, the information on the two volumes has not changed
and if I restart with restart jobid=25 I have the impression that
bacula acts as if it is another job, so it continues to write on the
2nd volume ("volume 42") after the 1TB already written (by the
previous Failed job). Therefore, the space written by the Failes job
(jobID = 25) no longer seems to be used and will therefore remain "lost".
Instead, I would like bacula reuse this space (the 2TB on the 1st
volume "volume41" and the 1TB on the 2nd volume "volume 42").
Indeed, from what I understood, for Failed jobs, we have to start from
scratch, but I would like to re-use the space it had written by Failed
job (because unusable).
Do you have a technique for doing this ?
Thank you in advance for any response
Best regards,
Dan
--
Dan-Gabriel CALUGARU
IR en Calcul Scientifique (CNRS)
Dr de Mathématiques et Applications
Laboratoire de Mécanique des Fluides et d'Acoustique
UMR 5509 CNRS - ECL - UCBL - INSA Lyon - Univ. de Lyon
Bâtiment I11 - bureau 11098
ECOLE CENTRALE de LYON
36, avenue Guy de Collongue
69134 ECULLY
tel: +33 (0)4 72 18 61 73
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users