Dear all,
I'm using bacula 9.6.5 in a production for a month now. I'm experiencing random 
backup failures from my clients. Specific hosts report errors like the outputs 
attached. The same host is able to perform backup at some other time. The error 
is more often at large backups (more errors at full backups than incremental, 
more errors at hosts with large data sets).

I have tried to implement heartbeat interval 
(https://www.bacula.org/9.6.x-manuals/en/main/Client_File_daemon_Configur.html#SECTION002210000000000000000)
 but there is no improvement.
The error occures also on hosts in the same zone as bacula server (no 
router/firewall in between).
Storage deamon is installed on the same server as bacula director. I'm using 
File cloud driver (backup to local disk via cloud resource).

Could you please suggest a solution or a way to troubleshoot this further?
Thx!
Regards,Ziga Zvan

Backup from linux hosts (on 05-dec 3 hosts failed, 20 hosts completed without 
error):
05-Dec 03:26 bacula-dir JobId 1721: Fatal error: Network error with FD during 
Backup: ERR=Connection reset by peer
05-Dec 03:27 bacula-dir JobId 1721: Fatal error: No Job status returned from FD.
05-Dec 03:27 bacula-dir JobId 1721: Error: Bacula bacula-dir 9.6.5 (11Jun20):

Backup from windows hosts (on 05-dec 2 hosts failed, 5 hosts completed without 
error):
05-Dec 00:40 iwhost01.kranj.cetrtapot.si-fd JobId 1726: Error: lib/bsock.c:383 
Write error sending 57172 bytes to Storage daemon:192.168.66.35:9103: 
ERR=Input/output error
05-Dec 00:40 iwhost01.kranj.cetrtapot.si-fd JobId 1726: Fatal error: 
filed/backup.c:848 Network send error to SD. ERR=Input/output error
05-Dec 00:40 iwhost01.kranj.cetrtapot.si-fd JobId 1726: VSS Writer (BackupComplete): 
"Task Scheduler Writer", State: 0x1 (VSS_WS_STABLE)
05-Dec 00:40 iwhost01.kranj.cetrtapot.si-fd JobId 1726: VSS Writer (BackupComplete): 
"VSS Metadata Store Writer", State: 0x1 (VSS_WS_STABLE)
05-Dec 00:40 iwhost01.kranj.cetrtapot.si-fd JobId 1726: VSS Writer (BackupComplete): 
"Performance Counters Writer", State: 0x1 (VSS_WS_STABLE)
05-Dec 00:40 iwhost01.kranj.cetrtapot.si-fd JobId 1726: VSS Writer (BackupComplete): 
"System Writer", State: 0x1 (VSS_WS_STABLE)
05-Dec 00:40 iwhost01.kranj.cetrtapot.si-fd JobId 1726: VSS Writer (BackupComplete): 
"ASR Writer", State: 0x1 (VSS_WS_STABLE)
05-Dec 00:40 iwhost01.kranj.cetrtapot.si-fd JobId 1726: VSS Writer (BackupComplete): 
"Shadow Copy Optimization Writer", State: 0x1 (VSS_WS_STABLE)
05-Dec 01:01 bacula-dir JobId 1726: Error: bsock.c:551 Read error from Client: 
iwhost01.kranj.cetrtapot.si-fd:iwhost01.kranj.cetrtapot.si:9102: ERR=Connection 
timed out
05-Dec 01:01 bacula-dir JobId 1726: Fatal error: Network error with FD during 
Backup: ERR=Connection timed out
05-Dec 01:02 bacula-dir JobId 1726: Fatal error: No Job status returned from FD.
05-Dec 01:02 bacula-dir JobId 1726: Error: Bacula bacula-dir 9.6.5 (11Jun20):

Similar output from 21-Nov:
21-Nov 05:30 dc1.kranj.cetrtapot.si-fd JobId 1393: Error: lib/bsock.c:383 Write 
error sending 4 bytes to Storage daemon:192.168.66.35:9103: ERR=Input/output 
error
21-Nov 05:30 dc1.kranj.cetrtapot.si-fd JobId 1393: Error: lib/bsock.c:271 
Socket has errors=1 on call to Storage daemon:192.168.66.35:9103
21-Nov 05:30 dc1.kranj.cetrtapot.si-fd JobId 1393: Error: lib/bsock.c:271 
Socket has errors=1 on call to Storage daemon:192.168.66.35:9103
21-Nov 05:30 dc1.kranj.cetrtapot.si-fd JobId 1393: Fatal error: 
filed/backup.c:607 Network send error to SD. ERR=Input/output error
21-Nov 05:49 bacula-dir JobId 1393: Fatal error: Network error with FD during 
Backup: ERR=Connection timed out




_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to