Hi.

Recently we changed the network connection for our backup server, which runs 
Bacula 5.2.3 on FreeBSD 9.0.

After that, many jobs running across the WAN started failing with various 
network errors, mostly "Broken pipe". Some examples:

21-Feb 22:42 fbsd1-fd JobId 57779: Error: bsock.c:398 Wrote 32151 bytes to 
Storage daemon:backupsrv.url:9103, but only 16384 accepted.
21-Feb 22:42 fbsd1-fd JobId 57779: Fatal error: backup.c:1024 Network send 
error to SD. ERR=Broken pipe
21-Feb 22:42 fbsd1-fd JobId 57779: Error: bsock.c:339 Socket has errors=1 on 
call to Storage daemon:backupsrv.url:9103

(this one runs from the same backup-network to another SD)
22-Feb 00:14 backupsrv-dir JobId 57852: Fatal error: Network error with FD 
during Backup: ERR=Broken pipe
22-Feb 00:14 backupsrv-sd2 JobId 57852: JobId=57852 
Job="linux1-userdata.2012-02-21_23.05.02_02" marked to be canceled.
22-Feb 00:14 backupsrv-sd2 JobId 57852: Job write elapsed time = 00:33:20, 
Transfer rate = 591.5 K Bytes/second
22-Feb 00:14 backupsrv-sd2 JobId 57852: Error: bsock.c:529 Read expected 65568 
got 1448 from client:123.45.67.81:36643
22-Feb 00:14 backupsrv-dir JobId 57852: Fatal error: No Job status returned 
from FD.

22-Feb 00:16 backupsrv-dir JobId 57821: Fatal error: Network error with FD 
during Backup: ERR=Broken pipe
22-Feb 00:16 backupsrv-sd JobId 57821: Job write elapsed time = 00:57:00, 
Transfer rate = 26.69 K Bytes/second
22-Feb 00:16 backupsrv-dir JobId 57821: Fatal error: No Job status returned 
from FD.

22-Feb 00:24 winsrv1-fd JobId 57784: Error: 
/home/kern/bacula/k/bacula/src/lib/bsock.c:393 Write error sending 9363 bytes 
to Storage daemon:backupsrv.url:9103: ERR=Input/output error
22-Feb 00:24 winsrv1-fd JobId 57784: Fatal error: 
/home/kern/bacula/k/bacula/src/filed/backup.c:1024 Network send error to SD. 
ERR=Input/output error
22-Feb 00:26 winsrv1-fd JobId 57784: Error: 
/home/kern/bacula/k/bacula/src/lib/bsock.c:339 Socket has errors=1 on call to 
Storage daemon:backupsrv.url:9103

(this one runs from the same backup-network to another SD)
22-Feb 01:33 backupsrv-dir JobId 57872: Fatal error: Socket error on 
ClientRunBeforeJob command: ERR=Broken pipe
22-Feb 01:33 backupsrv-dir JobId 57872: Fatal error: Client "winsrv2-fd" 
RunScript failed.
22-Feb 01:33 backupsrv-dir JobId 57872: Fatal error: Network error with FD 
during Backup: ERR=Broken pipe
22-Feb 01:33 backupsrv-sd2 JobId 57872: JobId=57872 
Job="winsrv2.2012-02-22_01.00.00_27" marked to be canceled.
22-Feb 01:33 backupsrv-dir JobId 57872: Fatal error: No Job status returned 
from FD.

22-Feb 01:51 fbsd2-fd JobId 57806: Error: bsock.c:398 Wrote 61750 bytes to 
Storage daemon:backupsrv.url:9103, but only 16384 accepted.
22-Feb 01:51 fbsd2-fd JobId 57806: Fatal error: backup.c:1024 Network send 
error to SD. ERR=Broken pipe
22-Feb 01:51 fbsd2-fd JobId 57806: Error: bsock.c:339 Socket has errors=1 on 
call to Storage daemon:backupsrv.url:9103

22-Feb 02:15 backupsrv-dir JobId 57819: Fatal error: Network error with FD 
during Backup: ERR=Connection reset by peer
22-Feb 02:15 backupsrv-dir JobId 57819: Fatal error: No Job status returned 
from FD.


These jobs have been failing every day for a week now. Meanwhile, other jobs 
complete just fine, and the failures do not seem to be related to job size or 
to scripts run before the jobs on the clients.
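
To compare the failures more easily, here is a minimal Python sketch that groups 
the ERR= reasons by JobId. It assumes the log has the same layout as the 
excerpts above (date, time, daemon name, "JobId N:", message) and that long 
messages are wrapped onto following lines as in this mail. The names 
join_wrapped and errors_by_job are made up for this example and are not part 
of Bacula.

```python
import re
from collections import defaultdict

# Start of a message as it appears in the excerpts above, e.g.
# "21-Feb 22:42 fbsd1-fd JobId 57779: Fatal error: ..."
HEADER = re.compile(r"^(\d{2}-[A-Za-z]{3} \d{2}:\d{2}) (\S+) JobId (\d+): (.*)$")
ERR = re.compile(r"ERR=(.+)$")


def join_wrapped(lines):
    """Re-join messages that were wrapped across several lines.

    A blank line ends the current message, so free-text remarks between
    log excerpts are not glued onto the preceding message.
    """
    messages = []
    in_message = False
    for line in lines:
        line = line.strip()
        if not line:
            in_message = False
            continue
        if HEADER.match(line):
            messages.append(line)
            in_message = True
        elif in_message:
            messages[-1] += " " + line
    return messages


def errors_by_job(log_text):
    """Return {JobId: {(daemon, ERR reason), ...}} for messages carrying ERR=."""
    result = defaultdict(set)
    for msg in join_wrapped(log_text.splitlines()):
        m = HEADER.match(msg)
        if not m:
            continue
        _, daemon, jobid, text = m.groups()
        e = ERR.search(text)
        if e:
            result[int(jobid)].add((daemon, e.group(1).strip()))
    return dict(result)
```

Feeding it the saved job log (for example errors_by_job(open("joblog.txt").read()), 
where joblog.txt is a hypothetical file name) gives one entry per failed job, 
which makes it easier to see which daemon reported which error.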

Any idea what could be wrong?

-- 
Silver
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users
