Re: [Bacula-users] Fwd: Windows jobs keep failing part way or midway through

2015-01-14 Thread Radosław Korzeniewski
Hello,

2015-01-09 16:36 GMT+01:00 Justin Edmands :

>
> We have a bunch of Linux and a bunch of Windows clients. All Linux clients
> finish the nightly backup successfully. Most Windows clients will fail to
> complete the job after a few GB. It doesn't appear to be the same file
> across all backups. I watched a few die off and didn't notice a pattern in
> the files. Attempting to run a new Full of a failing Windows box also
> fails.
>
> Box is new machine with Xeon processor/gigE/plenty of RAM/ and dedicated
> RAID via Areca card.
>
> The job fails with:
>
> Error: /home/kern/bacula/k/bacula/src/lib/bsock.c:393 Write error sending
> 63071 bytes to Storage daemon:hqbacula1:9103: ERR=Input/output error"
>

This is a network related problem. A dozen possibilities available.
Firewall or IDS killing a connection, bad network card or driver, etc...

You have to check it all progressively.

best regrds
-- 
Radosław Korzeniewski
rados...@korzeniewski.net
--
New Year. New Location. New Benefits. New Data Center in Ashburn, VA.
GigeNET is offering a free month of service with a new server in Ashburn.
Choose from 2 high performing configs, both with 100TB of bandwidth.
Higher redundancy.Lower latency.Increased capacity.Completely compliant.
http://p.sf.net/sfu/gigenet___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


[Bacula-users] persistent errors after 5.2.13 upgrade

2015-01-14 Thread Dimitri Maziuk
Hi all,

I recently rebuilt one of my centos 6 setups with 5.2.13 giallu rpms,
now I'm getting these all the time. This was not happening before with
the stock 5.0 :

> 11-Jan 23:43 rendena-sd JobId 51: New volume "vchanger_0002_0011" mounted on 
> device "Vchanger-Drive" (/var/spool/bacula/vchanger/0/drive0) at 11-Jan-2015 
> 23:43.
> 12-Jan 02:20 rendena-sd JobId 51: Despooling elapsed time = 06:46:13, 
> Transfer rate = 2.043 M Bytes/second
> 12-Jan 02:20 rendena-sd JobId 51: Spooling data again ...
> 13-Jan 14:51 rendena-sd JobId 51: User specified Job spool size reached: 
> JobSpoolSize=49,807,365,050 MaxJobSpoolSize=49,807,360,000
> 13-Jan 14:51 rendena-sd JobId 51: Writing spooled data to Volume. Despooling 
> 49,807,365,050 bytes ...
> 13-Jan 21:13 rendena-dir JobId 51: Error: Watchdog sending kill after 518401 
> secs to thread stalled reading File daemon.
> 13-Jan 21:13 rendena-dir JobId 51: Fatal error: Network error with FD during 
> Backup: ERR=Interrupted system call
> 13-Jan 21:14 rendena-dir JobId 51: Fatal error: No Job status returned from 
> FD.
> 13-Jan 21:14 rendena-dir JobId 51: Error: Bacula rendena-dir 5.2.13 (19Jan13):
>   Build OS:   x86_64-redhat-linux-gnu redhat Enterprise release

This keeps happening on several clients. One thing I can see in the logs
is that the ones that fail fill up the spool disk whereas the ones that
succeed don't, e.g.

> 13-Jan 21:16 rendena-sd JobId 249: Committing spooled data to Volume 
> "vchanger_0002_0014". Despooling 171,322,987 bytes ...
> 13-Jan 21:18 rendena-sd JobId 249: Despooling elapsed time = 00:01:47, 
> Transfer rate = 1.601 M Bytes/second
> 13-Jan 21:18 rendena-sd JobId 249: Elapsed time=00:13:27, Transfer rate=212.0 
> K Bytes/second
> 13-Jan 21:18 rendena-sd JobId 249: Sending spooled attrs to the Director. 
> Despooling 482,710 bytes ...

Is there a timeout I need to change someplace because I have a 4TB spool
disk?

TIA
-- 
Dimitri Maziuk
Programmer/sysadmin
BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu



signature.asc
Description: OpenPGP digital signature
--
New Year. New Location. New Benefits. New Data Center in Ashburn, VA.
GigeNET is offering a free month of service with a new server in Ashburn.
Choose from 2 high performing configs, both with 100TB of bandwidth.
Higher redundancy.Lower latency.Increased capacity.Completely compliant.
http://p.sf.net/sfu/gigenet___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] 0 files restored from bacula backup

2015-01-14 Thread Peter van Heusden
No, about 900 files were marked, as can be seen in the files expected line.
I did this by marking a directory, which, as I understand it, recursively
marks everything under that directory.

Peter
On Jan 14, 2015 3:59 PM, "John Drescher"  wrote:

> On Wed, Jan 14, 2015 at 8:21 AM, Peter van Heusden 
> wrote:
> > Hi there
> >
> > I did a backup of one of our servers in early December 2014. The backup
> was
> > written to a LTO3 tape on the IBM TS3200 tape changer that we have. The
> > backup server is running bacula 5.0.0 and the client is bacula 2.2.8.
> >
> > I did a test restore soon after the backup completed and successfully
> > restored some 4700 files (totalling 50 MB). Today I tried to do another
> > restore from the data job and when the restore completed I got the
> message:
> >
> > 14-Jan 13:31 bacula-dir JobId 694: Bacula bacula-dir 5.0.0 (26Jan10):
> > 14-Jan-2015 13:31:00
> >   Build OS:   x86_64-redhat-linux-gnu redhat
> >   JobId:  694
> >   Job:RestoreFiles.2015-01-14_13.28.46_09
> >   Restore Client: backup-server
> >   Start time: 14-Jan-2015 13:28:48
> >   End time:   14-Jan-2015 13:31:00
> >   Files Expected: 915
> >   Files Restored: 0
> >   Bytes Restored: 0
> >   Rate:   0.0 KB/s
> >   FD Errors:  0
> >   FD termination status:  OK
> >   SD termination status:  OK
> >   Termination:Restore OK -- warning file count mismatch
> >
> >
> > No errors were reported but nothing was restored. Does anyone know what
> has
> > happened to the data? And is there anything that can be done to get it
> back?
> >
>
> Did you forget to mark the files you want to restore?
>
> John
>
--
New Year. New Location. New Benefits. New Data Center in Ashburn, VA.
GigeNET is offering a free month of service with a new server in Ashburn.
Choose from 2 high performing configs, both with 100TB of bandwidth.
Higher redundancy.Lower latency.Increased capacity.Completely compliant.
http://p.sf.net/sfu/gigenet___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Bacula 7 does not like duplicate Address for Storage

2015-01-14 Thread Dan Langille

> On Jan 14, 2015, at 2:29 AM, Eric Bollengier 
>  wrote:
> 
> Hello Dan,
> 
> 
> On 14/01/2015 01:58, Dan Langille wrote:
>> 
>> If they all have distinct values for Address, it works. If not, one or more 
>> Storage are silently ignored.
>> 
> 
> You are right, some users have many Storage resources in their
> configuration, and now, by default, bacula is trying to list only
> Storage resources that are unique in the status command.
> 
> If you do a status of the first Resource or the second one, you end up
> with the same output.
> 
> To get the full list, you can use the keyword "select" in your status
> command.
> 
> status storage select

I was sure this was a bug because when I first tried a copy Job, it failed to 
locate the SD.  I now think
this was related to TLS issues.  After the upgrade, I found that I could no 
longer run a particular Copy Job.
It would complain:

14-Jan 00:09 bacula-dir JobId 196621: Using Device "DTL03" to write.
14-Jan 00:09 crey-sd JobId 196621: Fatal error: bnet.c:278 TLS host certificate 
verification failed. Host name "crey.example.org" did not match presented 
certificate
14-Jan 00:09 crey-sd JobId 196621: Fatal error: TLS negotiation failed.
14-Jan 00:09 bacula-dir JobId 196621: Fatal error: Bad response to Storage 
command: wanted 2000 OK storage

Copy jobs had run successfully earlier in the day without issue.  The only 
change since then was the upgrade to Bacula 7.

Granted, I may have changed the Address field in bacula-dir.conf some time ago 
and not issued a reload, but I can't be sure.

Reissuing the certs with an amended FQDN fixed that issue.

— 
Dan Langille
http://langille.org/






--
New Year. New Location. New Benefits. New Data Center in Ashburn, VA.
GigeNET is offering a free month of service with a new server in Ashburn.
Choose from 2 high performing configs, both with 100TB of bandwidth.
Higher redundancy.Lower latency.Increased capacity.Completely compliant.
http://p.sf.net/sfu/gigenet
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] 0 files restored from bacula backup

2015-01-14 Thread John Drescher
On Wed, Jan 14, 2015 at 8:21 AM, Peter van Heusden  wrote:
> Hi there
>
> I did a backup of one of our servers in early December 2014. The backup was
> written to a LTO3 tape on the IBM TS3200 tape changer that we have. The
> backup server is running bacula 5.0.0 and the client is bacula 2.2.8.
>
> I did a test restore soon after the backup completed and successfully
> restored some 4700 files (totalling 50 MB). Today I tried to do another
> restore from the data job and when the restore completed I got the message:
>
> 14-Jan 13:31 bacula-dir JobId 694: Bacula bacula-dir 5.0.0 (26Jan10):
> 14-Jan-2015 13:31:00
>   Build OS:   x86_64-redhat-linux-gnu redhat
>   JobId:  694
>   Job:RestoreFiles.2015-01-14_13.28.46_09
>   Restore Client: backup-server
>   Start time: 14-Jan-2015 13:28:48
>   End time:   14-Jan-2015 13:31:00
>   Files Expected: 915
>   Files Restored: 0
>   Bytes Restored: 0
>   Rate:   0.0 KB/s
>   FD Errors:  0
>   FD termination status:  OK
>   SD termination status:  OK
>   Termination:Restore OK -- warning file count mismatch
>
>
> No errors were reported but nothing was restored. Does anyone know what has
> happened to the data? And is there anything that can be done to get it back?
>

Did you forget to mark the files you want to restore?

John

--
New Year. New Location. New Benefits. New Data Center in Ashburn, VA.
GigeNET is offering a free month of service with a new server in Ashburn.
Choose from 2 high performing configs, both with 100TB of bandwidth.
Higher redundancy.Lower latency.Increased capacity.Completely compliant.
http://p.sf.net/sfu/gigenet
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


[Bacula-users] 0 files restored from bacula backup

2015-01-14 Thread Peter van Heusden
Hi there

I did a backup of one of our servers in early December 2014. The backup was
written to a LTO3 tape on the IBM TS3200 tape changer that we have. The
backup server is running bacula 5.0.0 and the client is bacula 2.2.8.

I did a test restore soon after the backup completed and successfully
restored some 4700 files (totalling 50 MB). Today I tried to do another
restore from the data job and when the restore completed I got the message:

14-Jan 13:31 bacula-dir JobId 694: Bacula bacula-dir 5.0.0 (26Jan10):
14-Jan-2015 13:31:00
  Build OS:   x86_64-redhat-linux-gnu redhat
  JobId:  694
  Job:RestoreFiles.2015-01-14_13.28.46_09
  Restore Client: backup-server
  Start time: 14-Jan-2015 13:28:48
  End time:   14-Jan-2015 13:31:00
  Files Expected: 915
  Files Restored: 0
  Bytes Restored: 0
  Rate:   0.0 KB/s
  FD Errors:  0
  FD termination status:  OK
  SD termination status:  OK
  Termination:Restore OK -- warning file count mismatch


No errors were reported but nothing was restored. Does anyone know what has
happened to the data? And is there anything that can be done to get it back?

Thanks!
Peter
--
New Year. New Location. New Benefits. New Data Center in Ashburn, VA.
GigeNET is offering a free month of service with a new server in Ashburn.
Choose from 2 high performing configs, both with 100TB of bandwidth.
Higher redundancy.Lower latency.Increased capacity.Completely compliant.
http://p.sf.net/sfu/gigenet___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] btape fill failure with LTO-3 on FreeBSD 9.3

2015-01-14 Thread Georg Altmann
Am 14.01.2015 um 00:52 schrieb Dan Langille:
> 
>> On Jan 4, 2015, at 5:05 PM, Georg Altmann  wrote:
>> The prompt
>>
>> Mount Volume "TestVolume1" on device "LTO3-0" (/dev/nsa0) and press
>> return when ready:
>>
>> just re-appears every time I hit enter. I have attached the full btape
>> output.
>>
>> What about the error message "Error: mount.c:834 Hey! WroteVol
>> non-zero !" ?
>>
>> This is the bacula-sd configuration of the device:
>>
>> Device {
>>  Name = LTO3-0
>>  Media Type = LTO3
>>  Archive Device = /dev/nsa0
>>  AutoChanger = no;
>>  Spool Directory = /bspool/bacula
>>  AutomaticMount = yes;   # when device opened, read it
>>  AlwaysOpen = yes;
>>  RemovableMedia = yes;
>>  RandomAccess = no;
>>  Hardware End of Medium = no;
>> }
> 
> Your configuration is interesting.  Is this a standalone tape drive?

Yes, this is a standalone drive.

>>
>> This comes as a bit of a surprise to me, since I successfully operated
>> an HP LTO-1 drive with the same configuration on FreeBSD successfully.
>> However, this was a previous version of FreeBSD and bacula.
>> Operating the drive with tar and mt (fsf) works just fine.
>>
>> Might this just be a regression with btape?
> 
> When you ran your successful backup and restore spanning two LTO3 tapes, did 
> you do a diff on the original version?

Yes, I md5'ed the files before the backup and after the restore and
diffed the checksums. The checksums were identical.

I am quite convinced that that configuration I have is fine and that
this is a btape problem.
The drive is now in productive operation (remote!) and I don't want to
mess with the config. I might do some tests once I am on location again.
To my understanding, it would make sense to look into how btape differs
from bacula-sd in handling the drive.

As said before, this configuration has worked just fine for LTO1 drives
on FreeBSD with and without an autoloader. I am guessing that the
SCSI/IOCTL interface hasn't changed between the LTO generations.

Thank you for your input, Dan!

Regards,
Georg

-- 
PGP-Key: 0x1E320E65
D150 7783 A0D1 7507 1266  C5B3 BBF1 9C42 1E32 0E65

I don't like the idea of secret agencies to analyse and archive
personal communication. GnuPG is available as open source, free as as in
freedom, as a countermeasure. I use http://www.enigmail.net/ for Mozilla
Thunderbird. If you can, please use a frontend of your choice to send me
encrypted e-mail. See http://www.gnupg.org/ for an overview.



signature.asc
Description: OpenPGP digital signature
--
New Year. New Location. New Benefits. New Data Center in Ashburn, VA.
GigeNET is offering a free month of service with a new server in Ashburn.
Choose from 2 high performing configs, both with 100TB of bandwidth.
Higher redundancy.Lower latency.Increased capacity.Completely compliant.
http://p.sf.net/sfu/gigenet___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users