Re: [Bacula-users] bacula 3.0.3 maintain ctime on restored files?

2010-01-15 Thread Bob Hetzel

 In the case here it's rather a large issue (granted due to other 
 problems like hardware or software issues that require restores of data 
 and is NOT something that I want to continue, however we are living in 
 an imperfect world) but to give you an idea the dataset size is about 
 30-40TiB with restores anywhere from 2TiB to a full restore to back that 
 back up again not only takes a LOT of tapes which is costly it also 
 takes a LOT of time (day/weeks) where nothing else can be run.
 
 What I have been doing which is just painful is do a full restore, then 
 have to do a full backup right after then continue so a restore is 
 actually the time of about 2x of a full.  (for a full set this is about 
 9-10 days on LTO4).   This has happened about 3 times so far in the past 
 2 months.

Have you considered breaking up such a big dataset into pieces that only 
take up perhaps 5 tapes or less each?  I know it can be a pain to manage, 
but it seems like that would go a long way toward.  The other benefit is 
that if a tape breaks during the restore (or is just discovered to be 
unreadable) you can still get 100% of the other datasets restored.  IMHO, 
any time you realize you can't back up your fileset in one day or even one 
weekend that's a good opportunity to take a step back and re-think things. 
  Perhaps you might even need to take the step of scheduling the fulls so 
they don't occur on the same schedule.

I realize you that your periodic audits of your filesets to make sure 
you're not missing something become a lot more complicated, but a painful 
audit once per quarter seems like a lot better than what you've described 
above.




--
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] bacula 3.0.3 maintain ctime on restored files?

2010-01-15 Thread Steve Costaras


Yes, unfortunately the directory names and the amount of data in them 
varies dramatically (image file processing for each project) so there is 
no real way to break it apart without a large chance of missing 
something.Ideally I would like to have the SD multi-plex the files 
to different tape drives (i.e. get linear improvement with the number of 
drives) but haven't looked into it as it's only a problem with full 
backups/restores which /should/ be infrequent except now when we're 
having other problems.




On 01/15/2010 09:02, Bob Hetzel wrote:
   

In the case here it's rather a large issue (granted due to other
problems like hardware or software issues that require restores of data
and is NOT something that I want to continue, however we are living in
an imperfect world) but to give you an idea the dataset size is about
30-40TiB with restores anywhere from 2TiB to a full restore to back that
back up again not only takes a LOT of tapes which is costly it also
takes a LOT of time (day/weeks) where nothing else can be run.

What I have been doing which is just painful is do a full restore, then
have to do a full backup right after then continue so a restore is
actually the time of about 2x of a full.  (for a full set this is about
9-10 days on LTO4).   This has happened about 3 times so far in the past
2 months.
 

Have you considered breaking up such a big dataset into pieces that only
take up perhaps 5 tapes or less each?  I know it can be a pain to manage,
but it seems like that would go a long way toward.  The other benefit is
that if a tape breaks during the restore (or is just discovered to be
unreadable) you can still get 100% of the other datasets restored.  IMHO,
any time you realize you can't back up your fileset in one day or even one
weekend that's a good opportunity to take a step back and re-think things.
   Perhaps you might even need to take the step of scheduling the fulls so
they don't occur on the same schedule.

I realize you that your periodic audits of your filesets to make sure
you're not missing something become a lot more complicated, but a painful
audit once per quarter seems like a lot better than what you've described
above.




--
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


   
--
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] bacula 3.0.3 maintain ctime on restored files?

2010-01-14 Thread Martin Simmons
 On Wed, 13 Jan 2010 20:18:19 -0600, Steve Costaras said:
 
 Ok, found out why when I do a restore of files bacula keeps thinking 
 that they are 'new' and will back them up again.   Seems that bacula 
 changes ctime to the time of the restore of the file not the original 
 ctime.   atime  mtime are properly set on the files at restore but not 
 ctime.
 
 I didn't see anything in the on-line docs, is there a flag in the 
 fileset directive to keep ALL time values (atime, mtime, ctime) to be 
 exactly as they were on the original file when doing a restore?

No, there is no way to do that.  It is impossible to modify the ctime on unix
without hacking the disk directly.

__Martin

--
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] bacula 3.0.3 maintain ctime on restored files?

2010-01-14 Thread Steve Costaras
Ok, then let me ask this in a different way.  How do you prevent Bacula 
from backing up files that were just restored?


I see the mtimeonly flag in the fileset options but there are many 
caveats about using it as you will miss other files that may have been 
copied over that have retained mtimes from before the last backup.   
Since bacula does an MD5/SHA1 hash of all files I assumed (wrongly it 
seems) that it would be smart enough to not back up files that it 
already had backed up and are on tape.




On 01/14/2010 15:11, Martin Simmons wrote:

On Wed, 13 Jan 2010 20:18:19 -0600, Steve Costaras said:
 

Ok, found out why when I do a restore of files bacula keeps thinking
that they are 'new' and will back them up again.   Seems that bacula
changes ctime to the time of the restore of the file not the original
ctime.   atime  mtime are properly set on the files at restore but not
ctime.

I didn't see anything in the on-line docs, is there a flag in the
fileset directive to keep ALL time values (atime, mtime, ctime) to be
exactly as they were on the original file when doing a restore?
 

No, there is no way to do that.  It is impossible to modify the ctime on unix
without hacking the disk directly.

__Martin

--
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


   
--
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] bacula 3.0.3 maintain ctime on restored files?

2010-01-14 Thread Dan Langille
Steve Costaras wrote:
 On 01/14/2010 15:11, Martin Simmons wrote:
 On Wed, 13 Jan 2010 20:18:19 -0600, Steve Costaras said:
 
 Ok, found out why when I do a restore of files bacula keeps thinking 
 that they are 'new' and will back them up again.   Seems that bacula 
 changes ctime to the time of the restore of the file not the original 
 ctime.   atime  mtime are properly set on the files at restore but not 
 ctime.

 I didn't see anything in the on-line docs, is there a flag in the 
 fileset directive to keep ALL time values (atime, mtime, ctime) to be 
 exactly as they were on the original file when doing a restore?
 
 No, there is no way to do that.  It is impossible to modify the ctime on unix
 without hacking the disk directly.

Please reply at the bottom of your message.  It makes it much easier to 
follow the issue.

  Ok, then let me ask this in a different way.  How do you prevent
  Bacula from backing up files that were just restored?

It appears you can't.  At least, not easily.

  I see the mtimeonly flag in the fileset options but there are many
  caveats about using it as you will miss other files that may have been
  copied over that have retained mtimes from before the last backup.
  Since bacula does an MD5/SHA1 hash of all files I assumed (wrongly it
  seems) that it would be smart enough to not back up files that it
  already had backed up and are on tape.

Smart enough?  Sheesh.  ;)

That hash is to ensure the file is restored properly.  And for 
verfication.  To do what you want is not easy.

How big of a problem is this for you?

Have you looked into the Virtual Backups, although I'm not sure this 
would help you.


--
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] bacula 3.0.3 maintain ctime on restored files?

2010-01-14 Thread Steve Costaras


On 01/14/2010 15:59, Dan Langille wrote:
 Steve Costaras wrote:

  I see the mtimeonly flag in the fileset options but there are many
  caveats about using it as you will miss other files that may have been
  copied over that have retained mtimes from before the last backup.
  Since bacula does an MD5/SHA1 hash of all files I assumed (wrongly it
  seems) that it would be smart enough to not back up files that it
  already had backed up and are on tape.

 Smart enough?  Sheesh.  ;)

 That hash is to ensure the file is restored properly.  And for 
 verfication.  To do what you want is not easy.

 How big of a problem is this for you?

 Have you looked into the Virtual Backups, although I'm not sure this 
 would help you.

:)  Well I figured it would be relatively easy as an option (since the 
hash is in the database and when a file is read from disk for backup 
(since it's planning on backing it up anyway it would need to read the 
file, if the file name  hash match those that are in the database the 
file could be skipped.   (for 'real' completeness and to keep in line w/ 
the accurate option perhaps update the database with permissions et al 
on the inode but since the content matches that would save a lot of tapes).

In the case here it's rather a large issue (granted due to other 
problems like hardware or software issues that require restores of data 
and is NOT something that I want to continue, however we are living in 
an imperfect world) but to give you an idea the dataset size is about 
30-40TiB with restores anywhere from 2TiB to a full restore to back that 
back up again not only takes a LOT of tapes which is costly it also 
takes a LOT of time (day/weeks) where nothing else can be run.

What I have been doing which is just painful is do a full restore, then 
have to do a full backup right after then continue so a restore is 
actually the time of about 2x of a full.  (for a full set this is about 
9-10 days on LTO4).   This has happened about 3 times so far in the past 
2 months.



--
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] bacula 3.0.3 maintain ctime on restored files?

2010-01-14 Thread Dan Langille
Steve Costaras wrote:
 
 On 01/14/2010 15:59, Dan Langille wrote:
 Steve Costaras wrote:

 I see the mtimeonly flag in the fileset options but there are many
 caveats about using it as you will miss other files that may have been
 copied over that have retained mtimes from before the last backup.
 Since bacula does an MD5/SHA1 hash of all files I assumed (wrongly it
 seems) that it would be smart enough to not back up files that it
 already had backed up and are on tape.
 Smart enough?  Sheesh.  ;)

 That hash is to ensure the file is restored properly.  And for 
 verfication.  To do what you want is not easy.

 How big of a problem is this for you?

 Have you looked into the Virtual Backups, although I'm not sure this 
 would help you.
 
 :)  Well I figured it would be relatively easy as an option (since the 
 hash is in the database and when a file is read from disk for backup 
 (since it's planning on backing it up anyway it would need to read the 
 file, if the file name  hash match those that are in the database the 
 file could be skipped.   (for 'real' completeness and to keep in line w/ 
 the accurate option perhaps update the database with permissions et al 
 on the inode but since the content matches that would save a lot of tapes).

In the database... not on the client.  That's the issue.  Let us not 
discuss this here.  It is not a trivial problem to do correctly.

 In the case here it's rather a large issue (granted due to other 
 problems like hardware or software issues that require restores of data 
 and is NOT something that I want to continue, however we are living in 
 an imperfect world) but to give you an idea the dataset size is about 
 30-40TiB with restores anywhere from 2TiB to a full restore to back that 
 back up again not only takes a LOT of tapes which is costly it also 
 takes a LOT of time (day/weeks) where nothing else can be run.
 
 What I have been doing which is just painful is do a full restore, then 
 have to do a full backup right after then continue so a restore is 
 actually the time of about 2x of a full.  (for a full set this is about 
 9-10 days on LTO4).   This has happened about 3 times so far in the past 
 2 months.

I would start a new thread.  How to avoid backing up restored files.

This one has gone its course.

--
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] bacula 3.0.3 maintain ctime on restored files?

2010-01-13 Thread Dan Langille
Steve Costaras wrote:
 Ok, found out why when I do a restore of files bacula keeps thinking 
 that they are 'new' and will back them up again.   Seems that bacula 
 changes ctime to the time of the restore of the file not the original 
 ctime.   atime  mtime are properly set on the files at restore but not 
 ctime.

It seems? Have you verified by looking at ctime?

What OS?

 I didn't see anything in the on-line docs, is there a flag in the 
 fileset directive to keep ALL time values (atime, mtime, ctime) to be 
 exactly as they were on the original file when doing a restore?
 
 (I did see the keepatime flag but from reading that only affects atime?)

Note, if you use this feature, when Bacula resets the access time, the 
change time (st_ctime) will automatically be modified by the system, so 
on the next incremental job, the file will be backed up even if it has 
not changed. As a consequence, you will probably also want to use 
mtimeonly = yes as well as keepatime (thanks to Rudolf Cejka for this 
tip). 



--
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] bacula 3.0.3 maintain ctime on restored files?

2010-01-13 Thread Steve Costaras



On 01/13/2010 20:36, Dan Langille wrote:

Steve Costaras wrote:
Ok, found out why when I do a restore of files bacula keeps thinking 
that they are 'new' and will back them up again.   Seems that bacula 
changes ctime to the time of the restore of the file not the original 
ctime.   atime  mtime are properly set on the files at restore but 
not ctime.


It seems? Have you verified by looking at ctime?

What OS?


Yes, sorry for the ambiguity.   ctime is set to the time of the restore 
of the particular file.   atime/mtime show that the times are set to the 
original file's times.


OS is linux
Linux loki 2.6.24-26-server #1 SMP Tue Dec 1 18:26:43 UTC 2009 x86_64 
GNU/Linux


file system is XFS; using SQLlite3 as the database but that shouldn't 
matter here.



I didn't see anything in the on-line docs, is there a flag in the 
fileset directive to keep ALL time values (atime, mtime, ctime) to be 
exactly as they were on the original file when doing a restore?


(I did see the keepatime flag but from reading that only affects atime?)


Note, if you use this feature, when Bacula resets the access time, 
the change time (st_ctime) will automatically be modified by the 
system, so on the next incremental job, the file will be backed up 
even if it has not changed. As a consequence, you will probably also 
want to use mtimeonly = yes as well as keepatime (thanks to Rudolf 
Cejka for this tip). 


I saw that comment which is why I mentioned it as what I'm seeing is the 
opposite.  I do not have keepatime set at all and it IS being set to 
current time at the time of restore.


-
FileSet {
  Name = loki-FileSet
  Ignore FileSet Changes = yes
  Include {
Options {
  accurate=mcs1
  checkfilechanges=yes
  hardlinks=yes
  noatime=yes
  onefs=yes
  recurse=yes
  signature=SHA1
  sparse=yes
  verify=pns1
}
File = /
File = /boot
File = /home
File = /var/ftp
  }
  Exclude {
File = /cdrom
File = /dev
File = /lost+found
File = /media
File = /opt/bacula/var/bacula/spool
File = /opt/bacula/var/bacula/working
File = /proc
File = /sys
File = /tmp
File = /var/ftp/tmp
File = /var/lock
File = /var/run
File = /var/tmp
File = /.fsck
File = /.journal
  }
}


--
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users