Hello, On 2/12/2007 11:43 AM, Daniel Holtkamp wrote: > Hi ! > > My bacula 2.0.1 installation is running quite nicely except for some > servers. I`ll use only one of these as an example as the others have the > same problem. > > This one server has to backup more than 5 million files that are very > small (usually less than 2KB). The problem is that the performance > impact backing up these files is enormous. > > Here is a little sniplet from the last (unfinished) backup. > > Elapsed time: 23 hours 51 mins 42 secs > Priority: 10 > FD Files Written: 3,562,070 > SD Files Written: 3,561,858 > FD Bytes Written: 2,507,509,039 (2.507 GB) > SD Bytes Written: 3,088,552,545 (3.088 GB) > Rate: 29.2 KB/s > > At that time the backup ran for almost a complete day and it still has > to backup 2+ million files that make up for about 3 GB of data. As you > can see the rate is VERY slow. I have of course enabled attribute > spooling to take the database out of the equation. Also the backup goes > to diskbased-volumes. It only gets this slow when it gets to the loads > of small files - prior to that the backup rate is perfectly acceptable > with 2MB/s.
Such a number of tiny files is usually a problem. There are several reasons to this, IMO: Disk seeks (often 2 per file: read inode, read data) which is hard to avoid Other possible limitations on backup throughput can be minmized, I hope: > The fileset for this server is this: > > FileSet { > Name = "X400mta" > Include { > Options { > exclude = yes > wilddir = "/var/tmp" > regexdir = "/var/[cache/man|catman]/[cat?|X11R6/cat?|local/cat?]" Probably a probelm. You could try to expand the directories in the configuration. > compression=GZIP Try running the job without compression. You could even check if compression matters much with this special fileset. > signature=SHA1 This one might be the limiting factor: SHA1 means lots of CPU work. Depending on the data you store, you could perhaps run this fileset without computing signatures, or use the less cpu-intensive MD5 alternative. > } > File = / > File = /opt > File = /usr > File = /var > File = /export/home > } > Include { > Options { > regexdir = "/var/[cache/man|catman]/[cat?|X11R6/cat?|local/cat?]" > keepatime=yes > mtimeonly=yes > compression=GZIP > signature=SHA1 > } > File = /var/tmp > } > Exclude { > File = .autofsck > File = /proc > File = /tmp > File = .journal > File = /opt/rsi/archive > File = /opt/rsi/spool > File = /opt/x400/mtadata/logfiles > } > } > > Any ideas on how to improve performance here ? Can the excludes be a > problem ? Or the Regex ? The regex might be a problem, but I'd start with compression and signatures first... both can be quite important, so if these are what makes your backups slow you've got to choose... > Also what influences the performance on migrating data ? I have no idea whatsoever... except that I would observe the systems load when running migration jobs. Not only the load itself, but also i/o wait times and memory usage. Arno > I`ve had migration processes running nicely at 15MB/s (max for > tapedrive) and some go at a measily 1 MB/s - from the same disk-array to > the same tapedrive of course. > > Best regards, > Daniel Holtkamp > > > ------------------------------------------------------------------------ > > ------------------------------------------------------------------------- > Using Tomcat but need to do more? Need to support web services, security? > Get stuff done quickly with pre-integrated technology to make your job easier. > Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo > http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 > > > ------------------------------------------------------------------------ > > _______________________________________________ > Bacula-users mailing list > Bacula-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/bacula-users -- IT-Service Lehmann [EMAIL PROTECTED] Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier. Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users