[Bacula-users] Maximum Concurrent Jobs: sometimes only one job running
Hi I have configured my system to run 5 jobs at the same time. It works well with the the first 5 jobs but then it is a bit on and off. At the gant chart below show the 5 jobs starting a Full backup 1st november 22:05. Client asrv0 runs for 6 hours 11 minutes but in the time between bsrv2 ends and bsrv5 starts only one job is running. I did expect to always see at least 5 '#' marks vertically. It seems like the director starts 5 jobs and when all of them are done, it starts the 5 next jobs. I expected that as long as the 'Priority' of all the jobs are the same then a new job would be started when one is done. 1 asrv0 01 22:05 # asrv0 06:11 2 bsrv1 01 22:05 # bsrv1 00:00 3 bsrv2 01 22:05 ### bsrv2 03:04 4 asrv3 01 22:14 ### asrv3 01:25 5 bsrv4 01 22:14 # bsrv4 02:27 6 bsrv5 02 03:49 ---## bsrv5 00:38 7 bsrv6 02 03:51 --- bsrv6 01:32 8 bsrv7 02 04:09 ### bsrv7 03:09 9 bsrv8 02 04:18 bsrv8 01:53 10 tsrv9 02 04:30 -## tsrv9 06:32 11 csrv10 02 07:08 --# csrv10 00:01 12 csrv11 02 07:08 --# csrv11 00:13 13 csrv12 02 07:11 -- csrv12 01:42 14 dsrv13 02 07:21 --# dsrv13 02:28 15 csrv14 02 07:34 --- csrv14 05:57 16 dsrv15 02 13:18 --### Bacula version 5.0.2 Any help much appreciated. (Before you ask: The gant chart is from a QND perl script) -- Venlig hilsen Hans Schou -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Maximum Concurrent Jobs: sometimes only one job running
Le 2013-11-05 10:20, Hans Schou a écrit : Hi I have configured my system to run 5 jobs at the same time. It works well with the the first 5 jobs but then it is a bit on and off. At the gant chart below show the 5 jobs starting a Full backup 1st november 22:05. Client asrv0 runs for 6 hours 11 minutes but in the time between bsrv2 ends and bsrv5 starts only one job is running. I did expect to always see at least 5 '#' marks vertically. It seems like the director starts 5 jobs and when all of them are done, it starts the 5 next jobs. I expected that as long as the 'Priority' of all the jobs are the same then a new job would be started when one is done. 1 asrv0 01 22:05 # asrv0 06:11 2 bsrv1 01 22:05 # bsrv1 00:00 3 bsrv2 01 22:05 ### bsrv2 03:04 4 asrv3 01 22:14 ### asrv3 01:25 5 bsrv4 01 22:14 # bsrv4 02:27 6 bsrv5 02 03:49 ---## bsrv5 00:38 7 bsrv6 02 03:51 --- bsrv6 01:32 8 bsrv7 02 04:09 ### bsrv7 03:09 9 bsrv8 02 04:18 bsrv8 01:53 10 tsrv9 02 04:30 -## tsrv9 06:32 11 csrv10 02 07:08 --# csrv10 00:01 12 csrv11 02 07:08 --# csrv11 00:13 13 csrv12 02 07:11 -- csrv12 01:42 14 dsrv13 02 07:21 --# dsrv13 02:28 15 csrv14 02 07:34 --- csrv14 05:57 16 dsrv15 02 13:18 --### Bacula version 5.0.2 Any help much appreciated. (Before you ask: The gant chart is from a QND perl script) -- Venlig hilsen Hans Schou Hello, Do you use File tapes ? I noticed that when a file tape is full, pending jobs do not start concurrently anymore. Best regards. Jerome Blion. -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] New filesets not getting in the database
Hi Kern, Thank you. I thought that the error with the job was related with the fileset not been in the database, but then it's the other way arround. Regards, -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] Explanation regarding archive pool type
Hi, Which are the differences of archive pool type versus backup? I want to have datas saved forever in my LTO tapes for archiving purposes. Any explanations/suggestions? Thanks in advance Carlo Rengo System Engineer -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Maximum Concurrent Jobs: sometimes only one job running
What is a File tape? Best regards, Kern PS: Please note that Console connections are treated internally by Bacula as a job, and there are something like 6 or 7 different places where concurrent jobs can be set. On 11/05/2013 11:21 AM, Jérôme Blion wrote: Le 2013-11-05 10:20, Hans Schou a écrit : Hi I have configured my system to run 5 jobs at the same time. It works well with the the first 5 jobs but then it is a bit on and off. At the gant chart below show the 5 jobs starting a Full backup 1st november 22:05. Client asrv0 runs for 6 hours 11 minutes but in the time between bsrv2 ends and bsrv5 starts only one job is running. I did expect to always see at least 5 '#' marks vertically. It seems like the director starts 5 jobs and when all of them are done, it starts the 5 next jobs. I expected that as long as the 'Priority' of all the jobs are the same then a new job would be started when one is done. 1 asrv0 01 22:05 # asrv0 06:11 2 bsrv1 01 22:05 # bsrv1 00:00 3 bsrv2 01 22:05 ### bsrv2 03:04 4 asrv3 01 22:14 ### asrv3 01:25 5 bsrv4 01 22:14 # bsrv4 02:27 6 bsrv5 02 03:49 ---## bsrv5 00:38 7 bsrv6 02 03:51 --- bsrv6 01:32 8 bsrv7 02 04:09 ### bsrv7 03:09 9 bsrv8 02 04:18 bsrv8 01:53 10 tsrv9 02 04:30 -## tsrv9 06:32 11 csrv10 02 07:08 --# csrv10 00:01 12 csrv11 02 07:08 --# csrv11 00:13 13 csrv12 02 07:11 -- csrv12 01:42 14 dsrv13 02 07:21 --# dsrv13 02:28 15 csrv14 02 07:34 --- csrv14 05:57 16 dsrv15 02 13:18 --### Bacula version 5.0.2 Any help much appreciated. (Before you ask: The gant chart is from a QND perl script) -- Venlig hilsen Hans Schou Hello, Do you use File tapes ? I noticed that when a file tape is full, pending jobs do not start concurrently anymore. Best regards. Jerome Blion. -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Explanation regarding archive pool type
Archive Pool Types are not implement to the best of my recollection. On 11/05/2013 12:07 PM, Carlo Rengo wrote: Hi, Which are the differences of archive pool type versus backup? I want to have datas saved forever in my LTO tapes for archiving purposes. Any explanations/suggestions? Thanks in advance Carlo Rengo System Engineer -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Maximum Concurrent Jobs: sometimes only one job running
Den 11/05/2013 11:21 AM, Jérôme Blion skrev: Do you use File tapes ? Yes I use Media Type = File I noticed that when a file tape is full, pending jobs do not start concurrently anymore. Ahh, I think you pushed me in the right direction. I think the problem is the Spool Size. My spool size is set to 50G but the actual server which runs alone has 120G of data. I will try to set the spool size to more than the amount of data the biggest client will send and then I guess it is solved. -- Venlig hilsen Hans Schou tel:46923438 -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Backup of many small files
On 11/4/2013 6:55 PM, Dimitri Maziuk wrote: On 11/04/2013 04:17 PM, Phil Stracchino wrote: ... In at least one of the cases I know about, though, the problem was not a failure of DRBD per se, it was that someone accidentally started up mysqld on the second node, which normally would not be allowed to happen because the second instance accessing the same filesystem would not be allowed to lock the InnoDB tablespace. In the DRBD situation, though, there was nothing whatsoever locking the tablespace on the local machine, so it quite cheerfully allowed both mysqlds to write to their own local instances of the tablespace on the replicated filesystem, whereupon the two mysqlds proceeded to destroy the tablespace by making it internally inconsistent. Ah, the active-active setup. I agree: avoid like plague. The simple stupid (and bulletproof) setup is an active/passive cluster with drbd, mysql, and floating ip. Unfortunately if it fails over your bacula-dir is going to lost its connection to the db and kill the running backup -- see my post from 2 days ago. And I think that will always be the case in a fail-over situation as long as Bacula requires the persistent network connection and fails jobs without attempting a reconnect. I think that Open vSwitch could allow live migration of DIR and SD without the connection loss, but not in a fail-over situation where the hardware has dropped the connection. That is why I said that Bacula is not entirely cluster friendly. That is not a deal breaker, since much of the tape hardware SD can use as backup devices require manually plugging the device into another node anyway. I have been running DIR and SD in a kvm VM on a 2-node cluster for a couple of years now without issue, other than it is not fail-over capable. I can run the VM on either node, though it requires manually migrating the VM. It would be nice if Bacula were fail-over capable, but still it works just as well running in a VM on a cluster as on bare metal. -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Maximum Concurrent Jobs: sometimes only one job running
Do you use File tapes ? Yes I use Media Type = File 'Media Type' is one of the gotchas of bacula. You think you should be describing the type of media you're saving to, but in fact the Media Type for each storage definition should be unique. From the docs: Media type names are arbitrary in that you set them to anything you want, but they must be known to the volume database to keep track of which storage daemons can read which volumes. In general, each different storage type should have a unique Media Type associated with it. I use a name that’s descriptive of the storage device. Like: FilesOn8TBRaid, FilesOn4TBDrive, etc. -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Explanation regarding archive pool type
Okay, I was asking about it only because I had seen the webmin interface with that option. However, if you want to backup some datas forever (and, then, delete them from the original source) what is the best procedure? Create a different pool every time you have to save something or keep everything in a single pool? Thanks Carlo Rengo System Engineer Tel: +39 06 5960 2069 Cell: +39 348 923 7278 Fax: +39 06 5960 6185 c.re...@i-node.it I-node s.r.l. We build you® .biz Via Laurentina, 86 - 00142 Roma (RM) Twitter: http://www.twitter.com/i_node Sito Web: http://www.i-node.it Zimbra Blog: Zimbra’s CFO Selected as Finalist for CFO of the Year Award http://bit.ly/1f45y38 - Messaggio originale - Da: Kern Sibbald k...@sibbald.com A: Carlo Rengo c.re...@i-node.it, bacula-users@lists.sourceforge.net Inviato: Martedì, 5 novembre 2013 13:54:32 Oggetto: Re: [Bacula-users] Explanation regarding archive pool type Archive Pool Types are not implement to the best of my recollection. On 11/05/2013 12:07 PM, Carlo Rengo wrote: Hi, Which are the differences of archive pool type versus backup? I want to have datas saved forever in my LTO tapes for archiving purposes. Any explanations/suggestions? Thanks in advance Carlo Rengo System Engineer -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Explanation regarding archive pool type
On Tue, Nov 5, 2013 at 11:28 AM, Carlo Rengo c.re...@i-node.it wrote: Okay, I was asking about it only because I had seen the webmin interface with that option. However, if you want to backup some datas forever (and, then, delete them from the original source) what is the best procedure? Create a different pool every time you have to save something or keep everything in a single pool? I have pools that do not expire. Also pools that do expire. When I want a job to not expire I use a volume from a pool that does not expire. John -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Explanation regarding archive pool type
Okay, I was asking about it only because I had seen the webmin interface with that option. However, if you want to backup some datas forever (and, then, delete them from the original source) what is the best procedure? Create a different pool every time you have to save something or keep everything in a single pool? Do you need to catalogue to stay available, or are you happy for it to expire and to bscan the media back in in the rare case where you might what to restore it? What would be nice is a way to store a sub-catalogue on the media itself, and a way of importing that catalogue back in again when required, as a bscan isn't always ideal. James -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] bacula-fd crashes on FreeBSD 9.2
You are on 9.2-release. Have you run freebsd-update to get the latest security patches? Did you see the post by Dean E. Weimer today? Second: read below. On Nov 4, 2013, at 5:44 PM, David Newman dnew...@networktest.com wrote: On 10/29/13 12:42 PM, Dan Langille wrote: On 2013-10-27 19:33, David Newman wrote: On 10/27/13 11:31 AM, Dan Langille wrote: On Oct 22, 2013, at 3:00 PM, David Newman wrote: On 10/19/13 11:40 PM, Kern Sibbald wrote: Hello, From what I can see -- first signal 0, and second this traceback, this looks a lot like a FreeBSD pthreads bug. First because there is no such thing, at least in userland, as a signal number 0, which I saw in an earlier email. Second, as the traceback below shows, Bacula is waiting on a pthread_cond_timedwait() and while in the pthread_cond_timedwait, which is a system subroutine, it emits a pthread_cond_signal(), probably no problem, followed by a pthread_kill(). That seems odd to me, but perhaps it is how FreeBSD does it, but the net result is that it is killing Bacula. Obviously, this could be a Bacula bug, but it is not occurring elsewhere, and it looks very suspicious to me. You can get more information by compiling with #define DEVELOPER 1 in bacula/src/version.h and ensuring that the -g option is on the compile and that the binaries are not stripped (default for Bacula Makefiles, but not for the FreeBSD ports system). Then if you get another traceback, it may be clearer what is going on. Since this is relatively serious, I would recommend running Bacula under the debugger directly, see the manual on the details of how, then when the debugger gets control after the signal, manually do the thread apply all bt command. FreeBSD gurus, a little help? That's not me. I don't see version.h under the bacula-client port directory. try this: make clean make patch find . -name version.h ./bacula-5.2.12/src/version.h OK, thanks. That works. Kern's email gave three steps. Sorry for the baby questions, but I don't know how to do steps 2 or 3, either. On 10/19/13 11:40 PM, Kern Sibbald wrote: You can get more information by compiling with #define DEVELOPER 1 in bacula/src/version.h That's step 1, which you've helped me find. and ensuring that the -g option is on the compile That's step 2. I don't see a place for that option in the Makefile. I think that goes on: CPPFLAGS+= to become: -I/usr/include/readline -I${LOCALBASE}/include -g I think. I have not tested that. Which file would this go into? After 'make patch', running 'grep -R LOCALBASE *' from the root of the port returns nothing. Make that change in /usr/ports/sysutils/bacula-server/Makefile Yes, bacula-server, not a typo. bacula-client is s slave port of bacula-server. and that the binaries are not stripped (default for Bacula Makefiles, but not for the FreeBSD ports system). Looking in /usr/ports/Mk/bsd.port.mk, I think you want WITH_DEBUG which I think you can add to the OPTIONS_DEFINE line. What's the procedure here? Is it (1) to uncomment WITH_DEBUG in /usr/ports/Mk/bsd.port.mk; and (2) to change the Makefile to OPTIONS_DEFINE= NLS OPENSSL PYTHON WITH_DEBUG Make those changes to OPTIONS_DEFINE /usr/ports/sysutils/bacula-server/Makefile as well. I suggest deleting all bacula packages on this client. Then make clean, and make install in the bacula-client dir. ?? thanks dn That's step 3. Sorry, don't know how to do that either. Also, I do have bacula-fd running fine on other FreeBSD 9.2 systems. The only delta AFAIK is that this is an i386 system and the others are amd64. To review: 1. Backup jobs complete when manually starting bacula-fd. What command are you entering? /usr/local/sbin/bacula-fd 2. Backup jobs do not complete when launching bacula-fd via the startup script in /usr/local/etc/rc.d/bacula-fd. For example: usr/local/etc/rc.d/bacula-fd start ? Yes: /usr/local/etc/rc.d/bacula-fd start # note leading stroke Thanks dn Thanks in advance for further debugging clues. dn If any of you are FreeBSD system gurus you might compare the last known working version of the OS with 9.2, particularly the pthreads routines. Perhaps they are using a signal 0 internally, and somehow that leaked back to Bacula. Best regards, Kern On 10/18/2013 01:29 AM, David Newman wrote: On 10/17/13 5:33 AM, Martin Simmons wrote: On Wed, 16 Oct 2013 12:13:26 -0700, David Newman said: On 10/14/13 2:44 AM, Martin Simmons wrote: On Sun, 13 Oct 2013 18:25:07 -0700, David Newman said: On 10/9/13 4:41 PM, David Newman wrote: FreeBSD 9.2-RELEASE, bacula-client-5.2.12_3 installed from ports Ever since upgrading this host to FreeBSD 9.2, bacula-fd crashes as soon as bacula-dir starts a backup job. The entry in /var/log/messages is: Oct 9 16:25:50 o bacula-fd: Bacula
[Bacula-users] Backup error - waiting on FD
Did you check that name resolution is working ok. I don't know if you are using DNS or hosts files. Also make sure you can connect on port 9103. +-- |This was sent by adolfo.oliv...@oliverasistemas.com via Backup Central. |Forward SPAM to ab...@backupcentral.com. +-- -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users