Hi,

16.11.2009 16:01, Kern Sibbald wrote:
> Hello Arno,
>
> We will take a careful look at this once we are back from the US. As far as I
> can tell, this is something that could easily happen after you have done a
> cancel sometime earlier.

Quite possible - with those long-running copy jobs, I've got lots of automatically cancelled jobs in between.

> Could you please try to reproduce it on the current 3.1.5 version? I have
> fixed several race conditions, and hopefully that will fix this problem.

Yup - upgrade to a current development version is scheduled to happen as soon as possible, but some guys keep me quite busy these days ;-)

Cheers,

Arno

> Many thanks,
>
> Kern
>
> On Monday 16 November 2009 15:25:47 Arno Lehmann wrote:
>> Hello,
>>
>> I mentioned this about a month ago... the SD of 3.1.4 crashes on
>> 'update slots scan' command.
>>
>> The usual procedure:
>> - All volumes in the autochanger used
>> - Swapped the tapes
>> - ran 'update slots scan storage=sd-name drive=0'
>>
>> I got this in bconsole:
>>
>> *update slots scan storage=QuantumDLT drive=0
>> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
>> 3306 Issuing autochanger "slots" command.
>> Device "QuantumDLT" has 7 slots.
>> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
>> 3306 Issuing autochanger "list" command.
>> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
>> 3301 Issuing autochanger "loaded? drive 0" command.
>> 3302 Autochanger "loaded? drive 0", result: nothing loaded.
>> 3304 Issuing autochanger "load slot 1, drive 0" command.
>> 3305 Autochanger "load slot 1, drive 0", status is OK.
>> Jmsg Job=*System* type=4 level=1258365033 gnom-sd: ERROR in block.c:946 Failed ASSERT: dev->is_open()
>> No VolName for Slot=1 InChanger set to zero.
>> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
>> bac...@gnom:~> su -
>> Passwort:
>> gnom:~ # /etc/rc.d/bacula-ctl-sd start
>> Starting the Bacula Storage daemon
>>
>> And below the traceback:
>>
>> -------- Original-Nachricht --------
>> From: [email protected]
>> Subject: Bacula GDB traceback of bacula-sd on gnom
>> Sender: [email protected]
>> To: [email protected]
>> Date: Mon, 16 Nov 2009 10:50:36 +0100 (CET)
>>
>> [Thread debugging using libthread_db enabled]
>> [New Thread 0xb4875b90 (LWP 21804)]
>> [New Thread 0xb6879b90 (LWP 10741)]
>> [New Thread 0xb70afb90 (LWP 3176)]
>> 0xffffe430 in __kernel_vsyscall ()
>> $1 = '\0' <repeats 29 times>
>> $2 = 0x809c7d0 "bacula-sd"
>> $3 = 0x809c7f8 "/opt/bacula/sbin/bacula-sd"
>> $4 = 0x0
>> $5 = 0xb7f11a62 "3.1.4 (28 September 2009)"
>> $6 = 0xb7f11a7c "i686-pc-linux-gnu"
>> $7 = 0xb7f11a8e "suse"
>> $8 = 0xb7f11a93 "11.1"
>> $9 = "gnom", '\0' <repeats 45 times>
>> #0 0xffffe430 in __kernel_vsyscall ()
>> #1 0xb79bc411 in select () from /lib/libc.so.6
>> #2 0xb7edc7b6 in bnet_thread_server (addrs=0x809d0a0, max_clients=41, client_wq=0x809ae20, handle_client_request=0x80686d0 <handle_connection_request(void*)>) at bnet_server.c:161
>> #3 0x0804ec43 in main (argc=<value optimized out>, argv=0x0) at stored.c:306
>>
>> Thread 4 (Thread 0xb70afb90 (LWP 3176)):
>> #0 0xffffe430 in __kernel_vsyscall ()
>> #1 0xb7d3bf62 in pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib/libpthread.so.0
>> #2 0xb7f06c06 in watchdog_thread (arg=0x0) at watchdog.c:308
>> #3 0xb7f0ce78 in lmgr_thread_launcher (x=0x80c8390) at lockmgr.c:714
>> #4 0xb7d381b5 in start_thread () from /lib/libpthread.so.0
>> #5 0xb79c33be in clone () from /lib/libc.so.6
>>
>> Thread 3 (Thread 0xb6879b90 (LWP 10741)):
>> #0 0xffffe430 in __kernel_vsyscall ()
>> #1 0xb7d3f3f6 in nanosleep () from /lib/libpthread.so.0
>> #2 0xb7ed9b00 in bmicrosleep (sec=5, usec=0) at bsys.c:61
>> #3 0x08062c27 in DEVICE::open_tape_device (this=0x80af380, dcr=0x812a540, omode=2) at dev.c:470
>> #4 0x0806466b in DEVICE::open (this=0x80af380, dcr=0x812a540, omode=2) at dev.c:356
>> #5 0x0807710a in DCR::mount_next_write_volume (this=0x812a540) at mount.c:194
>> #6 0x08067240 in fixup_device_block_write_error (dcr=0x812a540) at device.c:127
>> #7 0x0805e5db in write_block_to_device (dcr=0x812a540) at block.c:388
>> #8 0x08082296 in despool_data (dcr=0x812a540, commit=false) at spool.c:299
>> #9 0x08082abb in write_block_to_spool_file (dcr=0x812a540) at spool.c:478
>> #10 0x08073a7e in record_cb (dcr=0x8129f00, rec=0x80abcc0) at mac.c:245
>> #11 0x0807af8c in read_records (dcr=0x8129f00, record_cb=0x80739e0 <record_cb>, mount_cb=0x8075530 <mount_next_read_volume(DCR*)>) at read_record.c:287
>> #12 0x08073848 in do_mac (jcr=0x80ab910) at mac.c:115
>> #13 0x0806eb00 in run_cmd (jcr=0x80ab910) at job.c:191
>> #14 0x08068aa3 in handle_connection_request (arg=0x80c1218) at dircmd.c:233
>> #15 0xb7f0735a in workq_server (arg=0x809ae20) at workq.c:346
>> #16 0xb7f0ce78 in lmgr_thread_launcher (x=0x80ac230) at lockmgr.c:714
>> #17 0xb7d381b5 in start_thread () from /lib/libpthread.so.0
>> #18 0xb79c33be in clone () from /lib/libc.so.6
>>
>> Thread 2 (Thread 0xb4875b90 (LWP 21804)):
>> #0 0xffffe430 in __kernel_vsyscall ()
>> #1 0xb7d3fbbb in waitpid () from /lib/libpthread.so.0
>> #2 0xb7efcd96 in signal_handler (sig=11) at signal.c:210
>> #3 <signal handler called>
>> #4 0x0805c35e in read_block_from_dev (dcr=0x812d638, check_block_numbers=false) at block.c:946
>> #5 0x080721ca in read_dev_volume_label (dcr=0x812d638) at label.c:131
>> #6 0x08068337 in read_volume_label (jcr=0x80abd30, dcr=0x812d638, dev=0x80af380, Slot=1) at dircmd.c:1123
>> #7 0x08069382 in readlabel_cmd (jcr=0x80abd30) at dircmd.c:1081
>> #8 0x08068aa3 in handle_connection_request (arg=0x81a6080) at dircmd.c:233
>> #9 0xb7f0735a in workq_server (arg=0x809ae20) at workq.c:346
>> #10 0xb7f0ce78 in lmgr_thread_launcher (x=0x80b8008) at lockmgr.c:714
>> #11 0xb7d381b5 in start_thread () from /lib/libpthread.so.0
>> #12 0xb79c33be in clone () from /lib/libc.so.6
>>
>> Thread 1 (Thread 0xb78ea8e0 (LWP 3169)):
>> #0 0xffffe430 in __kernel_vsyscall ()
>> #1 0xb79bc411 in select () from /lib/libc.so.6
>> #2 0xb7edc7b6 in bnet_thread_server (addrs=0x809d0a0, max_clients=41, client_wq=0x809ae20, handle_client_request=0x80686d0 <handle_connection_request(void*)>) at bnet_server.c:161
>> #3 0x0804ec43 in main (argc=<value optimized out>, argv=0x0) at stored.c:306
>> #0 0xffffe430 in __kernel_vsyscall ()
>> No symbol table info available.
>> #1 0xb79bc411 in select () from /lib/libc.so.6
>> No symbol table info available.
>> #2 0xb7edc7b6 in bnet_thread_server (addrs=0x809d0a0, max_clients=41, client_wq=0x809ae20, handle_client_request=0x80686d0 <handle_connection_request(void*)>) at bnet_server.c:161
>> 161 if ((stat = select(maxfd + 1, &sockset, NULL, NULL, NULL)) < 0) {
>> Current language: auto; currently c++
>> maxfd = 4
>> sockset = {fds_bits = {16, 0 <repeats 31 times>}}
>> newsockfd = 6
>> stat = 0
>> clilen = 16
>> cli_addr = {sa_family = 2, sa_data = "Ø\vÀš\000\t\000\000\000\000\000\000\000"}
>> tlog = 0
>> turnon = 1
>> p = (IPADDR *) 0x0
>> fd_ptr = (s_sockfd *) 0x0
>> buf =
>> "192.168.0.9\0005\000\000·`Lô·\\¬Ó·\001\000\000\000\005\000\000\000\000\000
>> \000\000\001\000\000\000\000\220ì·d\216º¿ôoñ·ð\210ñ·¶Iñ·x\216º¿\006Èð·ð\210ñ
>> ·\000\000\000\000\210êì·šJô·ð\210ñ·ÿÿÿÿô\237÷·xŠ÷·\234\234\004\bÀ\216º¿F\230
>> ö·0š÷·\000\000\000\000\001\000\000\000\005\000\000"
>> sockfds = {<SMARTALLOC> = {<No data fields>}, head = 0xbfba8290, tail = 0xbfba8290, loffset = 0, num_items = 1}
>> allbuf =
>> "<\203º¿ö\220ö·šEö·éC\220·äÒ\v·\000\000\000\000\000\000\000\000
>> Ù\n\b\006\000º¿\030§ö·²Ò\v·\\Ø\n\bô\237÷·
>> \203º¿ÔØ\n\bÔ\203º¿÷Iö·Ôš\216·\000\000\000\000\016\a\226·\000\000\000\000ا
>> ··ô\237÷·x±€·0\000\000\000\200^÷·ÐeÎmó\003\000\000.sn\003\005\000\000\0008]\
>> 217·0W\217·\003", '\0' <repeats 15 times>,
>> "j\a\000\000
>> Ù\n\bا··DË\v·øë\217·tÇ\v·\000\000\000\000ô\237÷·øÔ\n\bÐeÎmš\204º¿)Kö·\230\
>> 204º¿tÇ\v·\214\204º¿\234"...
>> #3 0x0804ec43 in main (argc=<value optimized out>, argv=0x0) at stored.c:306
>> 306 &dird_workq, handle_connection_request);
>> ch = <value optimized out>
>> no_signals = false
>> test_config = false
>> thid = 3079576464
>> uid = 0xbfbaadcf "backup"
>> gid = 0xbfbaadd9 "backup"
>> python_args = {progname = 0x809d0d0 "gnom-sd", scriptdir = 0x0,
>> modulename = 0x8089495 "SDStartUp",
>> configfile = 0x809c978 "/opt/bacula/etc/bacula-sd.conf", workingdir = 0x809d128 "/var/bacula/working",
>> job_getattr = 0x8079f60 <job_getattr(_object*, char*)>, job_setattr = 0x8079db0 <job_setattr(_object*, char*, _object*)>}
>> #0 0x00000000 in ?? ()
>> No symbol table info available.
>> #0 0x00000000 in ?? ()
>> No symbol table info available.
>> #0 0x00000000 in ?? ()
>> No symbol table info available.
>> #0 0x00000000 in ?? ()
>> No symbol table info available.
>>
>>
>>
>> Actually, I don't know if this is still interesting, as the current
>> development has progressed a bit, but I promised the data, so here it
>> is :-)
>>
>> Cheers,
>>
>> Arno
>
>
>

--
Arno Lehmann
IT-Service Lehmann
Sandstr. 6, 49080 Osnabrück
www.its-lehmann.de
