Hello Arno,

We will take a careful look at this once we are back from the US.  As far as I 
can tell, this is something that could easily happen after you have done a 
cancel sometime earlier.

Could you please try to reproduce it on the current 3.1.5 version?  I have 
fixed several race conditions, and hopefully that will fix this problem.

Many thanks,

Kern

On Monday 16 November 2009 15:25:47 Arno Lehmann wrote:
> Hello,
>
> I mentoned this about a month ago... the SD of 3.1.4 crashes on
> 'update slots scan' command.
>
> The usual procedure:
> - All volumes in the autochanger used
> - Swapped the tapes
> - ran 'update slots scan storage=sd-name drive=0'
>
> I got this in bconsole:
>
> *update slots scan storage=QuantumDLT drive=0
> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
> 3306 Issuing autochanger "slots" command.
> Device "QuantumDLT" has 7 slots.
> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
> 3306 Issuing autochanger "list" command.
> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
> 3301 Issuing autochanger "loaded? drive 0" command.
> 3302 Autochanger "loaded? drive 0", result: nothing loaded.
> 3304 Issuing autochanger "load slot 1, drive 0" command.
> 3305 Autochanger "load slot 1, drive 0", status is OK.
> Jmsg Job=*System* type=4 level=1258365033 gnom-sd: ERROR in
> block.c:946 Failed ASSERT: dev->is_open()
> No VolName for Slot=1 InChanger set to zero.
> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
> bac...@gnom:~> su -
> Passwort:
> gnom:~ # /etc/rc.d/bacula-ctl-sd start
> Starting the Bacula Storage daemon
>
> And below the traceback:
>
> -------- Original-Nachricht --------
> From: [email protected]
> Subject: Bacula GDB traceback of bacula-sd on gnom
> Sender: [email protected]
> To: [email protected]
> Date: Mon, 16 Nov 2009 10:50:36 +0100 (CET)
>
> [Thread debugging using libthread_db enabled]
> [New Thread 0xb4875b90 (LWP 21804)]
> [New Thread 0xb6879b90 (LWP 10741)]
> [New Thread 0xb70afb90 (LWP 3176)]
> 0xffffe430 in __kernel_vsyscall ()
> $1 = '\0' <repeats 29 times>
> $2 = 0x809c7d0 "bacula-sd"
> $3 = 0x809c7f8 "/opt/bacula/sbin/bacula-sd"
> $4 = 0x0
> $5 = 0xb7f11a62 "3.1.4 (28 September 2009)"
> $6 = 0xb7f11a7c "i686-pc-linux-gnu"
> $7 = 0xb7f11a8e "suse"
> $8 = 0xb7f11a93 "11.1"
> $9 = "gnom", '\0' <repeats 45 times>
> #0  0xffffe430 in __kernel_vsyscall ()
> #1  0xb79bc411 in select () from /lib/libc.so.6
> #2  0xb7edc7b6 in bnet_thread_server (addrs=0x809d0a0, max_clients=41,
> client_wq=0x809ae20,
>      handle_client_request=0x80686d0
> <handle_connection_request(void*)>) at bnet_server.c:161
> #3  0x0804ec43 in main (argc=<value optimized out>, argv=0x0) at
> stored.c:306
>
> Thread 4 (Thread 0xb70afb90 (LWP 3176)):
> #0  0xffffe430 in __kernel_vsyscall ()
> #1  0xb7d3bf62 in pthread_cond_timedwait@@GLIBC_2.3.2 () from
> /lib/libpthread.so.0
> #2  0xb7f06c06 in watchdog_thread (arg=0x0) at watchdog.c:308
> #3  0xb7f0ce78 in lmgr_thread_launcher (x=0x80c8390) at lockmgr.c:714
> #4  0xb7d381b5 in start_thread () from /lib/libpthread.so.0
> #5  0xb79c33be in clone () from /lib/libc.so.6
>
> Thread 3 (Thread 0xb6879b90 (LWP 10741)):
> #0  0xffffe430 in __kernel_vsyscall ()
> #1  0xb7d3f3f6 in nanosleep () from /lib/libpthread.so.0
> #2  0xb7ed9b00 in bmicrosleep (sec=5, usec=0) at bsys.c:61
> #3  0x08062c27 in DEVICE::open_tape_device (this=0x80af380,
> dcr=0x812a540, omode=2) at dev.c:470
> #4  0x0806466b in DEVICE::open (this=0x80af380, dcr=0x812a540,
> omode=2) at dev.c:356
> #5  0x0807710a in DCR::mount_next_write_volume (this=0x812a540) at
> mount.c:194
> #6  0x08067240 in fixup_device_block_write_error (dcr=0x812a540) at
> device.c:127
> #7  0x0805e5db in write_block_to_device (dcr=0x812a540) at block.c:388
> #8  0x08082296 in despool_data (dcr=0x812a540, commit=false) at
> spool.c:299
> #9  0x08082abb in write_block_to_spool_file (dcr=0x812a540) at spool.c:478
> #10 0x08073a7e in record_cb (dcr=0x8129f00, rec=0x80abcc0) at mac.c:245
> #11 0x0807af8c in read_records (dcr=0x8129f00, record_cb=0x80739e0
> <record_cb>, mount_cb=0x8075530 <mount_next_read_volume(DCR*)>) at
> read_record.c:287
> #12 0x08073848 in do_mac (jcr=0x80ab910) at mac.c:115
> #13 0x0806eb00 in run_cmd (jcr=0x80ab910) at job.c:191
> #14 0x08068aa3 in handle_connection_request (arg=0x80c1218) at
> dircmd.c:233
> #15 0xb7f0735a in workq_server (arg=0x809ae20) at workq.c:346
> #16 0xb7f0ce78 in lmgr_thread_launcher (x=0x80ac230) at lockmgr.c:714
> #17 0xb7d381b5 in start_thread () from /lib/libpthread.so.0
> #18 0xb79c33be in clone () from /lib/libc.so.6
>
> Thread 2 (Thread 0xb4875b90 (LWP 21804)):
> #0  0xffffe430 in __kernel_vsyscall ()
> #1  0xb7d3fbbb in waitpid () from /lib/libpthread.so.0
> #2  0xb7efcd96 in signal_handler (sig=11) at signal.c:210
> #3  <signal handler called>
> #4  0x0805c35e in read_block_from_dev (dcr=0x812d638,
> check_block_numbers=false) at block.c:946
> #5  0x080721ca in read_dev_volume_label (dcr=0x812d638) at label.c:131
> #6  0x08068337 in read_volume_label (jcr=0x80abd30, dcr=0x812d638,
> dev=0x80af380, Slot=1) at dircmd.c:1123
> #7  0x08069382 in readlabel_cmd (jcr=0x80abd30) at dircmd.c:1081
> #8  0x08068aa3 in handle_connection_request (arg=0x81a6080) at
> dircmd.c:233
> #9  0xb7f0735a in workq_server (arg=0x809ae20) at workq.c:346
> #10 0xb7f0ce78 in lmgr_thread_launcher (x=0x80b8008) at lockmgr.c:714
> #11 0xb7d381b5 in start_thread () from /lib/libpthread.so.0
> #12 0xb79c33be in clone () from /lib/libc.so.6
>
> Thread 1 (Thread 0xb78ea8e0 (LWP 3169)):
> #0  0xffffe430 in __kernel_vsyscall ()
> #1  0xb79bc411 in select () from /lib/libc.so.6
> #2  0xb7edc7b6 in bnet_thread_server (addrs=0x809d0a0, max_clients=41,
> client_wq=0x809ae20,
>      handle_client_request=0x80686d0
> <handle_connection_request(void*)>) at bnet_server.c:161
> #3  0x0804ec43 in main (argc=<value optimized out>, argv=0x0) at
> stored.c:306
> #0  0xffffe430 in __kernel_vsyscall ()
> No symbol table info available.
> #1  0xb79bc411 in select () from /lib/libc.so.6
> No symbol table info available.
> #2  0xb7edc7b6 in bnet_thread_server (addrs=0x809d0a0, max_clients=41,
> client_wq=0x809ae20,
>      handle_client_request=0x80686d0
> <handle_connection_request(void*)>) at bnet_server.c:161
> 161         if ((stat = select(maxfd + 1, &sockset, NULL, NULL, NULL)) <
> 0) {
> Current language:  auto; currently c++
> maxfd = 4
> sockset = {fds_bits = {16, 0 <repeats 31 times>}}
> newsockfd = 6
> stat = 0
> clilen = 16
> cli_addr = {sa_family = 2, sa_data =
> "Ø\vÀš\000\t\000\000\000\000\000\000\000"}
> tlog = 0
> turnon = 1
> p = (IPADDR *) 0x0
> fd_ptr = (s_sockfd *) 0x0
> buf =
> "192.168.0.9\0005\000\000·`Lô·\\¬Ó·\001\000\000\000\005\000\000\000\000\000
>\000\000\001\000\000\000\000\220ì·d\216º¿ôoñ·ð\210ñ·¶Iñ·x\216º¿\006Èð·ð\210ñ
>·\000\000\000\000\210êì·šJô·ð\210ñ·ÿÿÿÿô\237÷·xŠ÷·\234\234\004\bÀ\216º¿F\230
>ö·0š÷·\000\000\000\000\001\000\000\000\005\000\000" sockfds = {<SMARTALLOC>
> = {<No data fields>}, head = 0xbfba8290, tail = 0xbfba8290, loffset = 0,
> num_items = 1}
> allbuf =
> "<\203º¿ö\220ö·šEö·éC\220·äÒ\v·\000\000\000\000\000\000\000\000
> Ù\n\b\006\000º¿\030§ö·²Ò\v·\\Ø\n\bô\237÷·
> \203º¿ÔØ\n\bÔ\203º¿÷Iö·Ôš\216·\000\000\000\000\016\a\226·\000\000\000\000ا
>··ô\237÷·x±€·0\000\000\000\200^÷·ÐeÎmó\003\000\000.sn\003\005\000\000\0008]\
>217·0W\217·\003", '\0' <repeats 15 times>,
> "j\a\000\000
> Ù\n\bا··DË\v·øë\217·tÇ\v·\000\000\000\000ô\237÷·øÔ\n\bÐeÎmš\204º¿)Kö·\230\
>204º¿tÇ\v·\214\204º¿\234"... #3  0x0804ec43 in main (argc=<value optimized
> out>, argv=0x0) at
> stored.c:306
> 306                         &dird_workq, handle_connection_request);
> ch = <value optimized out>
> no_signals = false
> test_config = false
> thid = 3079576464
> uid = 0xbfbaadcf "backup"
> gid = 0xbfbaadd9 "backup"
> python_args = {progname = 0x809d0d0 "gnom-sd", scriptdir = 0x0,
> modulename = 0x8089495 "SDStartUp",
>    configfile = 0x809c978 "/opt/bacula/etc/bacula-sd.conf", workingdir
> = 0x809d128 "/var/bacula/working",
>    job_getattr = 0x8079f60 <job_getattr(_object*, char*)>, job_setattr
> = 0x8079db0 <job_setattr(_object*, char*, _object*)>}
> #0  0x00000000 in ?? ()
> No symbol table info available.
> #0  0x00000000 in ?? ()
> No symbol table info available.
> #0  0x00000000 in ?? ()
> No symbol table info available.
> #0  0x00000000 in ?? ()
> No symbol table info available.
>
>
>
> Actually, I don't know if this is still interesting, as the current
> development has progressed a bit, but I promised the data, so here it
> is :-)
>
> Cheers,
>
> Arno



------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
Bacula-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/bacula-devel

Reply via email to