Hi,

16.11.2009 16:01, Kern Sibbald wrote:
> Hello Arno,
> 
> We will take a careful look at this once we are back from the US.  As far as 
> I 
> can tell, this is something that could easily happen after you have done a 
> cancel sometime earlier.

Quite possible - with those long-running copy jobs, I've got lots of 
automatically cancelled jobs in between.

> Could you please try to reproduce it on the current 3.1.5 version?  I have 
> fixed several race conditions, and hopefully that will fix this problem.

Yup - upgrade to a current development version is scheduled to happen 
as soon as possible, but some guys keep me quite busy these days ;-)

Cheers,

Arno

> Many thanks,
> 
> Kern
> 
> On Monday 16 November 2009 15:25:47 Arno Lehmann wrote:
>> Hello,
>>
>> I mentoned this about a month ago... the SD of 3.1.4 crashes on
>> 'update slots scan' command.
>>
>> The usual procedure:
>> - All volumes in the autochanger used
>> - Swapped the tapes
>> - ran 'update slots scan storage=sd-name drive=0'
>>
>> I got this in bconsole:
>>
>> *update slots scan storage=QuantumDLT drive=0
>> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
>> 3306 Issuing autochanger "slots" command.
>> Device "QuantumDLT" has 7 slots.
>> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
>> 3306 Issuing autochanger "list" command.
>> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
>> 3301 Issuing autochanger "loaded? drive 0" command.
>> 3302 Autochanger "loaded? drive 0", result: nothing loaded.
>> 3304 Issuing autochanger "load slot 1, drive 0" command.
>> 3305 Autochanger "load slot 1, drive 0", status is OK.
>> Jmsg Job=*System* type=4 level=1258365033 gnom-sd: ERROR in
>> block.c:946 Failed ASSERT: dev->is_open()
>> No VolName for Slot=1 InChanger set to zero.
>> Connecting to Storage daemon QuantumDLT at gnom:9103 ...
>> bac...@gnom:~> su -
>> Passwort:
>> gnom:~ # /etc/rc.d/bacula-ctl-sd start
>> Starting the Bacula Storage daemon
>>
>> And below the traceback:
>>
>> -------- Original-Nachricht --------
>> From: [email protected]
>> Subject: Bacula GDB traceback of bacula-sd on gnom
>> Sender: [email protected]
>> To: [email protected]
>> Date: Mon, 16 Nov 2009 10:50:36 +0100 (CET)
>>
>> [Thread debugging using libthread_db enabled]
>> [New Thread 0xb4875b90 (LWP 21804)]
>> [New Thread 0xb6879b90 (LWP 10741)]
>> [New Thread 0xb70afb90 (LWP 3176)]
>> 0xffffe430 in __kernel_vsyscall ()
>> $1 = '\0' <repeats 29 times>
>> $2 = 0x809c7d0 "bacula-sd"
>> $3 = 0x809c7f8 "/opt/bacula/sbin/bacula-sd"
>> $4 = 0x0
>> $5 = 0xb7f11a62 "3.1.4 (28 September 2009)"
>> $6 = 0xb7f11a7c "i686-pc-linux-gnu"
>> $7 = 0xb7f11a8e "suse"
>> $8 = 0xb7f11a93 "11.1"
>> $9 = "gnom", '\0' <repeats 45 times>
>> #0  0xffffe430 in __kernel_vsyscall ()
>> #1  0xb79bc411 in select () from /lib/libc.so.6
>> #2  0xb7edc7b6 in bnet_thread_server (addrs=0x809d0a0, max_clients=41,
>> client_wq=0x809ae20,
>>      handle_client_request=0x80686d0
>> <handle_connection_request(void*)>) at bnet_server.c:161
>> #3  0x0804ec43 in main (argc=<value optimized out>, argv=0x0) at
>> stored.c:306
>>
>> Thread 4 (Thread 0xb70afb90 (LWP 3176)):
>> #0  0xffffe430 in __kernel_vsyscall ()
>> #1  0xb7d3bf62 in pthread_cond_timedwait@@GLIBC_2.3.2 () from
>> /lib/libpthread.so.0
>> #2  0xb7f06c06 in watchdog_thread (arg=0x0) at watchdog.c:308
>> #3  0xb7f0ce78 in lmgr_thread_launcher (x=0x80c8390) at lockmgr.c:714
>> #4  0xb7d381b5 in start_thread () from /lib/libpthread.so.0
>> #5  0xb79c33be in clone () from /lib/libc.so.6
>>
>> Thread 3 (Thread 0xb6879b90 (LWP 10741)):
>> #0  0xffffe430 in __kernel_vsyscall ()
>> #1  0xb7d3f3f6 in nanosleep () from /lib/libpthread.so.0
>> #2  0xb7ed9b00 in bmicrosleep (sec=5, usec=0) at bsys.c:61
>> #3  0x08062c27 in DEVICE::open_tape_device (this=0x80af380,
>> dcr=0x812a540, omode=2) at dev.c:470
>> #4  0x0806466b in DEVICE::open (this=0x80af380, dcr=0x812a540,
>> omode=2) at dev.c:356
>> #5  0x0807710a in DCR::mount_next_write_volume (this=0x812a540) at
>> mount.c:194
>> #6  0x08067240 in fixup_device_block_write_error (dcr=0x812a540) at
>> device.c:127
>> #7  0x0805e5db in write_block_to_device (dcr=0x812a540) at block.c:388
>> #8  0x08082296 in despool_data (dcr=0x812a540, commit=false) at
>> spool.c:299
>> #9  0x08082abb in write_block_to_spool_file (dcr=0x812a540) at spool.c:478
>> #10 0x08073a7e in record_cb (dcr=0x8129f00, rec=0x80abcc0) at mac.c:245
>> #11 0x0807af8c in read_records (dcr=0x8129f00, record_cb=0x80739e0
>> <record_cb>, mount_cb=0x8075530 <mount_next_read_volume(DCR*)>) at
>> read_record.c:287
>> #12 0x08073848 in do_mac (jcr=0x80ab910) at mac.c:115
>> #13 0x0806eb00 in run_cmd (jcr=0x80ab910) at job.c:191
>> #14 0x08068aa3 in handle_connection_request (arg=0x80c1218) at
>> dircmd.c:233
>> #15 0xb7f0735a in workq_server (arg=0x809ae20) at workq.c:346
>> #16 0xb7f0ce78 in lmgr_thread_launcher (x=0x80ac230) at lockmgr.c:714
>> #17 0xb7d381b5 in start_thread () from /lib/libpthread.so.0
>> #18 0xb79c33be in clone () from /lib/libc.so.6
>>
>> Thread 2 (Thread 0xb4875b90 (LWP 21804)):
>> #0  0xffffe430 in __kernel_vsyscall ()
>> #1  0xb7d3fbbb in waitpid () from /lib/libpthread.so.0
>> #2  0xb7efcd96 in signal_handler (sig=11) at signal.c:210
>> #3  <signal handler called>
>> #4  0x0805c35e in read_block_from_dev (dcr=0x812d638,
>> check_block_numbers=false) at block.c:946
>> #5  0x080721ca in read_dev_volume_label (dcr=0x812d638) at label.c:131
>> #6  0x08068337 in read_volume_label (jcr=0x80abd30, dcr=0x812d638,
>> dev=0x80af380, Slot=1) at dircmd.c:1123
>> #7  0x08069382 in readlabel_cmd (jcr=0x80abd30) at dircmd.c:1081
>> #8  0x08068aa3 in handle_connection_request (arg=0x81a6080) at
>> dircmd.c:233
>> #9  0xb7f0735a in workq_server (arg=0x809ae20) at workq.c:346
>> #10 0xb7f0ce78 in lmgr_thread_launcher (x=0x80b8008) at lockmgr.c:714
>> #11 0xb7d381b5 in start_thread () from /lib/libpthread.so.0
>> #12 0xb79c33be in clone () from /lib/libc.so.6
>>
>> Thread 1 (Thread 0xb78ea8e0 (LWP 3169)):
>> #0  0xffffe430 in __kernel_vsyscall ()
>> #1  0xb79bc411 in select () from /lib/libc.so.6
>> #2  0xb7edc7b6 in bnet_thread_server (addrs=0x809d0a0, max_clients=41,
>> client_wq=0x809ae20,
>>      handle_client_request=0x80686d0
>> <handle_connection_request(void*)>) at bnet_server.c:161
>> #3  0x0804ec43 in main (argc=<value optimized out>, argv=0x0) at
>> stored.c:306
>> #0  0xffffe430 in __kernel_vsyscall ()
>> No symbol table info available.
>> #1  0xb79bc411 in select () from /lib/libc.so.6
>> No symbol table info available.
>> #2  0xb7edc7b6 in bnet_thread_server (addrs=0x809d0a0, max_clients=41,
>> client_wq=0x809ae20,
>>      handle_client_request=0x80686d0
>> <handle_connection_request(void*)>) at bnet_server.c:161
>> 161        if ((stat = select(maxfd + 1, &sockset, NULL, NULL, NULL)) <
>> 0) {
>> Current language:  auto; currently c++
>> maxfd = 4
>> sockset = {fds_bits = {16, 0 <repeats 31 times>}}
>> newsockfd = 6
>> stat = 0
>> clilen = 16
>> cli_addr = {sa_family = 2, sa_data =
>> "Ø\vÀš\000\t\000\000\000\000\000\000\000"}
>> tlog = 0
>> turnon = 1
>> p = (IPADDR *) 0x0
>> fd_ptr = (s_sockfd *) 0x0
>> buf =
>> "192.168.0.9\0005\000\000·`Lô·\\¬Ó·\001\000\000\000\005\000\000\000\000\000
>> \000\000\001\000\000\000\000\220ì·d\216º¿ôoñ·ð\210ñ·¶Iñ·x\216º¿\006Èð·ð\210ñ
>> ·\000\000\000\000\210êì·šJô·ð\210ñ·ÿÿÿÿô\237÷·xŠ÷·\234\234\004\bÀ\216º¿F\230
>> ö·0š÷·\000\000\000\000\001\000\000\000\005\000\000" sockfds = {<SMARTALLOC>
>> = {<No data fields>}, head = 0xbfba8290, tail = 0xbfba8290, loffset = 0,
>> num_items = 1}
>> allbuf =
>> "<\203º¿ö\220ö·šEö·éC\220·äÒ\v·\000\000\000\000\000\000\000\000
>> Ù\n\b\006\000º¿\030§ö·²Ò\v·\\Ø\n\bô\237÷·
>> \203º¿ÔØ\n\bÔ\203º¿÷Iö·Ôš\216·\000\000\000\000\016\a\226·\000\000\000\000ا
>> ··ô\237÷·x±€·0\000\000\000\200^÷·ÐeÎmó\003\000\000.sn\003\005\000\000\0008]\
>> 217·0W\217·\003", '\0' <repeats 15 times>,
>> "j\a\000\000
>> Ù\n\bا··DË\v·øë\217·tÇ\v·\000\000\000\000ô\237÷·øÔ\n\bÐeÎmš\204º¿)Kö·\230\
>> 204º¿tÇ\v·\214\204º¿\234"... #3  0x0804ec43 in main (argc=<value optimized
>> out>, argv=0x0) at
>> stored.c:306
>> 306                        &dird_workq, handle_connection_request);
>> ch = <value optimized out>
>> no_signals = false
>> test_config = false
>> thid = 3079576464
>> uid = 0xbfbaadcf "backup"
>> gid = 0xbfbaadd9 "backup"
>> python_args = {progname = 0x809d0d0 "gnom-sd", scriptdir = 0x0,
>> modulename = 0x8089495 "SDStartUp",
>>    configfile = 0x809c978 "/opt/bacula/etc/bacula-sd.conf", workingdir
>> = 0x809d128 "/var/bacula/working",
>>    job_getattr = 0x8079f60 <job_getattr(_object*, char*)>, job_setattr
>> = 0x8079db0 <job_setattr(_object*, char*, _object*)>}
>> #0  0x00000000 in ?? ()
>> No symbol table info available.
>> #0  0x00000000 in ?? ()
>> No symbol table info available.
>> #0  0x00000000 in ?? ()
>> No symbol table info available.
>> #0  0x00000000 in ?? ()
>> No symbol table info available.
>>
>>
>>
>> Actually, I don't know if this is still interesting, as the current
>> development has progressed a bit, but I promised the data, so here it
>> is :-)
>>
>> Cheers,
>>
>> Arno
> 
> 
> 

-- 
Arno Lehmann
IT-Service Lehmann
Sandstr. 6, 49080 Osnabrück
www.its-lehmann.de

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
Bacula-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/bacula-devel

Reply via email to