Oh, just one last note:

Bacula does work perfectly well on 

RedHat 7.1 (Seawolf) kernel version 2.4.20-28.7

but on that machine, the HAVE_POSIX_FADVISE is turned off automatically by 
the ./configure process.

Perhaps you did not start with a clean build directory (i.e. you built it 
first on some other system and later copied the directory).  ./configure has 
the habit of caching the config process, which could explain why your 
HAVE_POSIX_FDADVISE is improperly set for your system.  

One must be careful to either start from a fresh tar file or do a:

  make distclean

before trying to re-run a ./configure on a different architecture OS version.

Regards,

Kern

On Thursday 11 October 2007 23:39, [EMAIL PROTECTED] wrote:
> In the message dated: Tue, 09 Oct 2007 15:55:58 +0200,
> The pithy ruminations from Kern Sibbald on
> <Re: [Bacula-devel] bacula-sd 2.2.4 goes kaboom! (segfault on despooling
> data)> were:
> => Hello,
>
>
> Since 2.2.5 has been released, I've built that version, and I'm seeing the
> same behavior.
>
> =>
> => Most likely you forgot to put in an Autochanger resource for controlling
> your
>
> Nope. The same configuration works fine with the autochanger under 1.38.11.
> Here are excerpts from the config files:
>
> -------------- bacula-dir.conf ----------------------
> # Definition of file storage device
> Storage {
>   Name =  pv132t
> # Do not use "localhost" here
>   Address = parthenon                # N.B. Use a fully qualified name here
>   SDPort = 9103
>   Password = "IeRUDo7djGxxxxxxxxxxxxxxxxxxxxxxxxxlM1xUkbrEJnq4K"
>   Device = pv132t
>   Media Type = LTO2
>   Autochanger = yes
>   Maximum Concurrent Jobs = 25
> }
> -----------------------------------------------------
>
>
>
> --------------- bacula-sd.conf ----------------------
> #
> # An autochanger device with two drives
> #
> Autochanger {
>   Name = pv132t
>   Device = Drive-0
>   Device = Drive-1
>   Changer Command = "/usr/local/bacula-2.2.5/bin/mtx-changer %c %o %S %a
> %d" Changer Device = /dev/changer
> }
>
> Device {
>   Name = Drive-0                      #
>   LabelMedia = yes;                   # lets Bacula label unlabeled media
>   Drive Index = 0
>   Media Type = LTO2
>   Archive Device = /dev/tape0
>   AutomaticMount = yes;               # when device opened, read it
>   AlwaysOpen = yes;
>   RemovableMedia = yes;
>   RandomAccess = no;
>   AutoChanger = yes
>   # Enable the Alert command only if you have the mtx package loaded
>   Alert Command = "sh -c 'tapeinfo -f %c |grep TapeAlert|cat'"
>   Maximum Spool Size = 60G                      # For network client
>   Maximum Network Buffer Size = 65536
>   Spool Directory = /san3/var/spool/bacula
>   Autoselect = yes
> }
>
> Device {
>   Name = Drive-1                      #
>   LabelMedia = yes;                   # lets Bacula label unlabeled media
>   Drive Index = 1
>   Media Type = LTO2
>   Archive Device = /dev/tape1
>   AutomaticMount = yes;               # when device opened, read it
>   AlwaysOpen = yes;
>   RemovableMedia = yes;
>   RandomAccess = no;
>   AutoChanger = yes
>   # Enable the Alert command only if you have the mtx package loaded
>   Alert Command = "sh -c 'tapeinfo -f %c |grep TapeAlert|cat'"
>   Maximum Spool Size = 60G                      # For network client
>   Maximum Network Buffer Size = 65536
>   Spool Directory = /san2/var/spool/bacula
>   Autoselect = yes
> }
> -----------------------------------------------------
> => autochanger.  Barring that, you will need to run the SD under the
> debugger as => defined in the Kaboom chapter of the manual and obtain a
> good traceback.
>
>
> Done. Here's a script(1) of the gdb output. Please let me know if there's
> any more information that I can provide, or any alternative way of testing
> this.
>
> --------------------------------------------------------------------
> Script started on Thu 11 Oct 2007 05:25:41 PM EDT
> [EMAIL PROTECTED] bacula-2.2.5]$ ./test_bacula.rc start
> Starting the Storage daemon
> Starting the File daemon
> Starting the Director daemon
> [EMAIL PROTECTED] bacula-2.2.5]$ ps -ef | grep bacula
> root     18416 18385  0 17:25 pts/24   00:00:00 script bacula-debugging
> root     18417 18416  0 17:25 pts/24   00:00:00 script bacula-debugging
> root     18449     1  0 17:25 ?        00:00:00
> /usr/local/bacula-2.2.5/sbin/bacula-fd -u root -g root -v -c
> /usr/local/bacula-2.2.5/etc/bacula-fd.conf root     18451 18449  0 17:25 ? 
>       00:00:00 /usr/local/bacula-2.2.5/sbin/bacula-fd -u root -g root -v -c
> /usr/local/bacula-2.2.5/etc/bacula-fd.conf root     18452 18451  0 17:25 ? 
>       00:00:00 /usr/local/bacula-2.2.5/sbin/bacula-fd -u root -g root -v -c
> /usr/local/bacula-2.2.5/etc/bacula-fd.conf root     18454     1  0 17:25 ? 
>       00:00:00 [bacula-dir]
> root     18456 18454  0 17:25 ?        00:00:00 [bacula-dir]
> root     18457 18456  0 17:25 ?        00:00:00 [bacula-dir]
> root     18458 18456  0 17:25 ?        00:00:00 [bacula-dir]
> root     18460 18418  0 17:25 pts/2    00:00:00 grep bacula
> [EMAIL PROTECTED] bacula-2.2.5]$ gdb sbin/bacula-sd
> GNU gdb Red Hat Linux (5.3.90-0.20030710.41rh)
> Copyright 2003 Free Software Foundation, Inc.
> GDB is free software, covered by the GNU General Public License, and you
> are welcome to change it and/or distribute copies of it under certain
> conditions. Type "show copying" to see the conditions.
> There is absolutely no warranty for GDB.  Type "show warranty" for details.
> This GDB was configured as "i386-redhat-linux-gnu"...Using host
> libthread_db library "/lib/libthread_db.so.1".
>
> (gdb) run -s -f -c etc/bacula-sd.conf
> Starting program: /usr/local/bacula-2.2.5/sbin/bacula-sd -s -f -c
> etc/bacula-sd.conf [Thread debugging using libthread_db enabled]
> [New Thread 16384 (LWP 18507)]
> [New Thread 32769 (LWP 18537)]
> [New Thread 16386 (LWP 18538)]
> [New Thread 32771 (LWP 18540)]
> [New Thread 49156 (LWP 18832)]
> [New Thread 65541 (LWP 18841)]
> [New Thread 81926 (LWP 18854)]
> [New Thread 98311 (LWP 18863)]
> [New Thread 114696 (LWP 18865)]
> [New Thread 131081 (LWP 18874)]
> [New Thread 147466 (LWP 18930)]
> [New Thread 163851 (LWP 18933)]
> [New Thread 180236 (LWP 18961)]
> [New Thread 196621 (LWP 18970)]
>
> Program received signal SIGSEGV, Segmentation fault.
> [Switching to Thread 147466 (LWP 18930)]
> 0x08137148 in ?? ()
> (gdb)        thread apply all bt
>
> Thread 13 (Thread 180236 (LWP 18961)):
> #0  0x4003415b in read () from /lib/i686/libpthread.so.0
> #1  0x00000004 in ?? ()
> #2  0x00000001 in ?? ()
> #3  0x08072e40 in read_nbytes(BSOCK*, char*, int) (bsock=0x8150160,
> ptr=0x423e1924 "\002", nbytes=4) at bnet.c:82
> #4  0x08074ab2 in BSOCK::recv() (this=0x8150160) at bsock.c:381
> #5  0x08072c65 in bget_msg(BSOCK*) (sock=0x8150160) at bget_msg.c:60
> #6  0x0804f9c5 in do_append_data(JCR*) (jcr=0x813ef08) at append.c:202
> #7  0x0805f5b1 in append_data_cmd (jcr=0x813ef08) at fd_cmds.c:194
> #8  0x0805f50f in do_fd_commands(JCR*) (jcr=0x813ef08) at fd_cmds.c:165
> #9  0x0805f385 in run_job(JCR*) (jcr=0x813ef08) at fd_cmds.c:128
> #10 0x080604f5 in run_cmd(JCR*) (jcr=0x813ef08) at job.c:195
> #11 0x0805b1b0 in handle_connection_request(void*) (arg=0x813e498) at
> dircmd.c:229 #12 0x0808788c in workq_server (arg=0x80a3900) at workq.c:357
> #13 0x4002edb2 in pthread_start_thread () from /lib/i686/libpthread.so.0
> #14 0x4002ef45 in pthread_start_thread_event () from
> /lib/i686/libpthread.so.0 #15 0x401797fa in clone () from
> /lib/i686/libc.so.6
>
> Thread 11 (Thread 147466 (LWP 18930)):
> #0  0x08137148 in ?? ()
> #1  0x08114570 in ?? ()
> #2  0x081156e0 in ?? ()
> #3  0x0806da33 in despool_data (dcr=0x8115260, commit=true) at spool.c:273
> #4  0x0806d596 in commit_data_spool(DCR*) (dcr=0x8115260) at spool.c:139
> #5  0x0804fd34 in do_append_data(JCR*) (jcr=0x8114570) at append.c:318
> #6  0x0805f5b1 in append_data_cmd (jcr=0x8114570) at fd_cmds.c:194
> #7  0x0805f50f in do_fd_commands(JCR*) (jcr=0x8114570) at fd_cmds.c:165
> ---Type <return> to continue, or q <return> to quit---
> #8  0x0805f385 in run_job(JCR*) (jcr=0x8114570) at fd_cmds.c:128
> #9  0x080604f5 in run_cmd(JCR*) (jcr=0x8114570) at job.c:195
> #10 0x0805b1b0 in handle_connection_request(void*) (arg=0x80c2958) at
> dircmd.c:229 #11 0x0808788c in workq_server (arg=0x80a3900) at workq.c:357
> #12 0x4002edb2 in pthread_start_thread () from /lib/i686/libpthread.so.0
> #13 0x4002ef45 in pthread_start_thread_event () from
> /lib/i686/libpthread.so.0 #14 0x401797fa in clone () from
> /lib/i686/libc.so.6
>
> Thread 9 (Thread 114696 (LWP 18865)):
> #0  0x400340db in write () from /lib/i686/libpthread.so.0
>
> Thread 7 (Thread 81926 (LWP 18854)):
> #0  0x080805c7 in sm_sizeof_pool_memory(char const*, int, char*)
> (fname=0x80c0328 "", lineno=1098783052, obuf=0x80c0328 "") at
> mem_pool.c:166
> #1  0x08072c65 in bget_msg(BSOCK*) (sock=0x80c0328) at bget_msg.c:60
> #2  0x0804f892 in do_append_data(JCR*) (jcr=0x80be998) at append.c:154
> #3  0x0805f5b1 in append_data_cmd (jcr=0x80be998) at fd_cmds.c:194
> #4  0x0805f50f in do_fd_commands(JCR*) (jcr=0x80be998) at fd_cmds.c:165
> #5  0x0805f385 in run_job(JCR*) (jcr=0x80be998) at fd_cmds.c:128
> #6  0x080604f5 in run_cmd(JCR*) (jcr=0x80be998) at job.c:195
> #7  0x0805b1b0 in handle_connection_request(void*) (arg=0x80be3b0) at
> dircmd.c:229 #8  0x0808788c in workq_server (arg=0x80a3900) at workq.c:357
> #9  0x4002edb2 in pthread_start_thread () from /lib/i686/libpthread.so.0
> #10 0x4002ef45 in pthread_start_thread_event () from
> /lib/i686/libpthread.so.0 #11 0x401797fa in clone () from
> /lib/i686/libc.so.6
> Current language:  auto; currently c++
>
> Thread 5 (Thread 49156 (LWP 18832)):
> #0  0x4003415b in read () from /lib/i686/libpthread.so.0
> #1  0x00000004 in ?? ()
> ---Type <return> to continue, or q <return> to quit---
> #2  0x00000001 in ?? ()
> #3  0x08072e40 in read_nbytes(BSOCK*, char*, int) (bsock=0x80b9e78,
> ptr=0x413d6924 "", nbytes=4) at bnet.c:82
> #4  0x08074ab2 in BSOCK::recv() (this=0x80b9e78) at bsock.c:381
> #5  0x08072c65 in bget_msg(BSOCK*) (sock=0x80b9e78) at bget_msg.c:60
> #6  0x0804f9c5 in do_append_data(JCR*) (jcr=0x80a9380) at append.c:202
> #7  0x0805f5b1 in append_data_cmd (jcr=0x80a9380) at fd_cmds.c:194
> #8  0x0805f50f in do_fd_commands(JCR*) (jcr=0x80a9380) at fd_cmds.c:165
> #9  0x0805f385 in run_job(JCR*) (jcr=0x80a9380) at fd_cmds.c:128
> #10 0x080604f5 in run_cmd(JCR*) (jcr=0x80a9380) at job.c:195
> #11 0x0805b1b0 in handle_connection_request(void*) (arg=0x80a56a0) at
> dircmd.c:229 #12 0x0808788c in workq_server (arg=0x80a3900) at workq.c:357
> #13 0x4002edb2 in pthread_start_thread () from /lib/i686/libpthread.so.0
> #14 0x4002ef45 in pthread_start_thread_event () from
> /lib/i686/libpthread.so.0 #15 0x401797fa in clone () from
> /lib/i686/libc.so.6
>
> Thread 4 (Thread 32771 (LWP 18540)):
> #0  0x40034936 in nanosleep () from /lib/i686/libpthread.so.0
> #1  0x00000001 in ?? ()
> #2  0x40030b5a in __pthread_timedsuspend_new () from
> /lib/i686/libpthread.so.0
>
> Thread 2 (Thread 32769 (LWP 18537)):
> #0  0x4017075a in poll () from /lib/i686/libc.so.6
> #1  0x4002dd1a in __pthread_manager () from /lib/i686/libpthread.so.0
> #2  0x4002dfea in __pthread_manager_event () from /lib/i686/libpthread.so.0
> #3  0x401797fa in clone () from /lib/i686/libc.so.6
>
> Thread 1 (Thread 16384 (LWP 18507)):
> #0  0x401728d1 in select () from /lib/i686/libc.so.6
> ---Type <return> to continue, or q <return> to quit---
> #1  0x00000009 in ?? ()
> #2  0x401d3d20 in buffer () from /lib/i686/libc.so.6
> #0  0x08137148 in ?? ()
> (gdb)
> (gdb) quit
> The program is running.  Exit anyway? (y or n) y
> [EMAIL PROTECTED] bacula-2.2.5]$ exit
> exit
>
> Script done on Thu 11 Oct 2007 05:31:46 PM EDT
> --------------------------------------------------------------------
>
> Thanks,
>
> Mark
>
> =>
> => Regards,
> =>
> => Kern
> =>
> => On Thursday 04 October 2007 17:11, [EMAIL PROTECTED] wrote:
> => > I'm testing bacula 2.2.4 and the bacula-sd daemon repeatedly exits
> with a => > segmentation fault. Bacula 1.38.11 works reliably on the same
> machine with => > the same hardware and configuration
> => > files.
> => >
> => > Environment:
> => >  Fedora Core 1
> => >  Kernel 2.4.26
> => >  gcc 3.3.2
> => >  MySQL 5.0.22
> => >  Dell PV 132T autochanger
> => >
> => > Build configuration script:
> => >  ./configure \
> => >                   --prefix=/usr/local/bacula-2.2.4      \
> => >                   --disable-nls \
> => >                   --disable-ipv6        \
> => >                   --enable-batch-insert \
> => >                   [EMAIL PROTECTED]    \
> => >                   [EMAIL PROTECTED]     \
> => >                   --with-db-name=bacula2        \
> => >                   --with-mysql  \
> => >                   --mandir=/usr/local/bacula-2.2.4/man  \
> => >                   --with-pid-dir=/usr/local/bacula-2.2.4/var/run        \
> => >                   --with-subsys-dir=/usr/local/bacula-2.2.4/var/subsys  \
> => >                   --with-dir-user=bacula        \
> => >                   --with-dir-group=bacula       \
> => >                   --with-sd-user=bacula \
> => >                   --with-sd-group=bacula        \
> => >                   --with-fd-user=root   \
> => >                   --with-fd-group=root && make
> => >
> => >
> => > I'm using the same configuration files for the director, sd, and fd
> that => > work with the 1.38.11 installation (after removing the "Accept
> Any Volume" => > directive, changing the paths for 2.2.4, and adding the
> directive => > "RecyclePool = Scratch").
> => >
> => > The software compiles without error. There were no errors from "btape
> test" => > or "btape autochanger".
> => >
> => > The bacula-sd daemon crashes repeatedly whether or not the variable
> => > LD_ASSUME_KERNEL was set to "2.4.19" before compiling bacula.
> => >
> => > The bacula-sd daemon is running as root (while I sort out an issue
> that's => > not present in 1.38.11 with permissions on the tape device).
> The bacula-dir => > normally runs as user "bacula".
> => >
> => > Even if I modify the bacula-dir options to run it as root, no
> traceback => > file is generated. The message (received via email) from
> Bacula is: => >
> => >  Using host libthread_db library "/lib/libthread_db.so.1".
> => >  0x401728d1 in ?? ()
> => >  /usr/local/bacula-2.2.4/etc/btraceback.gdb:1: Error in sourced
> command => > file: Cannot access memory at address 0x80a2e34
> => >
> => > The fault seems to occur right after the SD begins despooling data.
> I've => > got 4 log files from running the SD with debugging on (set to 200
> or => > higher), and the error always happens after the first instance of
> => > despooling data. In each case, the log file shows "stored.c:582 In =>
> > terminate_stored() sig=11".
> => >
> => > I've attached an excerpt from the SD debugging output.
> => >
> => >
> => > Thanks,
> => >
> => > Mark
> => >
> => >
> => > ----
> => > Mark Bergman                      [EMAIL PROTECTED]
> => > System Administrator
> => > Section of Biomedical Image Analysis             215-662-7310
> => > Department of Radiology,           University of Pennsylvania
> => >
> => >
> http://pgpkeys.pca.dfn.de:11371/pks/lookup?search=mark.bergman%40.uphs.upen
> => >n.edu
> =>

-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to