I've just upgraded a machine from FreeBSD 6.3 to 7.2. I replaced all the ports with new versions compiled on 7.2, and everything is working normally (just like every other server running these builds) except for bacula-dir. It is hanging right after starting the UA server, and before it starts accepting network connections. From my reading it is hanging in the _umtx_op() call. No core, no log message, nothing -- except that you have to kill -9 the process.
I found one other report about this in the archives but they said updating gettext fixed it. I recompiled those to be sure. I even recompiled bacula with NLS disabled so that gettext wasn't linked it, and the problem doesn't change. I'm making no headway on this, and would appreciate some direction on other things to test/debug: /usr/local/sbin/bacula-dir -d300 -f -v bacula-dir: dird.c:184-0 Debug level = 300 bacula-dir: runscript.c:296-0 runscript: debug bacula-dir: runscript.c:297-0 --> RunScript bacula-dir: runscript.c:298-0 --> Command=/usr/local/share/bacula/ make_catalog_backup bacula bacula *snip* localhost bacula-dir: runscript.c:299-0 --> Target= bacula-dir: runscript.c:300-0 --> RunOnSuccess=1 bacula-dir: runscript.c:301-0 --> RunOnFailure=0 bacula-dir: runscript.c:302-0 --> FailJobOnError=1 bacula-dir: runscript.c:303-0 --> RunWhen=2 bacula-dir: runscript.c:296-0 runscript: debug bacula-dir: runscript.c:297-0 --> RunScript bacula-dir: runscript.c:298-0 --> Command=/usr/local/share/bacula/ delete_catalog_backup bacula-dir: runscript.c:299-0 --> Target= bacula-dir: runscript.c:300-0 --> RunOnSuccess=1 bacula-dir: runscript.c:301-0 --> RunOnFailure=0 bacula-dir: runscript.c:302-0 --> FailJobOnError=1 bacula-dir: runscript.c:303-0 --> RunWhen=1 bacula-dir: message.c:263-0 Copy message resource 2870f1b8 to 28714698 bacula-dir: bsys.c:503-0 Could not open state file. sfd=-1 size=188: ERR=No such file or directory bacula-dir: mysql.c:101-0 db_open first time bacula-dir: mysql.c:130-0 initdb ref=1 connected=0 db=0 bacula-dir: mysql.c:166-0 mysql_init done bacula-dir: mysql.c:187-0 mysql_real_connect done bacula-dir: mysql.c:189-0 db_user=bacula db_name=bacula db_password=*snip* bacula-dir: mysql.c:215-0 opendb ref=1 connected=1 db=28708044 bacula-dir: sql_create.c:341-0 In create mediatype bacula-dir: sql_create.c:344-0 selectmediatype: SELECT MediaTypeId,MediaType FROM MediaType WHERE MediaType='File_SVcolo' bacula-dir: mysql.c:236-0 closedb ref=0 connected=1 db=28708044 bacula-dir: mysql.c:240-0 close db=28708044 backup0-dir: dird.c:317-0 Start UA server FWIW, it's not the state file error. I didn't used to get that error until I removed all files trying to see if something in the environment was confusing it. Exact same process, same hang in the same place whether the state file was there or not. Here is the ktrace (similar to strace on linux) output near the failure. From my reading it is hanging in the _umtx_op() call. 91887 bacula-dir GIO fd 1 wrote 44 bytes "bacula-dir: mysql.c:240-0 close db=28708044 " 91887 bacula-dir RET write 44/0x2c 91887 bacula-dir CALL write(0x4,0x28763000,0x5) 91887 bacula-dir GIO fd 4 wrote 5 bytes 0x0000 0100 0000 01 |.....| 91887 bacula-dir RET write 5 91887 bacula-dir CALL shutdown(0x4,<invalid=2>) 91887 bacula-dir RET shutdown 0 91887 bacula-dir CALL close(0x4) 91887 bacula-dir RET close 0 91887 bacula-dir CALL __sysctl(0xbfbfe88c, 0x2,0x2815eea0,0xbfbfe8a4,0,0) 91887 bacula-dir RET __sysctl 0 91887 bacula-dir CALL sigaction(SIGHUP,0xbfbfecb4,0xbfbfec9c) 91887 bacula-dir RET sigaction 0 91887 bacula-dir CALL open(0x2815eee0,O_RDWR|O_CREAT,S_IRUSR|S_IWUSR) 91887 bacula-dir NAMI "/var/db/bacula/backup0-dir.conmsg" 91887 bacula-dir RET open 4 91887 bacula-dir CALL lseek(0x4,0,SEEK_SET,0x2) 91887 bacula-dir RET lseek 0 91887 bacula-dir CALL close(0x4) 91887 bacula-dir RET close 0 91887 bacula-dir CALL open(0x2815eee0,O_RDWR|O_APPEND|O_CREAT,S_IRUSR| S_IWUSR|S_IRGRP|S_IWGRP|S_IROTH|S_IWOTH) 91887 bacula-dir NAMI "/var/db/bacula/backup0-dir.conmsg" 91887 bacula-dir RET open 4 91887 bacula-dir CALL lseek(0x4,0,SEEK_SET,0x2) 91887 bacula-dir RET lseek 0 91887 bacula-dir CALL write(0x1,0x28711000,0x2a) 91887 bacula-dir GIO fd 1 wrote 42 bytes "backup0-dir: dird.c:317-0 Start UA server " 91887 bacula-dir RET write 42/0x2a 91887 bacula-dir CALL _umtx_op(0xbfbfebd0,0x3,0x1,0,0) 91887 bacula-dir RET _umtx_op 0 91887 bacula-dir CALL sigprocmask(SIG_BLOCK,0xbfbfeb74,0x287010d8) 91887 bacula-dir RET sigprocmask 0 91887 bacula-dir CALL sigprocmask(SIG_SETMASK,0x287010d8,0) 91887 bacula-dir RET sigprocmask 0 91887 bacula-dir CALL _umtx_op(0x281daa80,0x11,0,0,0) 91887 bacula-dir RET _umtx_op -1 errno 4 Interrupted system call 91887 bacula-dir PSIG SIGINT SIG_DFL Machine: Rackable 3U with single-core Athlon CPU: AMD Opteron(tm) Processor 244 (1804.10-MHz 686-class CPU) Origin = "AuthenticAMD" Id = 0xf5a Stepping = 10 Features = 0x78bfbff < FPU ,VME ,DE ,PSE ,TSC ,MSR ,PAE ,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2> AMD Features=0xe0500800<SYSCALL,NX,MMX+,LM,3DNow!+,3DNow!> real memory = 2146828288 (2047 MB) This exact machine and hardware have been running FreeBSD 6.x and Bacula for >2 years now, zero problems. -- Jo Rhett Net Consonance : consonant endings by net philanthropy, open source and other randomness ------------------------------------------------------------------------------ Come build with us! The BlackBerry(R) Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9 - 12, 2009. Register now! http://p.sf.net/sfu/devconference _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users