After troubleshooting it some more, I determined that the problem occurred due to a corrupted cfservd ChecksumDatabase. The backtrace from the 2.1.14 version was:
#0  0x40045d95 in __bam_split () from /usr/local/cfengine-test/BerkeleyDB.4.3.27/lib/libdb-4.3.so
#1  0x4003a719 in __bam_c_put () from /usr/local/cfengine-test/BerkeleyDB.4.3.27/lib/libdb-4.3.so
#2  0x400827c3 in __db_c_put () from /usr/local/cfengine-test/BerkeleyDB.4.3.27/lib/libdb-4.3.so
#3  0x4007cfb4 in __db_put () from /usr/local/cfengine-test/BerkeleyDB.4.3.27/lib/libdb-4.3.so
#4  0x40088704 in __db_put_pp () from /usr/local/cfengine-test/BerkeleyDB.4.3.27/lib/libdb-4.3.so
#5  0x080723e8 in ChecksumChanged (filename=0x441fc91c "/SOMEFILE", digest=0x441fd91c "»-½ï{buåÏEì?÷2% ", warnlevel=2, refresh=1, type=109 'm') at misc.c:384
#6  0x0804fbe5 in CompareLocalChecksum (conn=0x4097c580, sendbuffer=0x442029ac "", recvbuffer=0x442039ac "MD5 /SOMEFILE") at cfservd.c:2858
#7  0x0804d0b0 in BusyWithConnection (conn=0x4097c580) at cfservd.c:1486
#8  0x0804c559 in HandleConnection (conn=0x4097c580) at cfservd.c:1135
#9  0x400edc6f in pthread_start_thread (arg=0x44204be0) at manager.c:279

Removing the checksumdatabase file resolved the problem. Is there any way to modify CFE to be insulated from dbm corruption?

Thank you,
-Jason Martin

--

Hello,

I am experiencing frequent cfservd coredumps (about every 30 minutes) with 2.1.14 and 2.1.17. This is on RHEL ES2.1. I've pasted the backtrace and logfile messages below. Any suggestions on solving the problem?

Linux XXX 2.4.9-e.30smp #1 SMP Fri Nov 28 07:18:53 EST 2003 i686 unknown
glibc-2.2.4-32.8

# gdb /usr/local/src/cfengine-2.1.17/src/cfservd ./core.13063
GNU gdb Red Hat Linux (5.2-2)
Copyright 2002 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-redhat-linux"...
Core was generated by `/usr/local/src/cfengine-2.1.17/src/cfservd -F -d 2'.
Program terminated with signal 11, Segmentation fault.
Reading symbols from /usr/local/BerkeleyDB.4.4/lib/libdb-4.4.so...done.
Loaded symbols for /usr/local/BerkeleyDB.4.4/lib/libdb-4.4.so
Reading symbols from /lib/libnss_nis.so.2...done.
Loaded symbols for /lib/libnss_nis.so.2
Reading symbols from /lib/i686/libpthread.so.0...done.
Loaded symbols for /lib/i686/libpthread.so.0
Reading symbols from /lib/i686/libm.so.6...done.
Loaded symbols for /lib/i686/libm.so.6
Reading symbols from /lib/i686/libc.so.6...done.
Loaded symbols for /lib/i686/libc.so.6
Reading symbols from /lib/libnsl.so.1...done.
Loaded symbols for /lib/libnsl.so.1
Reading symbols from /lib/libnss_files.so.2...done.
Loaded symbols for /lib/libnss_files.so.2
Reading symbols from /lib/ld-linux.so.2...done.
Loaded symbols for /lib/ld-linux.so.2
Reading symbols from /lib/libnss_nisplus.so.2...done.
Loaded symbols for /lib/libnss_nisplus.so.2
Reading symbols from /lib/libnss_dns.so.2...done.
Loaded symbols for /lib/libnss_dns.so.2
Reading symbols from /lib/libresolv.so.2...done.
Loaded symbols for /lib/libresolv.so.2
#0  0x401daea8 in memmove (dest=0x2e0a4008, src=0x2e084201, len=969818625) at ../sysdeps/generic/memmove.c:105
105     ../sysdeps/generic/memmove.c: No such file or directory.
        in ../sysdeps/generic/memmove.c

RecvSocketStream(8)
 (Concatenated 8 from stream)
Transaction Receive [t 16][]
RecvSocketStream(16)
 (Concatenated 16 from stream)
Got Blowfish size 16
BinaryBuffer(16)[40b6350000100020000] = 16
cfservd: Host XXXXX granted access to /SOMEFILE
Clocks were off by 0
StatFile(/SOMEFILE)
cfservd: Host YYYY granted access to /OTHERFILE
Clocks were off by 0
StatFile(/OTHERFILE)
cfservd:

The debug output cuts off, I suspect that unflushed buffers are the problem. It would be nice if the debug output performed a fflush() after each print to avoid losing data.
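
To illustrate the fflush() idea, here is a minimal sketch of a debug print wrapper that flushes after every message. The name debug_printf is hypothetical; it is not an existing cfengine function, and I have not checked how cfservd's own debug output is actually implemented.

```c
#include <stdarg.h>
#include <stdio.h>

/* Hypothetical helper (not part of cfengine): write a formatted debug
 * message to `out` and flush the stream right away, so the text is not
 * left sitting in a stdio buffer if the process crashes afterwards.
 * Returns the number of characters written, or -1 on error. */
static int debug_printf(FILE *out, const char *fmt, ...)
{
    va_list ap;
    int n;

    va_start(ap, fmt);
    n = vfprintf(out, fmt, ap);
    va_end(ap);

    /* Push the buffered bytes to the underlying file descriptor now. */
    if (n < 0 || fflush(out) != 0)
        return -1;

    return n;
}
```

A call would look like debug_printf(stdout, "StatFile(%s)\n", filename). Calling setvbuf(stdout, NULL, _IONBF, 0) once at startup, before any output, should have a similar effect without touching each print, at the cost of more write calls.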

Thank you,
-Jason Martin