On Saturday, Nov 25th 2006 at 10:29 -0500, quoth Steven W. Orr:

=>On Friday, Nov 24th 2006 at 22:33 -0500, quoth Steven W. Orr:
=>
=>=>On Wednesday, Nov 22nd 2006 at 10:51 -0500, quoth Charlie Brady:
=>=>
=>=>=>
=>=>=>On Wed, 22 Nov 2006, Steven W. Orr wrote:
=>=>=>
=>=>=>> =>Whatever is feeding the standard input of that process has not terminated.
=>=>=>> =>What does "ps fax" tell you?
=>=>=>> =>
=>=>=>> =>> Do we need to modify flexbackup to set SIG_IGN for SIGCHLD?
=>=>=>> =>
=>=>=>> =>I don't know why you are suggesting that.
=>=>=>>
=>=>=>> Right. It's not a zombie like I said above, but since it's not, you're
=>=>=>> correct that the issue of SIG_IGN for SIGCHLD would be a red herring. From
=>=>=>> the ps output above, it's in a sleep state. Your question about who the
=>=>=>> parent is is good. I don't remember because I just killed the process
=>=>=>> after I sent this message but I believe (from previous incidents) it is
=>=>=>> the child of flexbackup. So the tree should be
=>=>=>>
=>=>=>> cron
=>=>=>>  \_bash
=>=>=>>    \_flexbackup
=>=>=>>      \_gzip
=>=>=>
=>=>=>No, the tree should never be just that. Something should be feeding gzip,
=>=>=>and gzip should be feeding something. Both "somethings" should be children
=>=>=>of flexbackup. The exact identity of the "somethings" will depend on your
=>=>=>configuration.
=>=>=>
=>=>=>> What I think is happening is that flexbackup is waiting for gzip to
=>=>=>> complete before it exits. But gzip doesn't exit because it's waiting for
=>=>=>> more input, not knowing that more isn't coming.
=>=>=>
=>=>=>Yes, and you need to determine why no more input is coming, and yet the
=>=>=>program providing such input to gzip has not exited.
=>=>=>
=>=>=>> Sometimes I can go a month without a hangup, and sometimes it hangs
=>=>=>> multiple times per week. Do we need to wait for a recurrence or is this
=>=>=>> enough to be able to work with?
=>=>=>
=>=>=>It's not enough because you haven't given us the full information. Since
=>=>=>you've killed the gzip process, we can't determine what was feeding it
=>=>=>input and why it was blocked. If you can show the actual process tree
=>=>=>rather than what you think "should" be there, then we can provide more
=>=>=>debugging instructions.
=>=>=>
=>=>=>Perhaps if you describe your configuration someone can speculate about
=>=>=>what process was blocked and why.
=>=>
=>=>Ok. I got a new one today and I'm leaving it around so we can figure this
=>=>thing out.
=>=>
=>=>Here's the cron tree:
=>=>
=>=> 3480 ?        Ss     0:01 crond
=>=>16571 ?        S      0:00  \_ crond
=>=>16572 ?        Ss     0:00      \_ /usr/bin/perl -w /usr/bin/flexbackup -set backup -incremental
=>=>16846 ?        Z      0:00      |   \_ [sh] <defunct>
=>=>16573 ?        S      0:00      \_ /usr/sbin/sendmail -FCronDaemon -i -odi -oem -oi -t
=>=>
=>=>And here's the gzip:
=>=>
=>=>16860 ?        S      0:01 gzip -9
=>=>
=>=>and ps -ef shows
=>=>
=>=>root     16860     1  0 03:31 ?        00:00:01 gzip -9
=>=>root     16571  3480  0 03:31 ?        00:00:00 crond
=>=>root     16572 16571  0 03:31 ?        00:00:00 /usr/bin/perl -w /usr/bin/flexbackup -set backup -incremental
=>=>smmsp    16573 16571  0 03:31 ?        00:00:00 /usr/sbin/sendmail -FCronDaemon -i -odi -oem -oi -t
=>=>root     16846 16572  0 03:31 ?        00:00:00 [sh] <defunct>
=>=>
=>=>which shows that gzip is now the child of init which means that his parent
=>=>exited and orphaned him. And 16846 seems to not be getting cleaned up by
=>=>flexbackup.
=>=>
=>=>Anyone have an idea of what this all means?
=>
=>Next day and we got lucky. It happened again.
=>
=>
=>root      3480  0.0  0.0   2668   468 ?        Ss   Aug27   0:01 crond
=>root     16571  0.0  0.0   3292   988 ?        S    Nov24   0:00  \_ crond
=>root     16572  0.0  0.5   8320  5960 ?        Ss   Nov24   0:00  |   \_ /usr/bin/perl -w /usr/bin/flexbackup -set backup -incremental
=>root     16846  0.0  0.0      0     0 ?        Z    Nov24   0:00  |   |   \_ [sh] <defunct>
=>smmsp    16573  0.0  0.2   7344  2744 ?        S    Nov24   0:00  |   \_ /usr/sbin/sendmail -FCronDaemon -i -odi -oem -oi -t
=>root     21193  0.0  0.0   3292   988 ?        S    03:31   0:00  \_ crond
=>root     21194  0.0  0.5   8320  5952 ?        Ss   03:31   0:00      \_ /usr/bin/perl -w /usr/bin/flexbackup -set backup -differential
=>root     21377  0.0  0.0      0     0 ?        Z    03:31   0:00      |   \_ [sh] <defunct>
=>smmsp    21195  0.0  0.2   7344  2728 ?        S    03:31   0:00      \_ /usr/sbin/sendmail -FCronDaemon -i -odi -oem -oi -t
=>
=>and we now have two gzips owned by init.
=>
=>526 > ps -ef | grep gzip
=>root     16860     1  0 Nov24 ?        00:00:01 gzip -9
=>root     21419     1  0 03:31 ?        00:00:03 gzip -9
=>steveo    5813  9890  0 10:28 pts/3    00:00:00 grep gzip
=>527 >
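
Since the stuck gzip has been reparented to init, the ps tree no longer shows what is (or was) feeding it. A pipe reader only sees EOF once every process holding that pipe's write end has closed it, so one way to look for the culprit is to find who else still has gzip's stdin pipe open. Below is a rough sketch of that, assuming Linux with /proc mounted and enough privilege (probably root) to read other processes' fd directories. The name pipe_holders and its proc argument are made up for illustration; nothing here is part of flexbackup.

```python
import os


def pipe_holders(pid, fd=0, proc="/proc"):
    """Return sorted (pid, fd) pairs of OTHER processes that have the same
    pipe open as descriptor `fd` of process `pid`.

    Assumes the Linux /proc layout, where /proc/<pid>/fd/<n> is a symlink
    whose target looks like "pipe:[123456]" for a pipe.
    """
    target = os.readlink(os.path.join(proc, str(pid), "fd", str(fd)))
    if not target.startswith("pipe:"):
        raise ValueError("fd %d of pid %d is not a pipe: %s" % (fd, pid, target))

    holders = []
    for entry in os.listdir(proc):
        # Only numeric entries are processes; skip the process we started from.
        if not entry.isdigit() or int(entry) == pid:
            continue
        fd_dir = os.path.join(proc, entry, "fd")
        try:
            names = os.listdir(fd_dir)
        except OSError:
            continue  # process exited, or we lack permission to look
        for name in names:
            try:
                if os.readlink(os.path.join(fd_dir, name)) == target:
                    holders.append((int(entry), int(name)))
            except OSError:
                continue  # descriptor was closed while we were scanning
    return sorted(holders)
```

On the box above this would be called as pipe_holders(16860), using the PID of the oldest orphaned gzip. The sketch does not tell the read end from the write end of the pipe; running ls -l on the reported /proc/<pid>/fd/<n> entries should show which descriptors are open for writing. An empty result would suggest the pipe is not what gzip is waiting on, or that the scan could not see the holder.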

Ok. I promise I won't post any more examples. I just had one more from
last night.

 3480 ?        Ss     0:01 crond
16571 ?        S      0:00  \_ crond
16572 ?        Ss     0:00  |   \_ /usr/bin/perl -w /usr/bin/flexbackup -set backup -incremental
16846 ?        Z      0:00  |   |   \_ [sh] <defunct>
16573 ?        S      0:00  |   \_ /usr/sbin/sendmail -FCronDaemon -i -odi -oem -oi -t
21193 ?        S      0:00  \_ crond
21194 ?        Ss     0:00  |   \_ /usr/bin/perl -w /usr/bin/flexbackup -set backup -differential
21377 ?        Z      0:00  |   |   \_ [sh] <defunct>
21195 ?        S      0:00  |   \_ /usr/sbin/sendmail -FCronDaemon -i -odi -oem -oi -t
17332 ?        S      0:00  \_ crond
17333 ?        Ss     0:00      \_ /usr/bin/perl -w /usr/bin/flexbackup -set backup -incremental
17705 ?        Z      0:00      |   \_ [sh] <defunct>
17334 ?        S      0:00      \_ /usr/sbin/sendmail -FCronDaemon -i -odi -oem -oi -t

524 > ps -ef | grep gzip
root     16860     1  0 Nov24 ?        00:00:01 gzip -9
root     21419     1  0 Nov25 ?        00:00:03 gzip -9
root     17719     1  0 03:32 ?        00:00:01 gzip -9
steveo   12378  3292  0 10:17 pts/5    00:00:00 grep gzip
525 >

Also, I included my flexbackup.conf, if that helps.

Thanks.

$type = 'afio';
$set{'backup'} = "/e/web /usr/share/emacs/site-lisp /usr/local /etc /boot /root /var/spool/mail /var/log";
$prune{'/e/web'} = "steveo/mpg";
$compress = 'gzip'; # one of false/gzip/bzip2/lzop/zip/compress/hardware
$compr_level = '9'; # compression level (1-9) (for gzip/bzip2/lzop/zip)
$buffer = 'buffer'; # one of false/buffer/mbuffer
$buffer_megs = '10'; # buffer memory size (in megabytes)
$buffer_fill_pct = '75'; # start writing when buffer this percent full
$buffer_pause_usec = '100'; # pause after write (tape devices only)
$device = '/d2/backup';
$blksize = '10';
$mt_blksize = "0";
$pad_blocks = 'true';
$remoteshell = 'ssh'; # command for remote shell (rsh/ssh/ssh2)
$remoteuser = ''; # if non-null, secondary username for remote shells
$label = 'true'; # somehow store identifying label in archive?
$verbose = 'true'; # echo each file?
$sparse = 'true'; # handle sparse files?
$indexes = 'true'; # false to turn off all table-of-contents support
$staticfiles = 'false';
$atime_preserve = 'false';
$traverse_fs = 'false';
$exclude_expr[0] = '.*/[Cc]ache/.*';
$exclude_expr[1] = '.*~$';
$erase_tape_set_level_zero = 'true';
$erase_rewind_only = 'false';
$logdir = '/var/log/flexbackup'; # directory for log files
$comp_log = 'bzip2'; # compress log? false/gzip/bzip2/lzop/compress/zip
$staticlogs = 'false'; # static log filenames w/ no date stamp
$prefix = ''; # log files will start with this prefix
$tmpdir = '/tmp'; # used for temporary refdate files, etc
$stampdir = '/var/lib/flexbackup'; # directory for backup timestamps
$index = '/var/lib/flexbackup/index'; # DB filename for tape indexes
$keyfile = '00-index-key'; # filename for keyfile if archiving to dir
$sprefix = ''; # stamp files will start with this prefix
$afio_nocompress_types = 'mp3 MP3 Z z gz gif GIF zip ZIP lha jpeg jpg JPG taz tgz deb rpm bz2 lzo png';
$afio_echo_block = 'false';
$afio_compress_threshold = '3';
$afio_compress_cache_size = '2';
$tar_echo_record_num = 'false';
$cpio_format = 'newc';
$dump_length = '0';
$dump_use_dumpdates = 'false';
$star_fifo = 'true';
$star_acl = 'true';
$star_format = 'exustar';
$star_echo_block_num = 'false';
$pax_format = 'ustar';
$zip_nocompress_types = 'mp3 MP3 Z z gz gif zip ZIP lha jpeg jpg JPG taz tgz deb rpm bz2 lzo';
$pkgdelta_archive_list = 'rootonly';
$pkgdelta_archive_unowned = 'true';
$pkgdelta_archive_changed = 'true';
1;

_______________________________________________
flexbackup-help mailing list
flexbackup-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/flexbackup-help