DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT <http://nagoya.apache.org/bugzilla/show_bug.cgi?id=22106>. ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND INSERTED IN THE BUG DATABASE.
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=22106

Problems with ssi exec processes hanging under Apache 2.0.47

------- Additional Comments From [EMAIL PROTECTED] 2003-08-06 01:04 -------

This may or may not be related, but in addition to the hanging SSI processes, Apache occasionally stops responding correctly almost entirely; the error log is filled with messages of this sort:

"No such process: cgid daemon is gone; is Apache terminating?"
"Connection refused: connect #1 to cgi daemon failed, sleeping before retry..."

I ran top to check for running processes and found the other Apache child processes still running; however, the Apache cgid process was still running as well, and it was taking up almost all remaining free memory (1+ GB!). I tried to get a core dump using gcore, but the server was choking and I had to move on. As this was a production server, I had to hurry to get what information I could, but I have the output from pstack for both the cgid daemon process and the Apache root process:

cgid:

12817: /usr/local/apache2/worker/bin/httpd -k start
----------------- lwp# 1 / thread# 1 --------------------
ff09941c read (12, 106325, 4854325b)
fefdac40 read (12, 104130, 48545450, ffffffff, fffffff8, 48649581) + 34
fed5283c get_req (12, d1fb0, a, ffbef710, ffbef720, d2149) + 268
fed5305c cgid_server (7bf18, f, cffa8, 24, 128, 174) + 44c
fed53400 cgid_start (7a178, 7bf18, 1011f0, fed668bc, fed668c0, fed55800) + ac
fed524d0 cgid_maint (0, 1011f0, d, fed52408, ceae8, 7c) + c8
ff2e570c apr_proc_other_child_alert (ffbefa10, 0, d, fec00020, 6, 557e0) + 54
ff2e583c apr_proc_other_child_read (ffbefa10, d, d, 7a178, 0, 0) + c
0002d420 server_main_loop (0, 6ec00, 6d800, 52800, 52000, 6d800) + 148
0002d758 ap_mpm_run (0, 6d800, 6ec00, 6c400, 54c00, 54c00) + 2a4
000343e0 main (781f0, 7a178, 6c400, 6c400, ffbefb1c, 6c400) + 7e0
00025884 _start (0, 0, 0, 0, 0, 0) + 5c
----------------- lwp# 2 / thread# 2 --------------------
ff099950 signotifywait ()
fefcf780 _dynamiclwps (fefeca54, fefee3bc, 2, 0, ff0b5f74, 0) + 1c
fefd26d8 thr_yield (0, 0, 0, 0, 0, 0) + 8c
----------------- lwp# 3 / thread# 3 --------------------
ff099f80 lwp_sema_wait (fdf0de78)
fefcb2f8 _park (fdf0ddc0, fdf0de78, 0, 1, fefed7e0, 0) + 10c
fefcafc0 _swtch (5, fefeca54, fdf0de54, fdf0de50, fdf0de4c, fdf0de48) + 3b4
fefce884 _reap_wait (feff1428, 1e484, 0, 0, 0, 0) + 34
fefce600 _reaper (fefeca54, feff1428, fefed870, feff97bc, 0, fe400000) + 34
fefdbd28 _thread_start (0, 0, 0, 0, 0, 0) + 40

Apache root:

28381: /usr/local/apache2/worker/bin/httpd -k start
----------------- lwp# 1 / thread# 1 --------------------
ff09819c poll (ffbef880, 0, 3e8)
ff04f920 select (ffbef880, ff0b9034, ff0b9034, 0, 0, 3e8) + 334
fefdb6c0 select (0, f4240, ffbefa0c, 1, 7a178, fec00070) + 34
0003a2fc ap_wait_or_timeout (ffbefa0c, ffbefa08, ffbefa10, 7a178, 0, 0) + 5c
0002d328 server_main_loop (0, 6ec00, 6d800, 52800, 52000, 6d800) + 50
0002d758 ap_mpm_run (0, 6d800, 6ec00, 6c400, 54c00, 54c00) + 2a4
000343e0 main (781f0, 7a178, 6c400, 6c400, ffbefb1c, 6c400) + 7e0
00025884 _start (0, 0, 0, 0, 0, 0) + 5c
----------------- lwp# 2 / thread# 2 --------------------
ff099950 signotifywait ()
fefcf780 _dynamiclwps (fefeca54, fefee3bc, 2, 0, ff0b5f74, 0) + 1c
fefd26d8 thr_yield (0, 0, 0, 0, 0, 0) + 8c
----------------- lwp# 3 --------------------------------
ff099f34 lwp_cond_wait (feff3fc8, feff3fd8, fefb5c90)
ff091a00 _lwp_cond_timedwait (feff3fc8, feff3fd8, 0, 3f304e38, 0, 0) + 90
fefca94c _age (fefeca54, fefed7d4, fefee0a8, fefee0c0, 3, fefeca54) + 90
fefcbecc _lwp_start (6000, ffbef7f4, fefed820, fefed808, fefeca54, ffbef6e8) + 14
fefd26d8 thr_yield (0, 0, 0, 0, 0, 0) + 8c
-------------------------- thread# 3 --------------------
fefce884 _reap_wait (feff1428, 1e484, 0, 0, 0, 0) + 34
fefce600 _reaper (fefeca54, feff1428, fefed870, 70d60, 1, fe401000) + 34
fefdbd28 _thread_start (0, 0, 0, 0, 0, 0) + 40

One frustrating part (among the many) to this is that I can't duplicate
this in our development environment, so I have to troubleshoot in production, which can be a bit dicey at times...
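
Since the problem only shows up in production, it may help to tally how often the two cgid messages quoted in this report appear in the error log. The following is a minimal sketch, not part of the original report: the function name `count_cgid_errors`, the pattern keys, and the sample lines are all hypothetical, and it assumes only that the quoted message text appears somewhere in each log line.

```python
import re
from collections import Counter

# Fragments of the two messages quoted in the report. The rest of each log
# line (timestamp, level prefix) is not assumed, so we search, not match.
CGID_PATTERNS = {
    "daemon_gone": re.compile(r"cgid daemon is gone"),
    "connect_failed": re.compile(r"connect #\d+ to cgi daemon failed"),
}


def count_cgid_errors(lines):
    """Count lines containing each of the cgid failure messages."""
    counts = Counter()
    for line in lines:
        for name, pattern in CGID_PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
    return counts


if __name__ == "__main__":
    # Made-up sample input built from the quoted messages, for illustration.
    sample = [
        "No such process: cgid daemon is gone; is Apache terminating?",
        "Connection refused: connect #1 to cgi daemon failed, sleeping before retry...",
        "Connection refused: connect #2 to cgi daemon failed, sleeping before retry...",
        "some unrelated log line",
    ]
    for name, total in sorted(count_cgid_errors(sample).items()):
        print(name, total)
```

To run it against a real log, pass an open file object instead of the sample list; the function only iterates over lines, so it does not load the whole file into memory.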
