Hi, On Sun, 2 Mar 2003, Jeff Trawick wrote:
> Andre Breiler wrote: > > > the ap2 (2.0.43) parent process dies (but childs arn't) under load. > > This is with worker mpm on solaris 8 (multiprocessor). > > ... > > > All seem to die with a SEGV or SIGBUS due to the fact that after returning > > from a function call the registers have wrong values. > > can you post backtraces from the coredumps? Sure: --- snip 1 --- program terminated by signal BUS (Bus Error) 0xffffffff7ee2c880: <bad address 0xffffffff7ee2c880> Current function is ap_wait_or_timeout 222 rv = apr_proc_wait_all_procs(ret, exitcode, status, APR_NOWAIT, p); (/tool/lang9.1/SUNWspro/bin/../WS6U2/bin/sparcv9/dbx) where [1] 0xffffffff7ee2c880(0xffffffff0000080e, 0xffffffff00000836, 0xffffffff0000083a, 0x1, 0x100133e58, 0x0), at 0xffffffff7ee2c87f =>[2] ap_wait_or_timeout(status = 0xffffffff0000083a, exitcode = 0xffffffff00000836, ret = 0xffffffff0000080e, p = 0x100133e58), line 222 in "mpm_common.c" dbx: warning: invalid frame pointer --- snap core.httpd.323.u0 --- --- snip core.httpd.27942.u0 --- program terminated by signal SEGV (Segmentation Fault) 0xffffffff7ee2c880: <bad address 0xffffffff7ee2c880> Current function is ap_wait_or_timeout 222 rv = apr_proc_wait_all_procs(ret, exitcode, status, APR_NOWAIT, p); (/tool/lang9.1/SUNWspro/bin/../WS6U2/bin/sparcv9/dbx) where current thread: [EMAIL PROTECTED] [1] 0xffffffff7ee2c880(0x97ffff9b8, 0x97ffff9e0, 0x97ffff9e4, 0x1, 0x100133e58, 0x0), at 0xffffffff7ee2c87f =>[2] ap_wait_or_timeout(status = 0x97ffff9e4, exitcode = 0x97ffff9e0, ret = 0x97ffff9b8, p = 0x100133e58), line 222 in "mpm_common.c" dbx: warning: invalid frame pointer --- snap core.httpd.27942.u0 --- --- snip core.httpd.26969.u0 --- program terminated by signal SEGV (Segmentation Fault) Current function is server_main_loop 1645 perform_idle_server_maintenance(); (/tool/lang9.1/SUNWspro/bin/../WS6U2/bin/sparcv9/dbx) where current thread: [EMAIL PROTECTED] =>[1] server_main_loop(remaining_children_to_start = 0), line 1645 in "worker.c" [2] ap_mpm_run(_pconf = 0x100133e58, plog = 0x10015dfa8, s = 0x1001634e8), line 1743 in "worker.c" [3] main(argc = 3, argv = 0xffffffff7ffffcf8), line 643 in "main.c" --- snap core.httpd.26969.u0 --- --- snip core.httpd.18016.u0 --- program terminated by signal BUS (Bus Error) Current function is server_main_loop 1645 perform_idle_server_maintenance(); (/tool/lang9.1/SUNWspro/bin/../WS6U2/bin/sparcv9/dbx) where current thread: [EMAIL PROTECTED] =>[1] server_main_loop(remaining_children_to_start = 0), line 1645 in "worker.c" [2] ap_mpm_run(_pconf = 0x100133e58, plog = 0x10015dfa8, s = 0x100167508), line 1743 in "worker.c" [3] main(argc = 3, argv = 0xffffffff7ffffd08), line 643 in "main.c" --- snap core.httpd.18016.u0 --- The other cores are following the same pattern. Bye Andre' -- Andre' Breiler | Tel: +44 (0) 1628 407777 BBC Internet Services | URL: http://support.bbc.co.uk Maiden House, Vanwell Road | Maidenhead, SL6 4UB | Mail me if possible. And use a Subject line.
