Brian Pane <[EMAIL PROTECTED]> writes:
> Jeff Trawick wrote:
>
> >Maybe this is a hint... For a couple of the restart iterations,
> >worker on AIX logs this:
> >
> >[crit] ap_queue_push failed with error code -1
> >
>
> In your AIX test environment, can you catch this error
> case in action by putting breakpoints at the two lines
> in ap_queue_push() where it's about to return -1?
>
> int ap_queue_push(fd_queue_t *queue, apr_socket_t *sd, apr_pool_t *p,
> apr_pool_t **recycled_pool)
> {
> /*...*/
> if (apr_thread_mutex_lock(queue->one_big_mutex) != APR_SUCCESS) {
> return FD_QUEUE_FAILURE;
This is returning EINVAL.
> That might help isolate the source of the problem. My two
> guesses right now are:
> - pool lifetime problem, or
> - pthread library problem
Pool lifetime is by far the most likely suspect out of these two.
Consider that we've been happily obtaining/releasing that mutex all
along until restart time, when a process that is dieing hits that
problem.
--
Jeff Trawick | [EMAIL PROTECTED] | PGP public key at web site:
http://www.geocities.com/SiliconValley/Park/9289/
Born in Roswell... married an alien...