Re: memcached 1.2.2 core dump

Chris Goffinet Fri, 16 Nov 2007 20:04:54 -0800

Send the core dumps. You have --enable-threads correct?


    pthread_mutex_lock(&cache_lock);
    ret = do_item_stats_sizes(bytes);
    pthread_mutex_unlock(&cache_lock);

It's wrapped in pthread mutex, you do know its going to lock theentire cache by calling this correct?

I remember reading sometime ago that by doing this (dump script) itwould lock entire cache.


-Chris

On Nov 16, 2007, at 7:52 PM, Jeremy LaTrasse wrote:

We're running memcached 1.2.2 with libevent 1.3 on OpenSolaris 11.
We recently changed some of the ways that our application interactswith memcached, and then very suddenly afterward startedexperiencing core dumps across our 32 instances of memcached,seemingly arbitrarily.
We captured the core files and discovered that they were allgenerated when a request for 'stats sizes' was issued by ourmonitoring processes.
One of the engineers here postulates the following:

SEGFAULT on line 341 of items.c:

    /* build the histogram */
    memset(histogram, 0, (size_t)num_buckets * sizeof(int ));
    for (i = 0; i < LARGEST_ID; i++) {
        item *iter = heads[i];
        while (iter) {
            int ntotal = ITEM_ntotal(iter);
            int bucket = ntotal / 32;
            if ((ntotal % 32) != 0) bucket++;
            if (bucket < num_buckets) histogram[bucket]++;
            iter = iter->next;
        }
    }

That's:

            int ntotal = ITEM_ntotal(iter);
Given the huge amount of transactions we're doing, we're probablyhitting a race condition around moving items from one bucket to theother. Perhaps a mutex lock is not being set properly
For the time being we've disabled the 'stats sizes' request from ourmonitoring processes to preclude this situation.
I could not find this to be a known issue in previous messages onthis list, but I am certain that someone will end up in this scenario.
I can send the core files or gdb output to anyone interested inaddressing this.
Jeremy LaTrasse
Operations
Twitter

Re: memcached 1.2.2 core dump

Reply via email to