On Mar 10, 2005, at 9:05 PM, Charles Lockhart wrote:

Jim Thompson wrote:

pthread_cond_broadcast() will attempt to waken *ALL* threads waiting on that condition variable. If there is more than one, they'll race, and the first one that gets past the mutex 'wins'. pthread_cond_signal() will attempt to waken *one* thread (typically the first
thread waiting on the condition variable).

What do you mean 'wins'? Maybe I don't understand correctly, but if I have three threads waiting on a conditional, and I do a broadcast on that conditional, they should ALL wake up, right?

Yes, but then two of them won't get (the outer) mutex, which essentially forces my former code to execute single-threaded. The one that has or gets the mutex 'wins' (and gets to continue running).

Of course, before you said:

One of the threads sleeps on a conditional wait (pthread_cond_wait(&my_cond, &my_mutex)), and when a different thread calls pthread_cond_broadcast(&my_cond), I kind of expect the first thread to wake its shiny smiling face and do some work. But it doesn't. Shouldn't it? It used to.

And I'll have to ask, "Are you 100% sure that the first thread is sleeping on the CV?" The pthread_cond_broadcast() and pthread_cond_signal() functions have no effect if there are no threads currently blocked on the CV.

If all (worker?) threads are busy handling previous requests, when a new one arrives, the signaling of the condition variable will do nothing (since all threads are busy doing other things, NOT waiting on the condition variable), and after all the worker (?) threads finish handling their current request, they come back to wait on the variable, which won't necessarily be signaled again (for example, if no new requests arrive). Thus, there is at least one request pending, while all handling threads are blocked, waiting for a signal, which will never arrive.

I can't read your code, but this describes one interpretation of the problem you may be having, given what you've said, though it violates the assumption that the thread is, indeed, sleeping on the CV.

The typical response to this problem is to set some integer variable to denote the number of pending requests, and have each thread check the value of this variable before waiting on the CV. If this variable's value is positive, some request is pending, and the thread should go and handle it, instead of going to sleep on the CV. Additionally, a thread that handles a request, should reduce the value of this variable by one, to keep the count correct.

I've attached a more complete example, though the code here is still imperfect.

I mean, I don't see anything to indicate that there's any particular order, but they should all wake up, right? Wait, uh, yeah, that's even what it says in the man page. Whereas pthread_cond_signal() wakes up one thread, but if there are multiple threads waiting, there's no provision for determining which one.

Typically you get the thread at the top of the list associated with that CV. Now, please pardon me while I get pedantic.

The first version of the pthreads standard (IEEE Std 1003.1c-1995) says pthread_cond_signal() will wake "one" thread, but it doesn't say which one, as you note.

The Single UNIX Specification (IEEE Std 1003.1-2001, and 2004) says "at least one", which would allow a conforming implementation to wake *all* threads waiting on a given CV, just like pthread_cond_broadcast(). To see why this was changed, read this little tidbit from the 2001 (and 2004) versions of 1003.1:

------
On a multi-processor, it may be impossible for an implementation of pthread_cond_signal() to avoid the unblocking of more than one thread blocked on a condition variable. For example, consider the following partial implementation of pthread_cond_wait() and pthread_cond_signal(), executed by two threads in the order given. One thread is trying to wait on the condition variable, another is concurrently executing pthread_cond_signal(), while a third thread is already waiting.

pthread_cond_wait(mutex, cond):
    value = cond->value; /* 1 */
    pthread_mutex_unlock(mutex); /* 2 */
    pthread_mutex_lock(cond->mutex); /* 10 */
    if (value == cond->value) { /* 11 */
        me->next_cond = cond->waiter;
        cond->waiter = me;
        pthread_mutex_unlock(cond->mutex);
        unable_to_run(me);
    } else
        pthread_mutex_unlock(cond->mutex); /* 12 */
    pthread_mutex_lock(mutex); /* 13 */


pthread_cond_signal(cond):
    pthread_mutex_lock(cond->mutex); /* 3 */
    cond->value++; /* 4 */
    if (cond->waiter) { /* 5 */
        sleeper = cond->waiter; /* 6 */
        cond->waiter = sleeper->next_cond; /* 7 */
        able_to_run(sleeper); /* 8 */
    }
    pthread_mutex_unlock(cond->mutex); /* 9 */


The effect is that more than one thread can return from its call to pthread_cond_wait() or pthread_cond_timedwait() as a result of one call to pthread_cond_signal(). This effect is called "spurious wakeup". Note that the situation is self-correcting in that the number of threads that are so awakened is finite; for example, the next thread to call pthread_cond_wait() after the sequence of events above blocks.

While this problem could be resolved, the loss of efficiency for a fringe condition that occurs only rarely is unacceptable, especially given that one has to check the predicate associated with a condition variable anyway. Correcting this problem would unnecessarily reduce the degree of concurrency in this basic building block for all higher-level synchronization operations.

An added benefit of allowing spurious wakeups is that applications are forced to code a predicate-testing-loop around the condition wait. This also makes the application tolerate superfluous condition broadcasts or signals on the same condition variable that may be coded in some other part of the application. The resulting applications are thus more robust. Therefore, IEEE Std 1003.1-2001 explicitly documents that spurious wakeups may occur.
--------

Heh.

Perhaps you're just running into side-effects of the scheduler. It seems to work here (gentoo, 2.6.10 kernel). See the code and output below.
(I'm not bragging on the code, it was quick-n-dirty.)

Ah, man, your code worked on my machine too. So I must be doing something screwy. I guess I should be happy, because that means I can fix the bug. On the other hand, that means I have to fix the bug.

In my experience, fixing bugs in your code is more fun than finding workarounds for bugs in other people's code.

The Fedora Project officially ended support for Fedora Core 1 (FC1) on September 20th, 2004. FC3 was released November 8, 2004. Maybe you should run FC3 (which is current) vs. a release that is now known as "Fedora Legacy". (http://fedora.redhat.com/)

Yeah, that would be nice. But some of the hardware we have in "the machine" currently only has drivers for the 2.4 kernel, and it's kind of late in the game to start porting and testing, and I am way too busy (and LAZY). Probably on the next instrument.

Jim, you The Man, thank you very much for your time and feedback.

No problem. Here's the more complete code (the previous thing was a hack). This could actually be used to do something, though there are still changes that would need to be made if it were part of a long-running program. (I wouldn't accept an exit on an out-of-memory condition if I were in the code review. Still, this is a mailing list, and it's unlikely that I'll be working for anyone soon, so perfection isn't required.)

Note that the scheduling behavior is still quite different if you use pthread_cond_signal() vs. pthread_cond_broadcast().

/home/jim> cat thread-pool-server.c
#define _GNU_SOURCE

#include <stdio.h>
#include <stdlib.h>
#include <pthread.h>
#include <time.h>       /* nanosleep() */
#include <unistd.h>     /* sleep() */

/* number of threads used to service requests */
#define NUM_HANDLER_THREADS 3

/* global mutex for our program. note that we use a recursive mutex,
 * since a handler thread might try to lock it twice consecutively.
 */

pthread_mutex_t request_mutex = PTHREAD_RECURSIVE_MUTEX_INITIALIZER_NP;

/* global condition variable for our program.*/
pthread_cond_t  got_request   = PTHREAD_COND_INITIALIZER;

int num_requests = 0;

/* a single request. */
struct request {
    struct request* next;   /* pointer to next request, NULL if none */
    int number;
};

struct request* requests = NULL; /* head of linked list of requests. */
struct request* last_request = NULL; /* pointer to last request */

/*
 * add a request to the requests list
 *
 * creates a request structure, adds to the list, and
 * increases number of pending requests by one.
 */
void
add_request(int request_num, pthread_mutex_t* p_mutex, pthread_cond_t* p_cond_var)
{
    int rc;                         /* return code of pthreads functions, UNCHECKED */
    struct request* a_request;      /* pointer to newly added request */

    /* create structure with new request */
    a_request = (struct request*)malloc(sizeof(struct request));
    if (!a_request) { /* malloc failed?? */
        fprintf(stderr, "add_request: out of memory\n");
        exit(1);
    }
    a_request->number = request_num;
    a_request->next = NULL;

    /* lock the mutex, to assure exclusive access to the list */
    rc = pthread_mutex_lock(p_mutex);

    /* add new request to the end of the list, updating list pointers as required */
    if (num_requests == 0) { /* special case - list is empty */
        requests = a_request;
        last_request = a_request;
    }  else {
        last_request->next = a_request;
        last_request = a_request;
    }

    /* increase total number of pending requests by one. */
    num_requests++;

#ifdef DEBUG
    printf("add_request: added request with id '%d'\n", a_request->number);
    fflush(stdout);
#endif /* DEBUG */

    /* unlock mutex */
    rc = pthread_mutex_unlock(p_mutex);

/* signal the condition variable - there's a new request to handle */
#ifdef COND_SIGNAL
    rc = pthread_cond_signal(p_cond_var);
#else
    rc = pthread_cond_broadcast(p_cond_var);
#endif
}

/*
 * get the first pending request from the requests list, removing it from the list.
 * the returned request must be freed by the caller.
 */
struct request*
get_request(pthread_mutex_t* p_mutex)
{
    int rc;                         /* return code of pthreads functions, UNCHECKED */
    struct request* a_request;      /* pointer to request */

    /* lock the mutex, to assure exclusive access to the list */
    rc = pthread_mutex_lock(p_mutex);

    if (num_requests > 0) {
        a_request = requests;
        requests = a_request->next;
        if (requests == NULL) { /* this was the last request on the list */
            last_request = NULL;
        }
        /* decrease the total number of pending requests */
        num_requests--;
    }  else { /* requests list is empty */
        a_request = NULL;
    }

    /* unlock mutex */
    rc = pthread_mutex_unlock(p_mutex);

    /* return the request to the caller. */
    return a_request;
}

void
handle_request(struct request* a_request, int thread_id)
{
    if (a_request) {
        printf("Thread '%d' handled request '%d'\n", thread_id, a_request->number);
        fflush(stdout);
    }
}

/*
 * infinite loop of request handling
 */
void*
handle_requests_loop(void* data)
{
    int rc;                         /* return code UNCHECKED */
    struct request* a_request;      /* pointer to a request */
    int thread_id = *((int*)data);  /* thread id */

#ifdef DEBUG
    printf("Starting thread '%d'\n", thread_id);
    fflush(stdout);
#endif /* DEBUG */

    /* access the requests list exclusively */
    rc = pthread_mutex_lock(&request_mutex);

#ifdef DEBUG
    printf("thread '%d' after pthread_mutex_lock\n", thread_id);
    fflush(stdout);
#endif /* DEBUG */

    while (1) {
#ifdef DEBUG
        printf("thread '%d', num_requests = %d\n", thread_id, num_requests);
        fflush(stdout);
#endif /* DEBUG */
        if (num_requests > 0) { /* a request is pending */
            a_request = get_request(&request_mutex);
            if (a_request) { /* got a request - handle and free it */
                /* unlock mutex - so other threads can handle
                 * other requests waiting in the queue in parallel
                 */
                rc = pthread_mutex_unlock(&request_mutex);
                handle_request(a_request, thread_id);
                free(a_request);
                /* and lock the mutex again */
                rc = pthread_mutex_lock(&request_mutex);
            }
        } else {
            /* wait for a request to arrive. note the mutex will be
             * unlocked here, thus allowing other threads access to
             * requests list
             */
#ifdef DEBUG
            printf("thread '%d' before pthread_cond_wait\n", thread_id);
            fflush(stdout);
#endif /* DEBUG */
            rc = pthread_cond_wait(&got_request, &request_mutex);
            /* after we return from pthread_cond_wait, the mutex
             * is locked again, so we don't need to lock it ourselves
             */
#ifdef DEBUG
            printf("thread '%d' after pthread_cond_wait\n", thread_id);
            fflush(stdout);
#endif /* DEBUG */
        }
    }
}

int
main(int argc, char* argv[])
{
    int i;                                       /* loop control */
    int thr_id[NUM_HANDLER_THREADS];             /* thread IDs */
    pthread_t p_threads[NUM_HANDLER_THREADS];    /* thread structures */
    struct timespec delay;                       /* used for wasting time */

    /* create the request-handling threads */
    for (i=0; i<NUM_HANDLER_THREADS; i++) {
        thr_id[i] = i;  /* this should actually store the value returned from
                         * pthread_create() for a future pthread_join() */
        pthread_create(&p_threads[i], NULL, handle_requests_loop, (void*)&thr_id[i]);
    }

    /* generate requests */
    for (i=0; i<600; i++) {
        add_request(i, &request_mutex, &got_request);
        /* pause execution for a little bit, to allow
         * other threads to run and handle some requests.
         */
        if (rand() > 3*(RAND_MAX/4)) { /* approx 25% of the time */
            delay.tv_sec = 0;
            delay.tv_nsec = 10;
            nanosleep(&delay, NULL);
        }
    }

    /* now wait till there are no more requests to process */
    sleep(5);

    printf("Check ya later, bra\n");

    return 0;
}
