On Wed, 15 May 2002 08:21:46 -0400 (EDT),
  John Baldwin <[EMAIL PROTECTED]> said:

jhb> On 15-May-2002 Seigo Tanimura wrote:
>> Currently, a new runnable thread cannot preempt the thread on any
>> processor other than the thread that called mi_switch().  For
>> instance, we do something like the following in _mtx_unlock_sleep():
>> --- v --- _mtx_unlock_sleep() --- v ---
>> setrunqueue(th_waken_up);
>> if (curthread->preemptable && th_waken_up->priority < curthread->priority) {
>>         setrunqueue(curthread);
>>         mi_switch();
>> }
>> --- ^ --- _mtx_unlock_sleep() --- ^ ---
>> If the priority of curthread is higher than th_waken_up, we cannot run
>> it immediately even if there is another processor running a thread
>> with a priority lower than th_waken_up.  th_waken_up should preempt
>> that processor, or we would end up with a priority inversion.
>> Maybe we have to dispatch a runnable thread to the processor running
>> a thread with the lowest priority.  Solaris seems to take the
>> following steps to do that:
>> 1. If a new thread has slept for longer than 3/100 seconds (this
>> should be tunable), linearly search the processor running a thread
>> with the lowest priority.  Otherwise, choose the processor that ran
>> the new thread most recently.
>> 2. Make an inter-processor interrupt to the processor chosen in 1.
>> 3. The chosen processor puts its current thread back to the dispatch
>> queue and performs a context switch to run the new thread.
>> Above is only a rough sketch.  We have to watch out for a race between
>> inter-processor interrupts and a processor entering a critical section.
>> If no one is working on preemption across processors, I would like to
>> see if I can do that.

jhb> I actually think that the little gain this brings isn't worth the extra
jhb> effort involved personally.  We don't have to get things perfect, getting
jhb> them reasonably close is good enough for some things.  However, that is
jhb> only my opinion.  If the code to support this is relatively clean and
jhb> simple with low-impact in the normal case then I would support it.  However,
jhb> there are several tricky race conditions here so I'm not sure it can be
jhb> done simply.

The prototype patch is at:


And the p4 depot is at:


The patch is for only i386 at the moment.

The following is a brief description of the patch:

--- v --- Description --- v ---

For a newly runnable thread, setrunqueue() finds the processor running
the thread with the lowest priority via chooseprocessor().  setrunqueue()
then marks the priority of the new thread on the processor chosen for
preemption.

If the processor chosen is not the current processor, setrunqueue()
notifies it with a preemption IPI, whose handler calls
dispatchthread().  If the current processor is chosen for preemption,
setrunqueue() calls dispatchthread() directly.

dispatchthread() grabs the thread with the highest priority from the
run queue.  If the current thread is running and has a higher priority
than the thread grabbed, dispatchthread() returns.  Otherwise,
dispatchthread() puts the current thread back on the run queue (if it
is not an idle thread) and switches to the thread grabbed.  If the
current thread is going to sleep (i.e., its state is SSLEEP, SSTOP,
etc.), we always switch to the thread grabbed.


Call dispatchthread() instead of mi_switch() in msleep(), cv_*wait*(),
etc. in order to give up the current processor.

With this change, the maybe_resched() call in wakeup() and the
preemption check in _mtx_unlock_sleep() are no longer required.  If it
is not appropriate to preempt the current processor, call setrunqueue()
in a critical section.  Note that setrunqueue() may dispatch the thread
passed to it to a processor other than the current one.

Miscellaneous stuff:

If a thread spins on an adaptive mutex, propagate its priority to the
mutex's owner thread.  This prevents the owner thread from being
preempted by a thread whose priority lies between the owner's and the
spinner's.

In order to make a space in the IPI priority for a preemption IPI,
raise the IPI priority of Xcpustop and Xinvltlb by one.

An idle processor no longer has to check whether or not there is a
runnable thread.  Halt an idle processor in an SMP kernel as in a UP
kernel.

--- ^ --- Description --- ^ ---

The time taken for configuring, depending, compiling and linking a
GENERIC kernel was measured by time(1) for the vanilla kernel and the
patched one.  Both of the kernels omit INVARIANTS, INVARIANT_SUPPORT
and WITNESS*.  The spec of the test machine is:

CPU:    dual Pentium II 450MHz
RAM:    256MB
HDD:    one IDE 2GB

Tests were done in the single-user mode immediately after reboot.
Make(1) was run with -j16 for compilation and linking by the
kernel-depend target.  The following results are the averages of five
tests in seconds:

On the vanilla kernel:
        Real:   552.10
        User:   872.34
        Sys:    84.81

On the patched kernel:
        Real:   553.38
        User:   873.96
        Sys:    85.16

Since the results of the vanilla kernel had a range of about one
second around the average, we can say that the patched kernel achieves
almost the same performance as the vanilla one.


To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-current" in the body of the message
