> Date: Wed, 23 Mar 2016 22:16:36 +0100
> From: Martin Pieuchot <m...@openbsd.org>
> 
> On 23/03/16(Wed) 21:35, Mark Kettenis wrote:
> > > Date: Mon, 21 Mar 2016 16:51:16 +0100 (CET)
> > > From: Mark Kettenis <mark.kette...@xs4all.nl>
> > > 
> > > This doesn't only change the sched_yield() behaviour, but also
> > > modifies how in-kernel yield() calls behave for threaded processes.
> > > That is probably not right.
> > 
> > So here is a diff that keeps yield() the same and adds the code in the
> > sched_yield(2) implementation instead.
> 
> I'm not a big fan of code duplication, what about checking for
> p_usrpri >= PUSER instead?
I think we really want to keep the existing yield() semantics for
userland processes running inside the kernel as well.  So that
wouldn't work, even apart from the fact that the p_usrpri >= PUSER
check is a bit dodgy.  So I think some code duplication is
unavoidable here.

> > It also changes the logic that picks the priority to walk the
> > complete threads list and pick the highest priority of all the
> > threads.  That should be enough to make sure the calling thread is
> > scheduled behind all other threads from the same process.  That
> > does require us to grab the kernel lock though.  So this removes
> > NOLOCK from sched_yield(2).  I don't think that is a big issue.
> > 
> > Still improves video playback in firefox on my x1.
> 
> This is another kind of hack :)  It's certainly less intrusive but
> I'm still not sure if we want that.  Reducing the priority of a
> thread to the worst priority of its siblings might still not be
> enough.

I don't disagree with you, although our current implementation of
sched_yield(2) *is* somewhat broken as it doesn't guarantee that the
current thread will land at the tail of the list of threads running
at the same priority (the priority as set by the user through
setpriority(2)).

> Now I'd like to know how many times sched_yield() really triggers a
> context switch for threaded programs with this version of the diff.

Do you have a diff to measure this?

> Without any diff I observed that only 20 to 25% of the sched_yield()
> calls made by firefox result in a different thread being chosen by
> mi_switch().  Something tells me that per-CPU runqueues are to
> blame here.

I'm not sure the per-CPU runqueues are that much of a problem.
Spinning on sched_yield() may be somewhat suboptimal, but it
shouldn't really affect the speed at which things run as long as the
thread that holds the spinlock is guaranteed to make progress on some
other CPU.
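To make the spinning concrete: the pattern in question is the usual
userland spin-then-yield lock, which is what generates the bursts of
sched_yield(2) calls observed in firefox.  This is a hypothetical
illustration (the function names yield_lock()/yield_unlock() and the
spin count are made up, not taken from any diff in this thread):

```c
/*
 * Sketch of a spin-then-yield lock: spin briefly on an atomic flag,
 * then call sched_yield() hoping the lock holder gets to run.  This
 * only helps if the holder actually makes progress on some CPU.
 */
#include <sched.h>
#include <stdatomic.h>

static atomic_flag lock = ATOMIC_FLAG_INIT;

void
yield_lock(void)
{
	int spins = 0;

	while (atomic_flag_test_and_set_explicit(&lock,
	    memory_order_acquire)) {
		/* After a short spin, ask the kernel to run someone else. */
		if (++spins > 100) {
			sched_yield();
			spins = 0;
		}
	}
}

void
yield_unlock(void)
{
	atomic_flag_clear_explicit(&lock, memory_order_release);
}
```

If the scheduler puts the yielding thread right back on the CPU (as
happens in 75-80% of the calls per the numbers above), the spin loop
just burns cycles until the holder is scheduled elsewhere.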
Ultimately we want to reduce the contention; and to first order that
means a more thread-friendly malloc implementation.  So I encourage
folks to look at otto's diff.  But there will always be some lock
contention.  So improving the __thrsleep/__thrwakeup interface will
be important as well.  Unfortunately the atomic-operation challenged
hardware platforms make this non-trivial.  With those issues fixed we
might be able to implement sched_yield(2) in a slightly less
pessimistic way.

Cheers,

Mark

> > Index: syscalls.master
> > ===================================================================
> > RCS file: /cvs/src/sys/kern/syscalls.master,v
> > retrieving revision 1.167
> > diff -u -p -r1.167 syscalls.master
> > --- syscalls.master	21 Mar 2016 22:41:29 -0000	1.167
> > +++ syscalls.master	23 Mar 2016 20:33:45 -0000
> > @@ -514,7 +514,7 @@
> >  #else
> >  297	UNIMPL
> >  #endif
> > -298	STD NOLOCK	{ int sys_sched_yield(void); }
> > +298	STD		{ int sys_sched_yield(void); }
> >  299	STD NOLOCK	{ pid_t sys_getthrid(void); }
> >  300	OBSOL		t32___thrsleep
> >  301	STD		{ int sys___thrwakeup(const volatile void *ident, \
> > Index: kern_synch.c
> > ===================================================================
> > RCS file: /cvs/src/sys/kern/kern_synch.c,v
> > retrieving revision 1.129
> > diff -u -p -r1.129 kern_synch.c
> > --- kern_synch.c	9 Mar 2016 13:38:50 -0000	1.129
> > +++ kern_synch.c	23 Mar 2016 20:33:45 -0000
> > @@ -1,4 +1,4 @@
> > -/*	$OpenBSD: kern_synch.c,v 1.129 2016/03/09 13:38:50 mpi Exp $	*/
> > +/*	$openbsd: kern_synch.c,v 1.129 2016/03/09 13:38:50 mpi Exp $	*/
> >  /*	$NetBSD: kern_synch.c,v 1.37 1996/04/22 01:38:37 christos Exp $	*/
> >  
> >  /*
> > @@ -432,7 +432,24 @@ wakeup(const volatile void *chan)
> >  int
> >  sys_sched_yield(struct proc *p, void *v, register_t *retval)
> >  {
> > -	yield();
> > +	struct proc *q;
> > +	int s;
> > +
> > +	SCHED_LOCK(s);
> > +	/*
> > +	 * If one of the threads of a multi-threaded process called
> > +	 * sched_yield(2), 
drop its priority to ensure its siblings
> > +	 * can make some progress.
> > +	 */
> > +	p->p_priority = p->p_usrpri;
> > +	TAILQ_FOREACH(q, &p->p_p->ps_threads, p_thr_link)
> > +		p->p_priority = max(p->p_priority, q->p_priority);
> > +	p->p_stat = SRUN;
> > +	setrunqueue(p);
> > +	p->p_ru.ru_nvcsw++;
> > +	mi_switch();
> > +	SCHED_UNLOCK(s);
> > +
> >  	return (0);
> >  }
> > 
> > 