----- Original Message -----
> From: "Peter Zijlstra" <pet...@infradead.org>
> To: "Paul Turner" <p...@google.com>
> Cc: "NeilBrown" <nfbr...@novell.com>, "Linus Torvalds" 
> <torva...@linux-foundation.org>, "Thomas Gleixner"
> <t...@linutronix.de>, "LKML" <linux-kernel@vger.kernel.org>, "Mike Galbraith" 
> <efa...@gmx.de>, "Ingo Molnar"
> <mi...@kernel.org>, "Peter Anvin" <h...@zytor.com>, "vladimir murzin" 
> <vladimir.mur...@arm.com>,
> linux-tip-comm...@vger.kernel.org, jstan...@redhat.com, "Oleg Nesterov" 
> <o...@redhat.com>
> Sent: Friday, 11 December, 2015 12:39:59 PM
> Subject: Re: [tip:locking/core] sched/wait: Fix signal handling in bit wait 
> helpers
> 
> On Fri, Dec 11, 2015 at 03:30:33AM -0800, Paul Turner wrote:
> 
> > > Blergh, all I've managed to far is to confuse myself further. Even
> > > something like the original (+- the EINTR) should work when we consider
> > > the looping, even when mixed with an occasional spurious wakeup.
> > >
> > >
> > > int bit_wait()
> > > {
> > >         if (signal_pending_state(current->state, current))
> > >                 return -EINTR;
> > >         schedule();
> > > }
> 
> So I asked Vladimir to test that (simply changing the return from 1 to
> -EINTR) and it made his fail much less likely but it still failed in the
> same way.
> 
> So I'm fairly sure I'm still missing something :/
> 
> > Hugh asked me about this after seeing a crash, here's another exciting
> > way in which the current code breaks -- this one actually quite
> > serious:
> 
> Yep, this got reported by Jan and I did kick myself for that.
> 
> > Peter's proposed follow-up above looks strictly more correct.  We need
> > to evaluate the potential existence of a signal, *after* we return
> > from schedule, but in the context of the state which we previously
> > _entered_ schedule() on.
> > 
> > Reviewed-by: Paul Turner <p...@google.com>
> 
> Right, its maybe a bit overkill, but at this point I'm a tad
> conservative/paranoid.
> 
> Vladimir, Jan could you both please that patch?
> 
>  lkml.kernel.org/r/20151208104712.gj6...@twins.programming.kicks-ass.net

This appears to exactly match patch I tested against v4.4-rc4 here:
  http://marc.info/?l=linux-mm&m=144950957622869&w=2

Anyway, I repeated the test with v4.4-rc4-113-g0bd0f1e as base.
Results look good. With patch applied, I can't trigger
"kernel BUG at mm/filemap.c:238!" anymore.

Regards,
Jan

> 
> 
> Thanks!
> 
> 
> 
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to