On Tue, Dec 12, 2017 at 11:31:02AM +0100, Ingo Molnar wrote:
> 
> * Thomas Gleixner <t...@linutronix.de> wrote:
> 
> > On Fri, 8 Dec 2017, Darren Hart wrote:
> > 
> > > From: "Darren Hart (VMware)" <dvh...@infradead.org>
> > > 
> > > uaddr alignment is currently tested by get_futex_key(). We can catch
> > > misalignment earlier in sys_futex and return -EINVAL sooner. This
> > > simplifies get_futex_key() a little, but more importantly exits the
> > > kernel as soon as an invalid parameter is detected.
> > > 
> > > Passes all selftests/futex testcases on a dual socket Xeon E5-2670, 16
> > > physical cores total, 32 threads total.
> > > 
> > > Signed-off-by: Darren Hart (VMware) <dvh...@infradead.org>
> > > Cc: Thomas Gleixner <t...@linutronix.de>
> > > Cc: Ingo Molnar <mi...@redhat.com>
> > > Cc: Peter Zijlstra <pet...@infradead.org>
> > > Cc: Darren Hart <dvh...@infradead.org>
> > > ---
> > >  kernel/futex.c | 7 +++++--
> > >  1 file changed, 5 insertions(+), 2 deletions(-)
> > > 
> > > diff --git a/kernel/futex.c b/kernel/futex.c
> > > index 76ed592..c3ee6c4 100644
> > > --- a/kernel/futex.c
> > > +++ b/kernel/futex.c
> > > @@ -509,8 +509,6 @@ get_futex_key(u32 __user *uaddr, int fshared, union 
> > > futex_key *key, int rw)
> > >    * The futex address must be "naturally" aligned.
> > >    */
> > >   key->both.offset = address % PAGE_SIZE;
> > > - if (unlikely((address % sizeof(u32)) != 0))
> > > -         return -EINVAL;
> > >   address -= key->both.offset;
> > >  
> > >   if (unlikely(!access_ok(rw, uaddr, sizeof(u32))))
> > > @@ -3525,6 +3523,11 @@ SYSCALL_DEFINE6(futex, u32 __user *, uaddr, int, 
> > > op, u32, val,
> > >   u32 val2 = 0;
> > >   int cmd = op & FUTEX_CMD_MASK;
> > >  
> > > + /* Only allow for aligned uaddr variables */
> > > + if (unlikely((unsigned long)uaddr % sizeof(u32) != 0 ||
> > > +              (unsigned long)uaddr2 % sizeof(u32) != 0))
> > 
> > Errm. How is that supposed to work? uaddr2 is not used by all opcodes.....
> 
> So to explain the curious timing of the mails from Thomas and me: I told 
> Thomas 
> about the breakage over IRC and he found the likely bug! ;-)

My thinking had been that while uaddr2 is not use by all the op-codes,
it is not ever used as anything but a userspace address and when it
isn't used, it shouldn't be getting garbage passed in.

So if it's failing due to a uaddr2 not being aligned for an op-code that
doesn't use it... that would have to be on FUTEX_WAIT*, FUTEX_WAKE,
FUTEX_WAKE_BITSET, or FUTEX_*_PI.

If it's failing on distro boot, the PI ops are unlikely candidates.

I'm curious what a valid use of this would be, and why my own tests
didn't catch it.

I can move the uaddr2 test under a conditional on the cmd for now. Would
that be acceptable?

Then I can add a WARNON for the other ops for my own testing, but not
for upstream.

-- 
Darren Hart
VMware Open Source Technology Center

Reply via email to