On Tue, 5 Feb 2019, Mike Snitzer wrote:
> On Tue, Feb 05 2019 at 5:09am -0500,
> Mikulas Patocka <[email protected]> wrote:
>
> > Hi
> >
> > Please submit patch this to Linus before 5.0 is released.
> >
> > Mikulas
> >
> >
> >
> > waitqueue_active without preceding barrier is unsafe, see the comment
> > before waitqueue_active definition in include/linux/wait.h.
> >
> > This patch changes it to wq_has_sleeper.
> >
> > Signed-off-by: Mikulas Patocka <[email protected]>
> > Fixes: 6f75723190d8 ("dm: remove the pending IO accounting")
> >
> > ---
> > drivers/md/dm.c | 2 +-
> > 1 file changed, 1 insertion(+), 1 deletion(-)
> >
> > Index: linux-2.6/drivers/md/dm.c
> > ===================================================================
> > --- linux-2.6.orig/drivers/md/dm.c 2019-02-04 20:18:03.000000000 +0100
> > +++ linux-2.6/drivers/md/dm.c 2019-02-04 20:18:03.000000000 +0100
> > @@ -699,7 +699,7 @@ static void end_io_acct(struct dm_io *io
> > true, duration, &io->stats_aux);
> >
> > /* nudge anyone waiting on suspend queue */
> > - if (unlikely(waitqueue_active(&md->wait)))
> > + if (unlikely(wq_has_sleeper(&md->wait)))
> > wake_up(&md->wait);
> > }
> >
>
> This could be applicable to dm-rq.c:rq_completed() too...
I don't know - it depends if the I/O counters are already protected by
some other lock or serializing instruction. If not, then this is broken
too.
> but I'm not following where you think we benefit from adding the
> smp_mb() to end_io_acct() please be explicit about your concern.
end_io_acct does:
decrease the percpu counter
test if the waitqueue is active
if active, wake up
the CPU can reorder it to:
test if the waitqueue is active
decrease the percpu counter
if active, wake up
now, we can have two CPUs racing in this way:
CPU1: test if the waitqueue is active - returns false
CPU2: it sees that the sum of the counters is not zero
CPU2: it adds itself to the waitqueue
CPU1: decrease the percpu counter - now the sum is zero
CPU1: not calling wake_up, because it already tested the waitqueue and
there was no process waiting on it
CPU2: keeps waiting on the waitqueue - deadlock
> Focusing on bio-based DM, your concern is end_io_acct()'s wake_up() will
> race with its, or some other cpus', preceding generic_end_io_acct()
> percpu accounting?
> - and so dm_wait_for_completion()'s !md_in_flight() condition will
> incorrectly determine there is outstanding IO due to end_io_acct()'s
> missing smp_mb()?
> - SO dm_wait_for_completion() would go back to top its loop and may
> never get woken up again via wake_up(&md->wait)?
>
> The thing is in both callers that use this pattern:
>
> if (unlikely(waitqueue_active(&md->wait)))
> wake_up(&md->wait);
>
> the condition (namely IO accounting) will have already been updated via
> generic_end_io_acct() (in terms of part_dec_in_flight() percpu updates).
> So to me, using smp_mb() here is fairly pointless when you consider the
> condition that the waiter (dm_wait_for_completion) will be using is
> _not_ the byproduct of a single store.
>
> Again, for bio-based DM, block core is performing atomic percpu updates
> across N cpus. And the dm_wait_for_completion() waiter is doing percpu
> totalling via md_in_flight_bios().
Without locks or barriers, the CPU can reorder reads and writes
arbitrarily. You can argue about ordering of memory accesses, but other
CPUs may see completely different order.
> Could be there is still an issue here.. but I'm not quite seeing it.
> Cc'ing Jens to get his thoughts.
>
> Thanks,
> Mike
Mikulas
--
dm-devel mailing list
[email protected]
https://www.redhat.com/mailman/listinfo/dm-devel