On Thu, Sep 14, 2017 at 09:56:56AM +0800, Ming Lei wrote:
> On Wed, Sep 13, 2017 at 11:37:20AM -0700, Omar Sandoval wrote:
> > On Mon, Sep 11, 2017 at 12:08:29PM +0800, Ming Lei wrote:
> > > On Sun, Sep 10, 2017 at 10:20:27AM -0700, Omar Sandoval wrote:
> > 
> > [snip]
> > 
> > > > What I mean is that you keep the same initialization above, but instead 
> > > > of
> > > >                 depth += nr
> > > > you do
> > > >                 depth = min_t(unsigned int, word->depth, sb->depth - 
> > > > scanned);
> > > > because like I said, the reasoning about why `+= nr` is okay in the
> > > > `sb->depth - scanned` case is subtle.
> > > > 
> > > > And maybe even replace the
> > > >                 scanned += depth;
> > > > with
> > > >                 scanned += min_t(unsigned int, word->depth - nr,
> > > >                                  sb->depth - scanned);
> > > > I.e., don't reuse the depth local variable for two different things. I'm
> > > > nitpicking here but this code is tricky enough as it is.
> > > 
> > > It wasn't reused in old version, just for saving one local variable, and
> > > one extra min_t().
> > > 
> > > Yeah, I admit it isn't clean enough.
> > > 
> > > > 
> > > > For completeness, I mean this exactly:
> > > > 
> > > >         while (1) {
> > > >                 struct sbitmap_word *word = &sb->map[index];
> > > >                 unsigned int depth;
> > > > 
> > > >                 scanned += min_t(unsigned int, word->depth - nr,
> > > >                                  sb->depth - scanned);
> > > >                 if (!word->word)
> > > >                         goto next;
> > > > 
> > > >                 depth = min_t(unsigned int, word->depth, sb->depth - 
> > > > scanned);
> > > 
> > > two min_t and a little code duplication.
> > 
> > They're similar but they represent different things, so I think trying
> > to deduplicate this code just makes it more confusing. If performance is
> > your concern, I'd be really surprised if there's a noticable difference.
> 
> No only one extra min_t(), also it isn't easy to read the code, since
> only in the first scan that 'depth' isn't same with 'depth', that is
> why I set the 1st 'scan' outside of the loop, then we can update 'scan'
> with 'depth' in every loop. People will be easy to follow the
> meaning.
> 
> > 
> > As a side note, I also realized that this code doesn't handle the
> > sb->depth == 0 case. We should change the while (1) to
> > while (scanned < sb->depth) and remove the
> > if (scanned >= sb->depth) break;
> 
> In the attached patch, I remember that the zero depth case is
> addressed by:
> 
>       if (start >= sb->depth)
>               return;
> 
> which is required since 'start' parameter is introduced in
> this patch.

I think the better way to handle this is

if (start >= sb->depth)
        start = 0;

Since the sbitmap may have gotten resized since the last time the user
called this and cached their start value.

> > 
> > > >                 off = index << sb->shift;
> > > >                 while (1) {
> > > >                         nr = find_next_bit(&word->word, depth, nr);
> > > >                         if (nr >= depth)
> > > >                                 break;
> > > > 
> > > >                         if (!fn(sb, off + nr, data))
> > > >                                 return;
> > > > 
> > > >                         nr++;
> > > >                 }
> > > > next:
> > > >                 if (scanned >= sb->depth)
> > > >                         break;
> > > >                 nr = 0;
> > > >                 if (++index >= sb->map_nr)
> > > >                         index = 0;
> > > >         }
> > > 
> > > The following patch switches to do{}while and handles the
> > > 1st scan outside of the loop, then it should be clean
> > > enough(no two min_t()), so how about this one?
> > 
> > I find this one subtler and harder to follow. The less it looks like the
> > typical loop pattern, the longer someone reading the code has to reason
> > about it.
> 
> Looks using 'depth' to update 'scanned' is easier to follow, than
> two min_t(), since it will make people easy to understand the relation
> between the two, then understand the whole code.

Honestly I prefer your original patch with a comment on depth += nr. I'd
be happy with the following incremental patch on top of your original v4
patch.

diff --git a/include/linux/sbitmap.h b/include/linux/sbitmap.h
index 2329b9e1a0e2..8d747048ae4f 100644
--- a/include/linux/sbitmap.h
+++ b/include/linux/sbitmap.h
@@ -218,7 +218,7 @@ typedef bool (*sb_for_each_fn)(struct sbitmap *, unsigned 
int, void *);
 
 /**
  * sbitmap_for_each_set() - Iterate over each set bit in a &struct sbitmap.
- * @off: Where to start the iteration
+ * @off: Where to start the iteration.
  * @sb: Bitmap to iterate over.
  * @fn: Callback. Should return true to continue or false to break early.
  * @data: Pointer to pass to callback.
@@ -230,11 +230,16 @@ static inline void __sbitmap_for_each_set(struct sbitmap 
*sb,
                                          unsigned int off,
                                          sb_for_each_fn fn, void *data)
 {
-       unsigned int index = SB_NR_TO_INDEX(sb, off);
-       unsigned int nr = SB_NR_TO_BIT(sb, off);
+       unsigned int index;
+       unsigned int nr;
        unsigned int scanned = 0;
 
-       while (1) {
+       if (off >= sb->depth)
+               off = 0;
+       index = SB_NR_TO_INDEX(sb, off);
+       nr = SB_NR_TO_BIT(sb, off);
+
+       while (scanned < sb->depth) {
                struct sbitmap_word *word = &sb->map[index];
                unsigned int depth = min_t(unsigned int, word->depth - nr,
                                           sb->depth - scanned);
@@ -243,6 +248,11 @@ static inline void __sbitmap_for_each_set(struct sbitmap 
*sb,
                if (!word->word)
                        goto next;
 
+               /*
+                * On the first iteration of the outer loop, we need to add the
+                * bit offset back to the size of the word for find_next_bit().
+                * On all other iterations, nr is zero, so this is a noop.
+                */
                depth += nr;
                off = index << sb->shift;
                while (1) {
@@ -254,9 +264,7 @@ static inline void __sbitmap_for_each_set(struct sbitmap 
*sb,
 
                        nr++;
                }
- next:
-               if (scanned >= sb->depth)
-                       break;
+next:
                nr = 0;
                if (++index >= sb->map_nr)
                        index = 0;
@@ -268,9 +276,6 @@ static inline void __sbitmap_for_each_set(struct sbitmap 
*sb,
  * @sb: Bitmap to iterate over.
  * @fn: Callback. Should return true to continue or false to break early.
  * @data: Pointer to pass to callback.
- *
- * This is inline even though it's non-trivial so that the function calls to 
the
- * callback will hopefully get optimized away.
  */
 static inline void sbitmap_for_each_set(struct sbitmap *sb, sb_for_each_fn fn,
                                        void *data)

Reply via email to