Hello,

On Mon, Aug 06, 2012 at 03:08:37PM -0700, Kent Overstreet wrote:
>  /**
> + * bio_split - split a bio
> + * @bio:     bio to split
> + * @sectors: number of sectors to split from the front of @bio
> + * @gfp:     gfp mask
> + * @bs:              bio set to allocate from
> + *
> + * Allocates and returns a new bio which represents @sectors from the start 
> of
> + * @bio, and updates @bio to represent the remaining sectors.
> + *
> + * If bio_sectors(@bio) was less than or equal to @sectors, returns @bio
> + * unchanged.

Umm.... I don't know.  This is rather confusing.  The function may
return new or old bios?  What's the rationale behind it?  Return
ERR_PTR(-EINVAL) instead?

> + *
> + * The newly allocated bio will point to @bio's bi_io_vec, if the split was 
> on a
> + * bvec boundry; it is the caller's responsibility to ensure that @bio is not
> + * freed before the split.

This is somewhat error-prone.  Given how splits are used now, this
might not be a big issue but it isn't difficult to imagine how this
could go subtly wrong.  More on this.

> + *
> + * BIG FAT WARNING:
> + *
> + * If you're calling this from under generic_make_request() (i.e.
> + * current->bio_list != NULL), you should mask out __GFP_WAIT and punt to
> + * workqueue if the allocation fails. Otherwise, your code will probably
> + * deadlock.

If the condition is detectable, WARN_ON_ONCE() please.

> + * You can't allocate more than once from the same bio pool without 
> submitting
> + * the previous allocations (so they'll eventually complete and deallocate
> + * themselves), but if you're under generic_make_request() those previous
> + * allocations won't submit until you return . And if you have to split bios,
                                               ^
                                               extra space
> + * you should expect that some bios will require multiple splits.
> + */
> +struct bio *bio_split(struct bio *bio, int sectors,
> +                   gfp_t gfp, struct bio_set *bs)
> +{
> +     unsigned idx, vcnt = 0, nbytes = sectors << 9;
> +     struct bio_vec *bv;
> +     struct bio *ret = NULL;
> +
> +     BUG_ON(sectors <= 0);
> +
> +     if (sectors >= bio_sectors(bio))
> +             return bio;
> +
> +     trace_block_split(bdev_get_queue(bio->bi_bdev), bio,
> +                       bio->bi_sector + sectors);
> +
> +     bio_for_each_segment(bv, bio, idx) {
> +             vcnt = idx - bio->bi_idx;
> +
> +             if (!nbytes) {
> +                     ret = bio_alloc_bioset(gfp, 0, bs);
> +                     if (!ret)
> +                             return NULL;
> +
> +                     ret->bi_io_vec = bio_iovec(bio);
> +                     ret->bi_flags |= 1 << BIO_CLONED;
> +                     break;
> +             } else if (nbytes < bv->bv_len) {
> +                     ret = bio_alloc_bioset(gfp, ++vcnt, bs);
> +                     if (!ret)
> +                             return NULL;
> +
> +                     memcpy(ret->bi_io_vec, bio_iovec(bio),
> +                            sizeof(struct bio_vec) * vcnt);
> +
> +                     ret->bi_io_vec[vcnt - 1].bv_len = nbytes;
> +                     bv->bv_offset   += nbytes;
> +                     bv->bv_len      -= nbytes;
> +                     break;
> +             }

Ummm... ISTR reviewing this code and getting confused by bio_alloc
inside bio_for_each_segment() loop and commenting something about
that.  Yeah, this one.

  http://thread.gmane.org/gmane.linux.kernel.device-mapper.devel/15790/focus=370

So, I actually have reviewed this but didn't get any response and
majority of the issues I raised aren't addressed and you sent the
patch to me again?  What the hell, Kent?

> +
> +             nbytes -= bv->bv_len;
> +     }
> +
> +     ret->bi_bdev    = bio->bi_bdev;
> +     ret->bi_sector  = bio->bi_sector;
> +     ret->bi_size    = sectors << 9;
> +     ret->bi_rw      = bio->bi_rw;
> +     ret->bi_vcnt    = vcnt;
> +     ret->bi_max_vecs = vcnt;
> +     ret->bi_end_io  = bio->bi_end_io;

Is this safe?  Why isn't this chaining completion of split bio to the
original one?

Thanks.

-- 
tejun
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to