Peff,
On Fri, Jun 13, 2014 at 03:15:50AM -0400, Jeff King wrote:
> On Thu, Jun 12, 2014 at 04:46:37PM -0700, Jeremiah Mahler wrote:
>
> > > Although strbuf_set() does make the code a bit easier to read
> > > when strbufs are repeatedly re-used, re-using a variable for
> > > different purposes is generally considered poor programming
> > > practice. It's likely that heavy re-use of strbufs has been
> > > tolerated to avoid multiple heap allocations, but that may be a
> > > case of premature (allocation) optimization, rather than good
> > > programming. A different ("better") way to make the code more
> > > readable and maintainable may be to ban re-use of strbufs for
> > > different purposes.
> > >
> > > But I deleted it before sending because it's a somewhat tangential
> > > issue not introduced by your changes. However, I do see strbuf_set()
> > > as a Band-Aid for the problem described above, rather than as a useful
> > > feature on its own. If the practice of re-using strbufs (as a
> > > premature optimization) ever becomes taboo, then strbuf_set() loses
> > > its value.
> > >
> >
> > I am getting the feeling that I have mis-understood the purpose of
> > strbufs. It is not just a library to use in place of char*.
> >
> > If strbufs should only be added to and never reset a good test would be
> > to re-write builtin/remote.c without the use of strbuf_reset.
> >
> > builtin/remote.c does re-use the buffers. But it seems if a buffer is
> > used N times then to avoid a reset you would need N buffers.
> >
> > But on the other hand I agree with your comment that re-using a variable
> > for different purposes is poor practice.
> >
> > Now I am not even sure if I want my own patch :-)
>
> I think reusing buffers like remote.c does makes things uglier and more
> confusing than necessary, and probably doesn't have any appreciable
> performance gain. We are saving a handful of allocations, and have such
> wonderful variable names as "buf2" (when is it OK to reuse that one,
> versus regular "buf"?).
>
> A better reason I think is to reuse allocations across a loop, like:
>
> struct strbuf buf = STRBUF_INIT;
>
> for (i = 0; i < nr; i++) {
> strbuf_reset(&buf);
> strbuf_add(&buf, foo[i]);
> ... do something with buf ...
> }
> strbuf_release(&buf);
>
> You can write that as:
>
> for (i = 0; i < nr; i++) {
> struct strbuf buf = STRBUF_INIT;
> strbuf_add(&buf, foo[i]);
> ... do something ...
> strbuf_release(&buf);
> }
>
> and it is definitely still a case of premature optimization. But:
>
> 1. "nr" here may be very large, so the amortized benefits are greater
>
> 2. It's only one call to strbuf_reset to cover many items. Not one
> sprinkled every few lines.
>
> You'll note that strbuf_getline uses a "set" convention (making it
> different from the rest of strbuf) to handle this looping case.
>
> I don't have a problem with strbuf_set, but just peeking at remote.c, I
> think I'd rather see it cleaned up. It looks like one of the major uses
> is setting config variables. I wonder how hard it would be to make a
> git_config_set variant that took printf-style formats, and handled the
> strbuf itself.
>
> -Peff
Improving remote.c sounds like a better direction than adding set
operations. I will start looking in to it.
--
Jeremiah Mahler
[email protected]
http://github.com/jmahler
--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html