Re: improving the join function

Steven Schveighoffer Wed, 13 Oct 2010 12:05:39 -0700

On Mon, 11 Oct 2010 20:33:27 -0400, Andrei Alexandrescu<[email protected]> wrote:

I'm looking at http://d.puremagic.com/issues/show_bug.cgi?id=3313 and that got me looking at std.string.join, which currently has the sig:
string join(in string[] words, string sep);

A narrow fix:

Char[] join(Char)(in Char[][] words, in Char[] sep)
if (isSomeChar!Char);
I think it's reasonable to assume that people would want to join things that aren't necessarily arrays of characters, so T could be pretty much any type. An obvious step towards generalization is:
T[] join(T)(in T[][] items, T[] sep);

This doesn't quite work if T is not a value type (actually, I think it does, but only because there are bugs in the compiler).

But join doesn't really need random access for words - really, an input range should suffice. So a generally useful join, almost worth putting in std.algorithm, would be:
ElementType!R1[] join(R1, R2)(R1 items, R2 sep)
if (isInputRange!R1 && isForwardRange!R2
     && is(ElementType!R2 : ElementType!R1));
Notice how the separator must be a forward range because it gets spanned multiple times, whereas the items need only be an input range as they are spanned once. This is at the same time a very general and very precise interface.
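
For illustration only, here is a minimal sketch of how such a range-based join might be written. The name joinSketch is hypothetical (chosen to avoid clashing with the existing join), and the sketch spells the result type as ElementType!(ElementType!R1)[] on the assumption that items is a range of ranges. It is exercised here only with int arrays; it is not the Phobos implementation.

```d
import std.array : appender;
import std.range;

// Hypothetical helper, not part of Phobos: joins a range of ranges into an array.
ElementType!(ElementType!R1)[] joinSketch(R1, R2)(R1 items, R2 sep)
if (isInputRange!R1 && isInputRange!(ElementType!R1) && isForwardRange!R2
    && is(ElementType!R2 : ElementType!(ElementType!R1)))
{
    auto result = appender!(ElementType!(ElementType!R1)[])();
    bool first = true;
    foreach (item; items)          // items is traversed exactly once
    {
        if (!first)
        {
            // sep is traversed once per gap, hence the forward-range
            // requirement: save yields a fresh copy each time.
            foreach (e; sep.save)
                result.put(e);
        }
        first = false;
        foreach (e; item)
            result.put(e);
    }
    return result.data;
}

unittest
{
    assert(joinSketch([[1, 2], [3], [4, 5]], [0]) == [1, 2, 0, 3, 0, 4, 5]);
}
```

The separator is only written between items, so a single item comes back unchanged and an empty items range yields an empty array.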

I think this is fine. Note that this does not take into account the constancy of items, meaning it is legal for this function to mess with the original data in items.

Not that I think it's a bad thing, but it does lose some guarantees as compared to the original join. inout can't be used here because it doesn't work as a template parameter.

One thing is still bothering me: the array output type. Why would the "default" output range be an array? What can be done to make join() at the same time a general function and also one that works for strings the way the old join did? For example, if I want to join things into an already-existing buffer, or if I want to write them straight to a file, there's no way to do so without having an array allocation in the loop. I have a couple of ideas but I wouldn't want to bias yours.

Well, one could have a version of join that takes an output range. It would have to return the output range instead of the *result* of the output range. And in that case, the standard join which returns an array can be implemented:


ElementType!R1[] join(R1 items, R2 sep) ...
{
   // forward the arguments plus a fresh Appender, then take its array
   return join(items, sep, Appender!(ElementType!R1[])()).data;
}
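
To make the idea concrete, here is a rough sketch under stated assumptions. The names joinInto and joinToArray are hypothetical, the element type is taken to be ElementType!(ElementType!R1), and the example is only exercised with int arrays. It shows the output-range overload returning the sink, and an array-returning wrapper layered on top of it.

```d
import std.array : appender;
import std.range;

// Hypothetical helper: writes the joined elements into any output range
// and returns that output range, so the caller can extract the result.
Sink joinInto(R1, R2, Sink)(R1 items, R2 sep, Sink sink)
if (isInputRange!R1 && isForwardRange!R2
    && isOutputRange!(Sink, ElementType!(ElementType!R1)))
{
    bool first = true;
    foreach (item; items)
    {
        if (!first)
            foreach (e; sep.save)   // re-traverse the separator per gap
                put(sink, e);
        first = false;
        foreach (e; item)
            put(sink, e);
    }
    return sink;
}

// Hypothetical array-returning wrapper built on the overload above.
auto joinToArray(R1, R2)(R1 items, R2 sep)
{
    alias E = ElementType!(ElementType!R1);
    return joinInto(items, sep, appender!(E[])()).data;
}

unittest
{
    assert(joinToArray([[1, 2], [3]], [0]) == [1, 2, 0, 3]);

    // Joining onto an already-existing buffer.
    auto buf = appender!(int[])();
    buf.put(9);
    assert(joinInto([[1], [2]], [0], buf).data == [9, 1, 0, 2]);
}
```

Returning the sink matters because it is passed by value: the caller reads the result from the returned range rather than from its own copy.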

I also have a question from people who dislike Phobos. Was there a point in the changes of signature above where you threw your hands thinking, "do the darn string version already and cut all that crap!"?

It's not a problem with phobos, it's a problem with documentation. There is a fundamental issue with documenting complex templates which makes function signatures very difficult to understand. The doc generator can and should simplify things, and I think at some point we should address this. In other words, the signature should be transformed into a form that makes it easy to see that it's the same as string[] join(string[][], string[]).


-Steve
