Re: Range length property

Cym13 via Digitalmars-d-learn Tue, 10 Apr 2018 15:11:20 -0700

On Tuesday, 10 April 2018 at 20:08:14 UTC, Jonathan M Davis wrote:

On Tuesday, April 10, 2018 19:47:10 Nordlöw via Digitalmars-d-learn wrote:
On Tuesday, 10 April 2018 at 14:34:40 UTC, Adam D. Ruppe wrote:
> On Tuesday, 10 April 2018 at 14:25:52 UTC, Nordlöw wrote:
>> Should ranges always provide a length property?
>
> No.
>
>> If so, in which cases is a length property an advantage or a requirement?
>
> Just provide it whenever it is cheap to do so. If you need to do complex calculations or especially loop over contents to figure out the length, do NOT provide it.
>
> But if it is as simple as returning some value, provide it and algorithms can take advantage of it for optimizations etc. as needed.
I'm thinking of my own container Hashmap having its rangeByKeyValue requiring one extra word of memory to store the iteration count which, in turn, can be used to calculate the length of the remaining range. Is this motivated?
That would depend entirely on what you're trying to do, but in general, if a range has length, then some algorithms will be more efficient, and some algorithms do require length. So, if you can provide length, then the range will be more useful, just like a bidirectional range can be more useful than a forward range or a random-access range can be more useful than either. However, if you're not doing anything that ever benefits from it having length, then it doesn't buy you anything. So, it ultimately depends on what you're doing. In a general purpose library, I'd say that it should have length if it can do so in O(1), but if it's just for you, then it may or may not be worth it.
The other thing to consider is what happens when the container is mutated. I don't think that ranges necessarily behave all that well when an underlying container is mutated, but it is something that has to be considered when dealing with a range over a container. Even if mutating the underlying container doesn't necessarily invalidate a range, maintaining the length in the manner that you're suggesting probably makes it so that it would be invalidated in more cases, since if any elements are added or removed in the portion that was already popped off the range, then the iteration count couldn't be used to calculate the length in the same way anymore. Now, with a hashmap, the range is probably fully invalidated when anything gets added or removed anyway, since that probably screws with the order of the elements in the range, but how the range is going to behave when the underlying container is mutated and how having the length property does or doesn't affect that is something that you'll need to consider.
- Jonathan M Davis

I find that discussion very interesting, as I had never considered that, because of design by introspection, having a costly length method would lead to unexpected calls by generic algorithms, making it a disadvantage if present.
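To make the introspection point concrete, here is a minimal sketch of how a generic algorithm picks up `length` without the caller asking for it. `Countdown`, `CountdownNoLength` and `elementCount` are hypothetical names made up for illustration; `hasLength`, `isInputRange` and `walkLength` are the real Phobos symbols from `std.range.primitives`.

```d
import std.range.primitives : hasLength, isInputRange, walkLength;

// Hypothetical range that knows its length in O(1).
struct Countdown
{
    size_t remaining;
    @property bool empty() const { return remaining == 0; }
    @property size_t front() const { return remaining; }
    void popFront() { --remaining; }
    @property size_t length() const { return remaining; }
}

// Same range, but without a length property.
struct CountdownNoLength
{
    size_t remaining;
    @property bool empty() const { return remaining == 0; }
    @property size_t front() const { return remaining; }
    void popFront() { --remaining; }
}

static assert(isInputRange!Countdown && hasLength!Countdown);
static assert(isInputRange!CountdownNoLength && !hasLength!CountdownNoLength);

// Hypothetical generic algorithm: it inspects the range type at compile
// time and calls `length` whenever one exists, cheap or not.
size_t elementCount(R)(R r)
if (isInputRange!R)
{
    static if (hasLength!R)
        return r.length;   // chosen silently by introspection
    else
    {
        size_t n;
        for (; !r.empty; r.popFront())
            ++n;
        return n;
    }
}
```

Phobos' own `walkLength` makes the same kind of `hasLength` check, so a range with an expensive `length` would pay that cost every time such an algorithm decides to use it.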

On the other hand, I don't think the end user should have to scratch his head to find the length of a range, especially if it's not trivial to get (say, O(log n) kind of case). Therefore exposing a method in any case seems the best from an API perspective.

But to avoid the performance issues mentioned earlier, it means it should bear a different name (get/setLength comes to mind). I believe this is the same kind of issue that led to having "in" for associative arrays but not regular ones. However, this also leads to less coherent APIs, in contradiction with the principle of least surprise.
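For readers less familiar with the "in" comparison, a small sketch: the `in` operator is built in for associative arrays, where the lookup is cheap, while a linear search in a regular array has to be spelled out with a differently named function such as `canFind`. The helper names `hasKey` and `hasValue` are hypothetical, used only for illustration.

```d
import std.algorithm.searching : canFind;

// Cheap hash lookup: `in` is provided and yields a pointer to the value
// (null when the key is absent).
bool hasKey(int[string] aa, string key)
{
    return (key in aa) !is null;
}

// Linear search: no `in` for regular arrays, so the O(n) cost is
// visible through a distinct name.
bool hasValue(int[] arr, int value)
{
    return arr.canFind(value);
}
```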

In retrospect, since only "unexpected" calls to such methods cause the issue, I wonder if it wouldn't be best to have a UDA saying "Hey, please, this method is costly, if you're a generic template performing introspection you should probably not call me". And writing that, Andrei's work on complexity annotations comes to mind. Anyway, I don't think the user should use different names just to alleviate an issue on the library side, but the alternative would be costly to put in place...
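A rough sketch of what such a UDA could look like, purely as an illustration of the idea: the `costly` attribute, the `hasCheapLength` trait and the `ListRange` type are all hypothetical and nothing like them exists in Phobos. Only `hasLength`, `hasUDA` and `isAggregateType` are real library symbols.

```d
import std.range.primitives : hasLength;
import std.traits : hasUDA, isAggregateType;

// Hypothetical marker attribute: "this member is expensive to call".
struct costly {}

struct Node { int value; Node* next; }

// Hypothetical range over a singly linked list; its length needs a full walk.
struct ListRange
{
    Node* head;
    @property bool empty() const { return head is null; }
    @property int front() const { return head.value; }
    void popFront() { head = head.next; }

    @costly @property size_t length() const
    {
        size_t n;
        for (const(Node)* p = head; p !is null; p = p.next)
            ++n;
        return n;
    }
}

// Hypothetical trait a generic algorithm could test instead of hasLength:
// true only when a length exists and is not marked @costly.
template hasCheapLength(R)
{
    static if (!hasLength!R)
        enum hasCheapLength = false;
    else static if (isAggregateType!R)
        enum hasCheapLength = !hasUDA!(__traits(getMember, R, "length"), costly);
    else
        enum hasCheapLength = true; // built-in arrays: length is O(1)
}

static assert(hasLength!ListRange);       // users can still call .length
static assert(!hasCheapLength!ListRange); // generic code is told to avoid it
static assert(hasCheapLength!(int[]));
```

The user keeps the familiar `length` name, but every generic algorithm would have to switch from `hasLength` to the new trait, which is the "costly to put in place" part.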


Any thoughts?
