Re: Difference between input range and forward range

Jonathan M Davis via Digitalmars-d Tue, 10 Nov 2015 10:57:02 -0800

On Tuesday, 10 November 2015 at 16:33:02 UTC, Ur@nuz wrote:

I agree with these considerations. When I define a non-copyable range (with the postblit disabled), a lot of standard Phobos functions fail to compile instead of using the *save* method. So the logical question is in which cases we should use a plain old struct copy and when we should use *save* on forward ranges.
Another good question is whether input ranges should be copyable at all (or for which types of ranges copying makes sense). A good example is a network socket as an input range: we can't save the state of the socket stream and get consumed data again, so I think copying such a range is meaningless (in my opinion). If we want to pass it somewhere, it's better to pass it by reference.

Passing by reference really doesn't work with ranges. Consider that most range-based functions are lazy and wrap the range that they're given in a new range. e.g.


auto r = filter!pred(range);

or

auto r = map!func(range);

The range has to be copied for that to work. And even if you could make it so that the result of functions like map or filter referred to the original range by reference, their return value would not be returned by ref, so if a function required that its argument be passed by ref, then you couldn't chain it. So, requiring that ranges be passed by ref would pretty much kill function chaining.
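
A minimal sketch of the chaining problem, assuming default compiler settings. The helpers sumByRef and sumByValue are hypothetical names made up for this illustration, not Phobos functions:

```d
import std.algorithm.iteration : filter, map;
import std.range.primitives : empty, front, isInputRange, popFront;

// Hypothetical helper that insists on receiving its range by ref.
int sumByRef(R)(ref R r) if (isInputRange!R)
{
    int total = 0;
    for (; !r.empty; r.popFront())
        total += r.front;
    return total;
}

// The same helper taking the range by value, as range-based functions usually do.
int sumByValue(R)(R r) if (isInputRange!R)
{
    int total = 0;
    for (; !r.empty; r.popFront())
        total += r.front;
    return total;
}

unittest
{
    int[] arr = [1, 2, 3, 4];

    // The result of map is an rvalue, so it cannot bind to a ref parameter.
    static assert(!__traits(compiles, sumByRef(arr.map!(a => a * 2))));

    // Naming the intermediate range works, but it breaks the chain.
    auto doubled = arr.map!(a => a * 2);
    assert(sumByRef(doubled) == 20);

    // By value, the whole pipeline can be written as one expression.
    assert(sumByValue(arr.filter!(a => a % 2 == 0).map!(a => a * 2)) == 12);
}
```

The unittest block can be run with something like `rdmd -main -unittest file.d`.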

Also, passing a range somewhere in order to access it in two different places simultaneously is a bad idea. It looks like we have the current approach with the range postblit constructor and +save+ because we have it for structs and it works somehow (so far) for trivial cases. But we don't have clear intentions about how it should really work.

It's mostly clear, but it isn't necessarily straightforward to get it right. If you want to duplicate a range, then you _must_ use save. Copying a range by assigning it to another range is not actually copying it per the range API. You pretty much have to consider it a move and consider the original unusable after the copy.

The problem is that for arrays and many of the common ranges, copying the range and calling save are semantically the same, so it's very easy to write code which assumes that behavior and then doesn't work with other types of ranges. That's why it's critical to test range-based functions with a variety of range types - particularly reference types in addition to value types or dynamic arrays.
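
To make the difference concrete, here is a small sketch. The class Counter is a hypothetical reference-type forward range written only for this example:

```d
import std.range.primitives : empty, front, isForwardRange, popFront, save;

// Hypothetical reference-type forward range over an int[].
final class Counter
{
    private int[] data;

    this(int[] data) { this.data = data; }

    @property bool empty() const { return data.length == 0; }
    @property int front() const { return data[0]; }
    void popFront() { data = data[1 .. $]; }

    // save must produce an independent range, so it allocates a new object.
    @property Counter save() { return new Counter(data); }
}

static assert(isForwardRange!Counter);

unittest
{
    // Dynamic array: a plain copy happens to behave like save.
    int[] a = [1, 2, 3];
    auto aCopy = a;
    aCopy.popFront();
    assert(a.front == 1);      // the original is unaffected

    // Class: a plain copy is just another reference to the same object.
    auto c = new Counter([1, 2, 3]);
    auto cCopy = c;
    cCopy.popFront();
    assert(c.front == 2);      // the original advanced as well

    // save gives a real duplicate regardless of the range type.
    auto cSaved = c.save;
    cSaved.popFront();
    assert(c.front == 2);
    assert(cSaved.front == 3);
}
```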

Copying and passing ranges should also be specified as part of the range protocol, because it's a very common use case and shouldn't be ambiguous.

The semantics of copying a range depend heavily on how a range is implemented and cannot be defined in the general case:


auto copy = orig;

Dynamic arrays and classes will function fundamentally differently, and with structs, there are a variety of different semantics that that copy could have. What it ultimately comes down to is that while the range API can require that the copy be in the exact same state that the original was in, it can't say anything about the state of the original after the copy. Well-behaved range-based code has to assume that once orig has been copied, it is unusable. If the code wants to actually get a duplicate of the range, then it will have to use save, and the semantics of that _are_ well-defined and do not depend on the type of the range.
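
One way a struct can end up with "in between" copy semantics is sketched below. Buffered is a hypothetical input range invented for this example: it caches front by value but shares its read position through a pointer, so a copy is neither independent nor a pure alias:

```d
// Hypothetical struct range with mixed value and reference state.
struct Buffered
{
    private int[]* source;   // shared between all copies
    private int cached;      // per-copy cached front
    private bool done;       // per-copy end flag

    this(int[]* source)
    {
        this.source = source;
        prime();
    }

    // Pull the next element out of the shared source into the cache.
    private void prime()
    {
        if ((*source).length == 0)
        {
            done = true;
            return;
        }
        cached = (*source)[0];
        *source = (*source)[1 .. $];
    }

    @property bool empty() const { return done; }
    @property int front() const { return cached; }
    void popFront() { prime(); }
}

unittest
{
    int[] data = [1, 2, 3];
    auto orig = Buffered(&data);
    auto copy = orig;            // same state as orig at this point

    copy.popFront();
    assert(copy.front == 2);

    // orig still shows its stale cached element ...
    assert(orig.front == 1);
    // ... and its next popFront skips the element that copy consumed.
    orig.popFront();
    assert(orig.front == 3);
}
```

This is why orig has to be treated as unusable once it has been copied: what it does afterwards depends entirely on implementation details of the range.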

Also, since a range can be a class object, we must consider how such ranges should behave.

There's really nothing to consider here. It's known how they should behave. There's really only one way that they _can_ behave. One of the main reasons that save exists is because of classes. While copying a dynamic array or many struct types is equivalent to save, it _can't_ be equivalent with a class. When you consider that fact, the required behavior of ranges pretty much falls into place on its own. We may very well need to be far clearer about what those semantics are and how that affects best practices, but there really isn't much (if any) wiggle room in what the range API does and doesn't guarantee and how it should be used. The problem is whether it's _actually_ used that way.

If a range-based function is tested with a variety of range types - dynamic arrays, value types, reference types, etc. - then it becomes clear very quickly when calls to save are required and how the function must be written to work for all of those range types. But far too often, range-based functions are tested with dynamic arrays and a few struct range types that wrap dynamic arrays, and bugs with regards to reference type ranges are not found. So, there's almost certainly a lot of range-based code out there that works fantastically with dynamic arrays but would fail miserably with a number of other range types.
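
A sketch of the kind of bug that only shows up with reference types. ClassRange, sumTwiceBuggy and sumTwice are hypothetical names used purely for illustration:

```d
import std.range.primitives : empty, front, isForwardRange, popFront, save;

// Hypothetical reference-type forward range used as a test fixture.
final class ClassRange
{
    private int[] data;

    this(int[] data) { this.data = data; }

    @property bool empty() const { return data.length == 0; }
    @property int front() const { return data[0]; }
    void popFront() { data = data[1 .. $]; }
    @property ClassRange save() { return new ClassRange(data); }
}

// Buggy: assumes that a plain copy is an independent duplicate.
int sumTwiceBuggy(R)(R r) if (isForwardRange!R)
{
    int total = 0;
    auto copy = r;   // for a class, this is the same object as r
    for (; !copy.empty; copy.popFront())
        total += copy.front;
    for (; !r.empty; r.popFront())   // r may already be exhausted here
        total += r.front;
    return total;
}

// Correct: asks for a duplicate explicitly via save.
int sumTwice(R)(R r) if (isForwardRange!R)
{
    int total = 0;
    auto dup = r.save;
    for (; !dup.empty; dup.popFront())
        total += dup.front;
    for (; !r.empty; r.popFront())
        total += r.front;
    return total;
}

unittest
{
    // With dynamic arrays, both versions appear to work.
    assert(sumTwiceBuggy([1, 2, 3]) == 12);
    assert(sumTwice([1, 2, 3]) == 12);

    // With a reference type, only the version using save is right.
    assert(sumTwiceBuggy(new ClassRange([1, 2, 3])) == 6);
    assert(sumTwice(new ClassRange([1, 2, 3])) == 12);
}
```

Testing with arrays alone would never reveal the difference between the two functions.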

For the most part, I think that it's pretty clear how ranges have to act and how they need to be used based on their API when you actually look at how the range API interacts with different types of ranges, but we often do not go much beyond dynamic arrays and miss out on some of the subtleties.

We really do need some good write-ups on ranges and their best practices. I've worked on that before but never managed to spend the time to finish it. Clearly, I need to fix that.


- Jonathan M Davis
