On Wednesday, 18 February 2015 at 23:11:07 UTC, Vlad Levenfeld wrote:
> On Wednesday, 18 February 2015 at 16:10:30 UTC, Pasqui23 wrote:
>> No. The main strength of the range API is its ability to preserve range
>> categories. The loss of range categories is a *price* we pay for lazy
>> evaluation; the categories can be restored by .array, in exchange for the
>> usual price of a dynamic allocation.
> The argument naturally extends to ContiguousRanges:
> The loss of contiguity is the price paid for lazy evaluation (note that
> retro is lazily evaluated), and the result of .array is always contiguous.
The point is that you want to pay as little as possible for the most gain :)
>> You are describing ContiguousRange as a type implicitly convertible to T[]
> Not implicitly, but otherwise this may be a much nicer way to define
> ContiguousRange than what I had proposed.
> I see a ContiguousRange as an underlying T[] with user-defined operations
> and constraints, e.g. a Bitmap, or an AudioClip. This would enable a
> generic function to operate on custom ranges without losing information by
> decaying the argument into a T[]. For example, you could perform some
> generic search operation on an AudioClip and return a slice while
> preserving the AudioClip's sampling rate.
My proposal and yours are not mutually exclusive. A range which can be cast
to T[] is automatically considered contiguous, but not all contiguous ranges
are castable to T[] (see for example BinaryHeap!(T[])).
As an aside, I prefer the implicit cast, as it eases the burden on both
range authors and algorithm authors. Range authors just have to add

    T[] array;
    alias array this;

and algorithms just need to accept T[] (or R : T[] if the algorithm returns
the range parameter).
>>> For an array r, is r.retro contiguous or not?
>> I would say yes. In general, a good rule of thumb would be: can this
>> range be copied via memcpy? Then preserve contiguity.
> The problem with this rule is illustrated with an example:
>
>     void copy (R1, R2)(R1 src, R2 tgt)
>     out {
>         foreach (i; 0..min (src.length, tgt.length))
>             assert (tgt[i] == src[i]);
>     }
>     body {
>         memcpy (tgt.ptr, src.ptr,
>             ElementType!R1.sizeof * min (src.length, tgt.length));
>     }
>
>     auto s = [0,1,2,3,4,5];
>     auto r = [9,8,7,6,5,4].retro;
>     s.copy (r);
>
> Originally, r == [4,5,6,7,8,9].
> What's the value of r at the end of this program?
> If *r.ptr == 4, then we have r == [0,5,6,7,8,9] and a segfault if we're
> lucky.
> If *r.ptr == 9, then we have r == [5,4,3,2,1,0] and a failing
> postcondition.
> There is no scalable, generic way to satisfy the postcondition and get our
> expected behavior, i.e. r == [0,1,2,3,4,5].
The second interpretation (*r.ptr == 9) is the correct one.
You are raising an important point: the optimization I gave for copy is
valid only if r1 and r2 are of the same type (in fact, I gave it the
signature

    void copy(R)(R r1, R r2) if (isContiguousRange!R); // only one range type

), as different types indicate different access implementations. In general,
the use of contiguous ranges forces the user to check that the source and
destination ranges are of the same type, and this can be a source of errors.
>> Optimized data transfers don't care at all that the bytes are in order,
>> only that the data are in the slice you gave them.
> I have to disagree with this. While the transfer itself is free to occur
> out-of-order, if the result is out of order, then the program is most
> likely incorrect.
memcpy(dest, src, len) in general gives undefined behaviour if src and dest
are of different types or their size differs from len; the same is true of
contiguous ranges (that's why toContiguous is an explicit cast returning
void[]).
>> My proposal allows a simple extension to multiple dimensions. If r is an
>> n-dimensional range, then the following must hold true:
>>
>>     void[] s = r.toContiguous;
>>     foreach (i1; r) foreach (i2; i1) ... foreach (iN; iNprev)
>>         assert (s.ptr <= &iN && &iN < s.ptr + s.length);
> So, the catch here is illustrated with another example:
> Consider a 2D array in row-major order (where [i,j] is adjacent to
> [i,j+1]). Here, slicing along the rows will yield a ContiguousRange
> r[x..y, 0..$], but r[x..y, z..w] where z > 0 || w < r.length will be
> non-contiguous at the column boundaries. There's contiguity hiding in
> there, but it can't be leveraged.
> This could be solved by considering r[x..y, z..w] as a RandomAccessRange
> of ContiguousRanges, which would imply that r[a,b] is equivalent to
> r[a][b].
> On the other hand, it could be considered a RandomAccessRange which yields
> ContiguousRanges under 1-dimensional slicing... more verbose, but avoids
> the former interpretation's implications.
The first implication (r[a,b] == r[a][b]) is handled by checking that r is a
contiguous range; the second (r[a,b] != r[a][b]) by checking whether r[a] is
itself contiguous.
> But, like I said, extending range concepts to multiple dimensions is
> tricky. I've been working on the problem on my own for the last few
> months, and so far my solution has been to define linear spaces which
> convert to ranges under certain conditions.
This can be compatible with my proposal: both the 'linear space' and the
individual ranges are contiguous.
My proposal, in general terms, aims to allow containers to be copied by
memcpy when possible and safe, and more generally to call C functions
expecting untyped arrays (that's why toContiguous returns void[]), like the
splice API on Linux.
Yours aims to safely call C functions which expect arrays of a given type.
Those are two use cases which are more different than I gave them credit
for, and so our proposals are orthogonal, maybe even complementary, as I've
already said.