On Thursday, 2 August 2018 at 17:21:47 UTC, Steven Schveighoffer wrote:
On 8/2/18 12:21 PM, Jonathan Marler wrote:
On Monday, 30 July 2018 at 21:02:56 UTC, Steven Schveighoffer wrote:
Would it be a valid optimization to have D remove the requirement for allocation when it can determine that the entire data structure of the item in question is an rvalue, and would fit into the data pointer part of the delegate?

Here's what I'm looking at:

auto foo(int x)
{
   return { return x + 10; };
}

In this case, D allocates a pointer on the heap to hold "x", and then return a delegate which uses the pointer to read x, and then return that plus 10.

However, we could store x itself in the storage of the pointer of the delegate. This removes an indirection, and also saves the heap allocation.

Think of it like "automatic functors".

Does it make sense? Would it be feasible for the language to do this? The type system already casts the delegate pointer to a void *, so it can't make any assumptions, but this is a slight break of the type system.

The two requirements I can think of are:
1. The data in question must fit into a word
2. It must be guaranteed that the data is not going to be mutated (either via the function or any other function). Maybe it's best to require the state to be const/immutable.

I've had several cases where I was tempted to not use delegates because of the allocation cost, and simply return a specialized struct, but it's so annoying to do this compared to making a delegate. Plus something like this would be seamless with normal delegates as well (in case you do need a real delegate).


I think the number of cases where you could optimize this is very small.  And the complexity of getting the compiler to analyze cases to determine when this is possible would be very large.

It's not that complicated, you just have to analyze how much data is needed from the context inside the delegate. First iteration, all of the data has to be immutable, so it should be relatively straightforward.

After thinking about it more I suppose it wouldn't be that complicated to implement. For delegate literals, you already need to gather a list of all the data you need to put on the heap, and if it can all fit inside a pointer, then you can just put it there instead.

On that note, I think if a developer wants to be sure that this optimization occurs in their code, they should explicitly use a library solution like the one in Ocean or the one I gave. If a developer relies on the optimization, then when it doesn't work you won't get any information as to why it couldn't perform the optimization (i.e. some data was mutable or were not r-values). Depending on the code, this failure will either be ignored or break some dependency on the optimization like @nogc. With a library solution, it explicitly copies the data into the pointer so you'll get an explicit error message if it doesn't fit or has some other issue.

Something else to consider is this would cause some discrepancy with the @nogc attribute based on the platform's pointer width. By making this an optimization that you don't have to "opt-in", the developer may be unaware that their code is depending on this optimization that won't work on other platforms. Their code could become platform-dependent without them knowing. However, I suppose the counter-argument is that code that uses delegate literals with @nogc would probably we aware of this, but still something to consider.

In the end, I think that most if not all use cases would be better off using the library solution if they want this optimization. This allows the developer to "opt-in" or "opt-out" of this optimization and enables the compiler to provide error messages when the "opt-in" with incompatible usage.

Reply via email to