Re: groupBy/chunkBy redux

Peter Alexander via Digitalmars-d Fri, 13 Feb 2015 15:51:23 -0800

On Friday, 13 February 2015 at 18:32:35 UTC, Andrei Alexandrescuwrote:

* Perhaps rename groupBy to chunkBy. People coming from SQL andother languages might expect groupBy to do hash-based grouping.


Agreed.

* The unary function implementation must return for each groupa tuple consisting of the key and the lazy range of values. Thebinary function implementation should continue to only returnthe lazy range of values.

Is the purpose of this just to avoid the user potentially needingto evaluate the key function twice?

* SortedRange should add a method called group(). Invoked withno predicate, group() should do what chunkBy does, using thesorting predicate.

Will need to be called something else since there may be existingcode trying to call std.algorithm.group using UFCS. This wouldchange its behaviour.

* aggregate() should detect the two kinds of results per group(well, chunk) and process them accordingly: for unary-predicatechunks, pass the key through and only process the lazy range.Meaning:
auto data = [
  tuple("John", 100),
  tuple("John", 35),
  tuple("Jane", 200),
  tuple("Jane", 87),
];
auto r = data.chunkBy!(x => x[0]).aggregate!sum;
yields a range of tuples: tuple("John", 135), tuple("Jane",187).


Not sure I understand how this is meant to work.

With your second bullet implemented, data.chunkBy!(x => x[0])will return:


tuple("John", [tuple("John", 100), tuple("John", 35)]),
tuple("Jane", [tuple("Jane", 200), tuple("Jane", 87)])

(here [...] denotes the sub-range, not an array).

So aggregate will ignore the key part, but how does it know toignore the name in sub-ranges?

Re: groupBy/chunkBy redux

Reply via email to