I got the impression that Oren wanted to add these features, but AFAIK
they were not. A distinct operation would be fairly easy.
class Distinct : AbstractAggregateOperation
{
string[] columns;
public IEnumerable<Row> Execute(IEnumerable<Row> rows)
{
columns = rows.First().Columns.ToArray();
base.Execute(rows)
}
protected override void Aggregate(Row row, Row aggregate)
{
aggregate = row;
}
public string[] GetColumnsToGroupBy()
{
return columns;
}
}
that's where I would start anyway.
"I'm using an aggregate operation, which is terribly slow" could you
elaborate? metrics, code, etc.
On May 21, 7:17 am, jalchr <[email protected]> wrote:
> hello,
> I'm joining data from several sources and I'm trying to output a
> unique row based on a number of columns, I'm using an aggregate
> operation, which is terribly slow. I wonder how you guys deal with
> this issue.
>
> I might got it wrong but I think there was a "DistinctOperation"
> check this
> linkhttp://ayende.com/Blog/archive/2007/08/04/Rhino.ETL-Status-Report--Jo...
> ...
>
> which is now abscent ...
>
> --
> You received this message because you are subscribed to the Google Groups
> "Rhino Tools Dev" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to
> [email protected].
> For more options, visit this group
> athttp://groups.google.com/group/rhino-tools-dev?hl=en.
--
You received this message because you are subscribed to the Google Groups
"Rhino Tools Dev" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to
[email protected].
For more options, visit this group at
http://groups.google.com/group/rhino-tools-dev?hl=en.