I got the impression that Oren wanted to add these features, but AFAIK
they were not. A distinct operation would be fairly easy.
class Distinct : AbstractAggregateOperation
{
      string[] columns;

      public IEnumerable<Row> Execute(IEnumerable<Row> rows)
      {
            columns = rows.First().Columns.ToArray();
            base.Execute(rows)
      }

      protected override void Aggregate(Row row, Row aggregate)
      {
          aggregate = row;
      }

      public string[] GetColumnsToGroupBy()
      {
            return columns;
      }
}
that's where I would start anyway.

"I'm using an aggregate operation, which is terribly slow" could you
elaborate? metrics, code, etc.

On May 21, 7:17 am, jalchr <[email protected]> wrote:
> hello,
> I'm joining data from several sources and I'm trying to output a
> unique row based on a number of columns, I'm using an aggregate
> operation, which is terribly slow. I wonder how you guys deal with
> this issue.
>
> I might got it wrong but I think there was a "DistinctOperation"
> check this 
> linkhttp://ayende.com/Blog/archive/2007/08/04/Rhino.ETL-Status-Report--Jo...
> ...
>
> which is now abscent ...
>
> --
> You received this message because you are subscribed to the Google Groups 
> "Rhino Tools Dev" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to 
> [email protected].
> For more options, visit this group 
> athttp://groups.google.com/group/rhino-tools-dev?hl=en.

-- 
You received this message because you are subscribed to the Google Groups 
"Rhino Tools Dev" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/rhino-tools-dev?hl=en.

Reply via email to