[
https://issues.apache.org/jira/browse/ARROW-16549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17538226#comment-17538226
]
Vibhatha Lakmal Abeykoon commented on ARROW-16549:
--------------------------------------------------
How about including `names` field within the aggregates? cc [~westonpace]
I have added the main change which is moving 'target` inside the `Aggregate`
object.
But the current change breaks the `R`-bindings. And looking at this specific
piece of code,
[https://github.com/apache/arrow/blob/c032290b9ea2699ce29f4fa26e6826911e13fcca/r/R/query-engine.R#L125]
I think, may be we can put the names inside the Aggregate? (optional or
user-provided).
Appreiciate your thoughts, [~npr]
> [C++] Simplify AggregateNodeOptions aggregates/targets
> ------------------------------------------------------
>
> Key: ARROW-16549
> URL: https://issues.apache.org/jira/browse/ARROW-16549
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Weston Pace
> Assignee: Vibhatha Lakmal Abeykoon
> Priority: Major
> Labels: pull-request-available
> Time Spent: 50m
> Remaining Estimate: 0h
>
> Currently AggregateNodeOptions is:
> {noformat}
> class ARROW_EXPORT AggregateNodeOptions : public ExecNodeOptions {
> public:
> // aggregations which will be applied to the targetted fields
> std::vector<internal::Aggregate> aggregates;
> // fields to which aggregations will be applied
> std::vector<FieldRef> targets;
> // output field names for aggregations
> std::vector<std::string> names;
> // keys by which aggregations will be grouped
> std::vector<FieldRef> keys;
> };
> {noformat}
> It is not very obvious how {{aggregates}} and {{targets}} are related. My
> initial read of the comments led me to think that each aggregate would be
> applied to each target and you would end up with {{len(aggregates) *
> len(targets)}} output fields. In reality the {{aggregate}} at index {{i}}
> only applies to the {{target}} at index {{i}}. It would be simpler to add a
> {{FieldRef target}} to {{internal::Aggregate}} (and {{Aggregate}} should not
> be {{internal}}).
> Alternatively, the entire {{internal::Aggregate}} could be replaced by a
> "call" {{arrow::compute::Expression}}
--
This message was sent by Atlassian Jira
(v8.20.7#820007)